feat(prof): use a trampoline for FLF functions to intercept timings by bwoebi · Pull Request #3595 · DataDog/dd-trace-php

bwoebi · 2026-01-21T17:14:21Z

Frameless functions (FLF) do not check the EG(vm_interrupt) flag like it would happen usually on internal function returns. This means that from the point of view of the profiler, those functions do not exist, as the engine never checks the interrupt flag and as such never calls the profilers interrupt handler.

This PR adds a trampoline for aarch64 and x84_64 to all known frameless functions and a tail call to check the EG(vm_interrupt) flag and in case it is raised, call the interrupt handler.

One (acceptable) trade-off is that using these trampolines, libunwind in the crash tracker is not able to unwind passed the trampoline. Anyway, the impact is minimal:

Stack frames from the crash site up to the trampoline are still captured
Only frames above the trampoline (from caller to main) are lost
For most crashes, the relevant context is near the crash site, not at the program entry point

The profiler itself is unaffected of this, as we are unwinding the VM stack, following the execute_data linked list.

https://datadoghq.atlassian.net/browse/PROF-12085

datadog-datadog-prod-us1 · 2026-01-21T17:27:12Z

✅ Tests

🎉 All green!

❄️ No new flaky tests detected
🧪 All tests passed

🎯 Code Coverage (details)
• Patch Coverage: 100.00%
• Overall Coverage: 60.68% (+0.03%)

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: ce9eaa3 | Docs | Datadog PR Page | Was this helpful? React with 👍/👎 or give us feedback!}

profiling/src/wall_time.rs

codecov-commenter · 2026-01-21T17:30:42Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 67.48%. Comparing base (f1af9ca) to head (ce9eaa3).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3595      +/-   ##
==========================================
- Coverage   68.79%   67.48%   -1.31%     
==========================================
  Files         166      166              
  Lines       19015    19015              
  Branches     1792     1792              
==========================================
- Hits        13081    12832     -249     
- Misses       5121     5373     +252     
+ Partials      813      810       -3

Flag	Coverage Δ
helper-rust-integration	`69.53% <ø> (-9.30%)`	⬇️
helper-rust-unit	`49.36% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.
see 15 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f1af9ca...ce9eaa3. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

morrisonlevi

I assume this can run afoul of permissions somehow, since we're making executable code at runtime? I'm not well versed here, and yes, if the JIT is enabled you'd have to have that capability anyway, but I'm trying to understand implications of this WIP.

pr-commenter · 2026-01-21T17:35:54Z

Benchmarks [ profiler ]

Benchmark execution time: 2026-03-27 18:32:25

Comparing candidate commit ce9eaa3 in PR branch bob/prof-flf-test with baseline commit f1af9ca in branch master.

Found 0 performance improvements and 2 performance regressions! Performance is the same for 26 metrics, 8 unstable metrics.

scenario:php-profiler-timeline-memory-control

🟥 cpu_user_time [+32.466ms; +37.703ms] or [+5.345%; +6.208%]
🟥 execution_time [+36.226ms; +41.481ms] or [+5.720%; +6.550%]

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi · 2026-01-21T18:07:22Z

@morrisonlevi dynasmrt takes care of setting RX permissions after compiling the code. As long as you're not running this under a hardened runtime (like app store apps or android (?)), there's no fundamental problem. Also I currently call assembler.finalize().unwrap() (which takes care of settign RX) - proper usage should just handle the potential error here and abort.

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

cataphract · 2026-01-21T18:47:50Z

profiling/src/wall_time.rs

+            dynasm!(assembler
+                ; mov rax, QWORD original as i64
+                ; call rax
+                ; mov rax, QWORD interrupt_addr as i64


I assume original is not returning anything, because you're writing over RAX.

cataphract · 2026-01-21T18:52:10Z

profiling/src/wall_time.rs

+            #[cfg(target_arch = "aarch64")]
+            dynasm!(assembler
+                ; mov x16, original as u64
+                ; blr x16


this overwrites x30/lr, aren't you gonna lose the original return location of the handler? so when br x16 returns, it goes back to calling interrupt_addr

Ah, right. call on x86_64 pushes %eip to the stack, but blr doesn't.

Fixed up, is it correct now?

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

profiling/src/wall_time.rs

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

realFlowControl

Thanks @bwoebi this is awesome!

profiling/src/wall_time.rs

bwoebi force-pushed the bob/prof-flf-test branch from 1976c3c to 222e3a4 Compare January 21, 2026 17:19

morrisonlevi reviewed Jan 21, 2026

View reviewed changes

profiling/src/wall_time.rs Outdated Show resolved Hide resolved

morrisonlevi reviewed Jan 21, 2026

View reviewed changes

Test flf trampoline

0eb96ee

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from 222e3a4 to 0eb96ee Compare January 21, 2026 17:52

Use EG(current_execute_data)

c6bdefc

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from dc825a7 to c6bdefc Compare January 21, 2026 18:06

bwoebi force-pushed the bob/prof-flf-test branch from 95f422a to 7471599 Compare January 21, 2026 18:17

Remove redundant ret

4358ab8

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from 7471599 to 4358ab8 Compare January 21, 2026 18:24

cataphract reviewed Jan 21, 2026

View reviewed changes

Batch allocate all trampolines

df41452

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

cataphract reviewed Jan 21, 2026

View reviewed changes

profiling/src/wall_time.rs Show resolved Hide resolved

bwoebi force-pushed the bob/prof-flf-test branch from a3c79dc to 078acef Compare January 21, 2026 19:34

Batch allocate infos as well for easier cleanup

60c7da9

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch 2 times, most recently from e4930fe to 2604b22 Compare January 21, 2026 19:35

Store aarch64 link register; and fix stack align on x86_64

6307860

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from 2604b22 to 6307860 Compare January 21, 2026 19:38

Avoid updating infos multiple times

bcebfb0

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from a246940 to bcebfb0 Compare January 21, 2026 20:37

bwoebi added 3 commits January 21, 2026 22:24

Fix aarch64 asm with immediates

a407712

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

Fix flf functions with multiple handlers

b9fc370

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

Resolve TODOs

d9d3f43

Signed-off-by: Bob Weinand <bob.weinand@datadoghq.com>

bwoebi force-pushed the bob/prof-flf-test branch from b0995bc to d9d3f43 Compare January 22, 2026 13:45

Merge branch 'master' into bob/prof-flf-test

ee8a3b2

log error instead of unwrap()

2f81b3a

realFlowControl force-pushed the bob/prof-flf-test branch from ab68993 to 5513636 Compare February 16, 2026 15:45

realFlowControl changed the title ~~WIP: use a trampoline for FLF functions to intercept timings~~ feat(prof): use a trampoline for FLF functions to intercept timings Feb 16, 2026

realFlowControl force-pushed the bob/prof-flf-test branch from b0d9759 to b2149b6 Compare February 16, 2026 16:01

realFlowControl added 3 commits February 16, 2026 17:21

release borrow as soon as possible

628969a

make clippy happy

d59c715

Merge branch 'master' into bob/prof-flf-test

655fb2d

realFlowControl force-pushed the bob/prof-flf-test branch from 514102d to 655fb2d Compare February 16, 2026 16:26

fix tests

0dd28b4

realFlowControl marked this pull request as ready for review February 16, 2026 17:17

realFlowControl requested review from a team as code owners February 16, 2026 17:17

realFlowControl approved these changes Feb 16, 2026

View reviewed changes

Merge branch 'master' into levi/prof-flf-test

70b7fa1

morrisonlevi reviewed Mar 20, 2026

View reviewed changes

profiling/src/wall_time.rs Show resolved Hide resolved

profiling/src/wall_time.rs Show resolved Hide resolved

morrisonlevi added 4 commits March 20, 2026 08:50

build: update Cargo.lock after merge

2f62501

Merge remote-tracking branch 'origin/master' into bob/prof-flf-test

feff322

fix: potential misalignment on aarch64

12fc69b

Merge branch 'master' into bob/prof-flf-test

ce9eaa3

Conversation

bwoebi commented Jan 21, 2026 • edited by realFlowControl Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datadog-datadog-prod-us1 bot commented Jan 21, 2026 • edited by datadog-official bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

morrisonlevi left a comment

Choose a reason for hiding this comment

Uh oh!

pr-commenter bot commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks [ profiler ]

scenario:php-profiler-timeline-memory-control

Uh oh!

bwoebi commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cataphract Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

bwoebi Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

cataphract Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

bwoebi Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

bwoebi Jan 21, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

realFlowControl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

bwoebi commented Jan 21, 2026 •

edited by realFlowControl

Loading

datadog-datadog-prod-us1 bot commented Jan 21, 2026 •

edited by datadog-official bot

Loading

codecov-commenter commented Jan 21, 2026 •

edited

Loading

pr-commenter bot commented Jan 21, 2026 •

edited

Loading

bwoebi commented Jan 21, 2026 •

edited

Loading