Deduplicate walltime perf symbols, unwind data and debug info across pids by GuillaumeLagrange · Pull Request #240 · CodSpeedHQ/codspeed

GuillaumeLagrange · 2026-02-11T14:44:04Z

Since the real deduplicator source of truth is actually the path of the elf file on disk, I settled on

{path_index}__{file_name}.unwind_data
{path_index}__{file_name}.symbols.map

it allows for the filename to be determined by the actual path, without having to either nest all the files, or have to sanitize paths to file name and end up in awkward file-name length limitations

Here's what it looks like on a 1000 iterations report of one simple program

$ ls
0__exec-harness.symbols.map          3__ld-linux-x86-64.so.2.unwind_data  7__nix-ld.symbols.map         perf.metadata
0__exec-harness.unwind_data          4__ld-linux-x86-64.so.2.symbols.map  7__nix-ld.unwind_data         perf.pipedata
1__libm.so.6.symbols.map             4__ld-linux-x86-64.so.2.unwind_data  8__libc.so.6.symbols.map      results
1__libm.so.6.unwind_data             5__libgcc_s.so.1.symbols.map         8__libc.so.6.unwind_data      runner.log
2__graph-benchmark.symbols.map       5__libgcc_s.so.1.unwind_data         9__libgcc_s.so.1.symbols.map
2__graph-benchmark.unwind_data       6__libc.so.6.symbols.map             9__libgcc_s.so.1.unwind_data
3__ld-linux-x86-64.so.2.symbols.map  6__libc.so.6.unwind_data             ExecutionTimestamps.msgpack

codspeed-hq · 2026-02-11T14:48:06Z

Merging this PR will not alter performance

✅ 4 untouched benchmarks

_{Comparing cod-2138-deduplicate-perf-maps-and-unwind_data (9a7f9bc) with main (ad6011b)}

not-matthias

I think the struct naming can be clarified a bit, I was struggling quite a lot to understand how they map together. Maybe also using custom types for the key that's used in the hashmaps.

The overall logic looks good! Next round should be quick

crates/runner-shared/src/metadata.rs

...apshots/codspeed_runner__executor__wall_time__perf__unwind_data__tests__cpp_unwind_data.snap

src/executor/wall_time/perf/parse_perf_file.rs

src/executor/wall_time/perf/perf_map.rs

src/bin/compare_walltime_output.rs

build.rs

This fixes a regression introduced in 8b37208. We now filter pids using `bench_pids`, except for `exec-harness` integrations, where we take all pids.

- Store pid-agnostic data in a file or json map under a mapped `path_key` for each elf - For each pid, store pid specific data, mostly the computed load_bias from where each module was loaded into memory at runtime, alongside a key to retrieve the pid-agnostic data This way, we only write to disk relevant parts of the information.

Also add a rebuild trigger to make it easier to run GITHUB_ACTIONS=1 cargo test` locally. We could have a better trigger, but this will do for now.

not-matthias

LGTM overall. Just a few minor stylistic comments.

not-matthias · 2026-02-13T19:40:10Z

src/executor/wall_time/perf/mod.rs

 pub mod unwind_data;

-const PERF_METADATA_CURRENT_VERSION: u64 = 1;
+const PERF_METADATA_CURRENT_VERSION: u64 = 3;


Didn't we skip 2 here? 😄

not-matthias · 2026-02-13T19:42:43Z

src/executor/wall_time/perf/parse_perf_file.rs

+use std::path::PathBuf;
+
+#[derive(Default)]
+pub struct MountedModule {


Not sure if "mounted" is the right name for this. Wouldn't it be more like MappedModule?

not-matthias · 2026-02-13T19:42:52Z

src/executor/wall_time/perf/parse_perf_file.rs

+    /// Unwind data extracted from the mapped ELF file
+    pub unwind_data: Option<UnwindData>,
+    /// Per-process mounting information
+    pub process_mounted_module: HashMap<pid_t, ProcessMountedModule>,


not-matthias · 2026-02-13T19:47:51Z

src/executor/wall_time/perf/unwind_data.rs

+        let load_bias =
+            assert_and_get_load_bias(start_addr, end_addr, file_offset, MODULE_PATH, 0x0);


Don't have to change it, but we could instead also define a variable load_bias that is passed as the last parameter to assert_load_bias, which we then can also use below in the unwind_data_from_elf

Feels a bit cleaner than having a assert_and_get function

GuillaumeLagrange changed the title ~~Cod 2138 deduplicate perf maps and unwind data~~ Deduplicate walltime perf symbols, unwind data and debug info across pids Feb 11, 2026

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch from 99e0ea4 to 1a32faa Compare February 11, 2026 14:45

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch 2 times, most recently from 15316f4 to 9305a03 Compare February 12, 2026 03:37

GuillaumeLagrange marked this pull request as ready for review February 12, 2026 08:49

GuillaumeLagrange requested review from art049 and not-matthias February 12, 2026 08:49

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch from a3617af to 7f65690 Compare February 12, 2026 11:14

not-matthias requested changes Feb 12, 2026

View reviewed changes

GuillaumeLagrange removed the request for review from art049 February 13, 2026 11:04

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch 3 times, most recently from 8c864c5 to 692a99b Compare February 13, 2026 16:33

feat: use bench_pids filters when harvesting symbols from perf file

5cad97d

This fixes a regression introduced in 8b37208. We now filter pids using `bench_pids`, except for `exec-harness` integrations, where we take all pids.

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch 2 times, most recently from 192d2ce to 14cb24f Compare February 13, 2026 16:45

GuillaumeLagrange requested a review from not-matthias February 13, 2026 16:46

GuillaumeLagrange added 2 commits February 13, 2026 17:55

feat: skip tests requiring sudo if GITHUB_ACTIONS is not set

9a7f9bc

Also add a rebuild trigger to make it easier to run GITHUB_ACTIONS=1 cargo test` locally. We could have a better trigger, but this will do for now.

GuillaumeLagrange force-pushed the cod-2138-deduplicate-perf-maps-and-unwind_data branch from 14cb24f to 9a7f9bc Compare February 13, 2026 16:55

GuillaumeLagrange requested a review from art049 February 13, 2026 17:29

not-matthias approved these changes Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deduplicate walltime perf symbols, unwind data and debug info across pids#240

Deduplicate walltime perf symbols, unwind data and debug info across pids#240
GuillaumeLagrange wants to merge 3 commits intomainfrom
cod-2138-deduplicate-perf-maps-and-unwind_data

GuillaumeLagrange commented Feb 11, 2026 •

edited

Loading

Uh oh!

codspeed-hq bot commented Feb 11, 2026 •

edited

Loading

Uh oh!

not-matthias left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

not-matthias left a comment

Uh oh!

not-matthias Feb 13, 2026

Uh oh!

not-matthias Feb 13, 2026

Uh oh!

not-matthias Feb 13, 2026

Uh oh!

not-matthias Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		let load_bias =
		assert_and_get_load_bias(start_addr, end_addr, file_offset, MODULE_PATH, 0x0);

Conversation

GuillaumeLagrange commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codspeed-hq bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Uh oh!

not-matthias left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

not-matthias left a comment

Choose a reason for hiding this comment

Uh oh!

not-matthias Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

not-matthias Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

not-matthias Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

not-matthias Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

GuillaumeLagrange commented Feb 11, 2026 •

edited

Loading

codspeed-hq bot commented Feb 11, 2026 •

edited

Loading