Implement an advanced fuzzy diffing feature for interdiff by kerneltoast · Pull Request #156 · twaugh/patchutils

kerneltoast · 2025-11-14T06:13:41Z

Description

This implements a --fuzzy option to make interdiff perform a fuzzy
comparison between two diffs. This is very helpful, for example, for
comparing a backport patch to its upstream source patch to assist a human
reviewer in verifying the correctness of the backport.

The fuzzy diffing process is complex and works by:

Generating a new patch file with hunks split up into smaller hunks to
separate out multiple deltas (+/- lines) in a single hunk that are spaced
apart by context lines, increasing the amount of deltas that can be
applied successfully with fuzz
Applying the rewritten p1 patch to p2's original file, and the rewritten
p2 patch to p1's original file; the original files aren't ever merged
Relocating patched hunks in only p1's original file to align with their
respective locations in the other file, based on the reported line
offset printed out by patch for each hunk it successfully applied
Squashing unline gaps fewer than max_context*2 lines between hunks in the
patched files, to hide unknown contextual information that is irrelevant
for comparing the two diffs while also improving hunk alignment between
the two patched files
Diffing the two patched files as usual
Rewriting the hunks in the diff output to exclude unlines from the
unified diff, even splitting up hunks to remove unlines present in the
middle of a hunk, while also adjusting the @@ line to compensate for the
change in line offsets
Emitting the rewritten diff output while interleaving rejected hunks from
both p1 and p2 in the output in order by line number, with a comment on
the @@ line indicating when an emitted hunk is a rejected hunk

This also involves working around some bugs in patch itself encountered
along the way, such as occasionally inaccurate line offsets printed out and
spurious fuzzing in certain cases that involve hunks with an unequal number
of pre-context and post-context lines.

The end result of all of this is a minimal set of real differences in the
context lines of each hunk between the user's provided diffs. Even when
fuzzing results in a faulty patch, the context differences are shown so
there is never a risk of any real deltas getting hidden due to fuzzing.

By default, the fuzz factor used is just the default used in patch. The
fuzz factor can be adjusted by the user via appending =N to --fuzzy to
specify the maximum number of context lines for patch to fuzz.

Testing

This was tested on several complex Linux kernel patches to compare the backported version of a patch to its original upstream version. This PR also comes with a few basic fuzzy diffing tests integrated into the test infrastructure.

It's difficult to conditionally add additional arguments to the patch execution in apply_patch() because they are placed within a compound literal array. Make the arguments more extensible by creating a local array and an index variable to place the next argument into the array. This way, it's much easier to change the number of arguments provided at runtime.

Remove the superfluous fseeks and simplify the original file creation process by moving relevant fseeks to come right after the file cursor was last modified.

Coloring the newline character results in the terminal cursor becoming colored when the final line in the interdiff is colored. Fix this by not coloring the newline character.

codecov · 2025-11-14T07:54:48Z

Codecov Report

❌ Patch coverage is 86.19001% with 141 lines in your changes missing coverage. Please review.
✅ Project coverage is 86.16%. Comparing base (487f3e8) to head (84a6fe1).

Files with missing lines	Patch %	Lines
src/interdiff.c	86.19%	141 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #156      +/-   ##
==========================================
- Coverage   86.47%   86.16%   -0.32%     
==========================================
  Files          15       15              
  Lines        8176     8924     +748     
  Branches     1643     1838     +195     
==========================================
+ Hits         7070     7689     +619     
- Misses       1106     1235     +129

Flag	Coverage Δ
unittests	`86.16% <86.19%> (-0.32%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

kerneltoast · 2025-11-14T08:00:10Z

@twaugh Please take a look at this when you can, thanks!

kerneltoast · 2025-11-14T19:05:07Z

All checks are passing now with a lot more test coverage added.

kerneltoast · 2025-11-24T19:15:29Z

@twaugh I had updated this PR with several fixes and additional tests last week, and everything is finalized beyond a shadow of a doubt at this point. Would love to land this into patchutils!

When an @@ line isn't immediately after the +++ line in patch2, the next line is checked from the top of the loop which tries to search for a +++ line again, even though the +++ was already found. This results in the +++ not being found again and thus a spurious error that patch2 is empty. Fix this by making the patch2 case loop over the next line until either an @@ is found or the patch is exhausted.

This implements a --fuzzy option to make interdiff perform a fuzzy comparison between two diffs. This is very helpful, for example, for comparing a backport patch to its upstream source patch to assist a human reviewer in verifying the correctness of the backport. The fuzzy diffing process is complex and works by: - Generating a new patch file with hunks split up into smaller hunks to separate out multiple deltas (+/- lines) in a single hunk that are spaced apart by context lines, increasing the amount of deltas that can be applied successfully with fuzz - Applying the rewritten p1 patch to p2's original file, and the rewritten p2 patch to p1's original file; the original files aren't ever merged - Relocating patched hunks in only p1's original file to align with their respective locations in the other file, based on the reported line offset printed out by `patch` for each hunk it successfully applied - Squashing unline gaps fewer than max_context*2 lines between hunks in the patched files, to hide unknown contextual information that is irrelevant for comparing the two diffs while also improving hunk alignment between the two patched files - Diffing the two patched files as usual - Rewriting the hunks in the diff output to exclude unlines from the unified diff, even splitting up hunks to remove unlines present in the middle of a hunk, while also adjusting the @@ line to compensate for the change in line offsets - Emitting the rewritten diff output while interleaving rejected hunks from both p1 and p2 in the output in order by line number, with a comment on the @@ line indicating when an emitted hunk is a rejected hunk This also involves working around some bugs in `patch` itself encountered along the way, such as occasionally inaccurate line offsets printed out and spurious fuzzing in certain cases that involve hunks with an unequal number of pre-context and post-context lines. The end result of all of this is a minimal set of real differences in the context lines of each hunk between the user's provided diffs. Even when fuzzing results in a faulty patch, the context differences are shown so there is never a risk of any real deltas getting hidden due to fuzzing. By default, the fuzz factor used is just the default used in `patch`. The fuzz factor can be adjusted by the user via appending =N to `--fuzzy` to specify the maximum number of context lines for `patch` to fuzz.

Fuzzy interdiffs would show how to go from patch2 to patch1, the opposite of how it should be. Fix it so that the output shows how to go from patch1 to patch2.

split_patch_hunks() fails to filter bogus hunks that contain real context lines mixed with unlines. Such hunks are still bogus because they don't have any delta lines. Additionally, the numbers in the @@ line become nonsense when more than one bogus hunk is filtered. Fix both of these issues affecting the bogus hunk filter logic.

So that patch never tries to prompt and ask for something when running interdiff on an interactive terminal.

Fuzzy mode output is incoherent because context differences show up as going from patch1 -> patch2, while delta differences show up in the opposite direction: patch2 -> patch1. This makes it rather impossible to tell which patch file a +/- line originates from, not to mention that the diff itself is totally nonsense in terms of the actual code changed by the patches. Bring interdiff part of the way there to fixing this issue, by eliminating delta differences from the compared files. Note that this makes fuzzy mode only do context diffing, which is fixed in the subsequent commits.

…ed one When using --fuzzy mode, interdiff splits patch hunks and applies them with fuzz to maximize successful application. The function parse_fuzzed_hunks() parses patch's output to create relocation records that track where hunks were applied and how much they need to be relocated back to their original intended positions. The bug was in how the relocation's `new` field was calculated. The code was storing: new = lnum - hunk_offs[hnum - 1] where `lnum` is the line number where patch applied the hunk, and `hunk_offs` is the offset introduced by splitting the original hunk into smaller pieces. This calculation gives the "original hunk's intended line number" before any splitting occurred. However, fuzzy_relocate_hunks() later searches for hunks in the patched file by looking for hunks where `hcurr->nstart == rcurr->new`. The hunks in the patched file are located at their *actual* applied positions (i.e., `lnum`), not at the adjusted position (`lnum - hunk_offs`). This mismatch caused the relocation logic to fail to find the target hunk, triggering the fatal "failed to relocate hunk" error. The fix changes the relocation record to store: new = lnum (actual position in patched file) off = off + split_off (total offset: patch + split offset) This ensures that: 1. Hunks can be found at their actual positions in the patched file (matching `hcurr->nstart == rcurr->new` succeeds) 2. When relocating, subtracting `off` correctly moves the hunk back to its original intended position, accounting for both the patch offset (where patch chose to apply the hunk) and the split offset (the difference introduced by hunk splitting) 3. Duplicate detection still works correctly by comparing original intended positions: `lnum - off - split_off == prev->new - prev->off` simplifies to comparing the same values as before since prev->off now also includes the split offset Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

This is a slippery slope and tricky to keep track of to ensure that the file string buffer is sufficiently large. Just use dynamic allocations.

Extract the common diff invocation pattern into a reusable run_diff() helper function. This function: - Takes options string as a parameter (caller builds it) - Builds the diff command with the options and diff_opts - Executes diff via xpipe() - Consumes the --- and +++ header lines - Returns a FILE* positioned at the first @@ line, or NULL if empty - Optionally returns the child pid for the caller to waitpid() Refactor output_delta() to use run_diff() instead of inline diff code. This simplifies the function and prepares for additional diff call sites. Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>

When comparing context differences between two patches, diff hunks can contain changes at the top or bottom edge that are exclusively additions or exclusively deletions. These are not real differences -- they are artifacts of one patch having captured more or fewer context lines than the other around the same code change. For example, if patch2 includes 5 lines of context above a change but patch1 only includes 3, the context diff will show those 2 extra lines as additions at the top of a hunk. This is misleading because the patches make the same change; they just differ in how much surrounding code was captured. Add filter_edge_hunks() to detect and handle these spurious edge lines. For each hunk, the first and last context lines partition the body into three regions: top edge, middle, and bottom edge. Each edge is then classified: - Two-sided (has both additions and deletions): a real change, kept as-is - One-sided (exclusively additions or exclusively deletions): spurious, trimmed from the hunk and the @@ header line counts adjusted If no changes remain after trimming (the entire hunk was spurious edges), the hunk is dropped. If all hunks are dropped, the CONTEXT DIFFERENCES section is suppressed entirely. Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Replace the inline getline/fseek header-skipping loop in the function fuzzy_output_list() and the fgetc loops in run_diff() with a shared skip_header_lines() helper that skips the two --- / +++ lines. Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Extract the fuzz value calculation from apply_patch() into a standalone helper so it can be reused by split_patch_hunks(). Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

When the last line in the remaining hunk content is a + line, the function ctx_lookahead() skips it without checking for end-of- content. The loop then continues with a NULL next_line pointer, crashing on dereference. Add the same end-of-content check that already exists for non-+ lines. Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The patch utility distributes fuzz across both sides of a hunk's context by computing prefix_fuzz and suffix_fuzz from the fuzz value and the number of pre/post context lines. When the two sides are unbalanced, the larger side consumes the entire fuzz budget and the smaller side gets none -- any context mismatch on the short side causes the hunk to be rejected outright. For example, a split sub-hunk with 3 lines of pre-context and 6 lines of post-context would have prefix_fuzz = fuzz + 3 - 6, which is zero (or negative, clamped to zero) at every fuzz level. Even a single mismatched pre-context line will cause the hunk to fail, despite the mismatch being well within the configured fuzz limit. In the opposite direction, when prefix exceeds suffix, the patch utility computes a negative suffix_fuzz. Unlike prefix_fuzz (which is clamped to zero), a negative suffix_fuzz restricts matching to end-of-file only, causing mid-file hunks to be rejected even when the context is otherwise identical. Both imbalances arise because split_patch_hunks() shares the gap context lines between consecutive sub-hunks: the gap appears as the post-context of the first sub-hunk and the pre-context of the second. When the gap is larger or smaller than the original hunk's leading or trailing context, one side ends up with more context than the other. Fix by: - Widening the xctx_post condition to add extra post-context whenever total prefix exceeds suffix, not just when post-context is below nctx_target. - Clamping xctx_pre propagation at the fuzz value to prevent cascading prefix inflation across sub-hunks. - Capping post-context at the pre-context count so the fuzz budget is evenly split between prefix and suffix. Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add tests fuzzy1 through fuzzy9 exercising the interdiff --fuzzy mode with various patch combinations including relocations, hunk splitting, reject handling, and real-world kernel backport diffs.

kerneltoast mentioned this pull request Nov 14, 2025

Implement a fuzzy diffing option for interdiff #147

Closed

kerneltoast added 2 commits November 13, 2025 23:03

Simplify original file creation in output_delta()

dcd431d

Remove the superfluous fseeks and simplify the original file creation process by moving relevant fseeks to come right after the file cursor was last modified.

Exclude newline character from colorized output

430bbfc

Coloring the newline character results in the terminal cursor becoming colored when the final line in the interdiff is colored. Fix this by not coloring the newline character.

kerneltoast force-pushed the sultan/new-fuzzy-algo branch 2 times, most recently from 18418f4 to 5401c9f Compare November 14, 2025 07:51

kerneltoast force-pushed the sultan/new-fuzzy-algo branch from 5401c9f to 170ca21 Compare November 14, 2025 18:58

kerneltoast force-pushed the sultan/new-fuzzy-algo branch 2 times, most recently from 01efe36 to 60a60b3 Compare November 20, 2025 08:04

kerneltoast force-pushed the sultan/new-fuzzy-algo branch from 60a60b3 to d90325c Compare February 11, 2026 03:18

kerneltoast force-pushed the sultan/new-fuzzy-algo branch from d90325c to 2a5d295 Compare February 19, 2026 02:49

kerneltoast added 2 commits February 19, 2026 10:29

kerneltoast force-pushed the sultan/new-fuzzy-algo branch from 2a5d295 to 5017f3c Compare February 19, 2026 23:06

kerneltoast and others added 12 commits February 20, 2026 00:00

interdiff: Fix incorrect base file direction in fuzzy mode

3aa81a3

Fuzzy interdiffs would show how to go from patch2 to patch1, the opposite of how it should be. Fix it so that the output shows how to go from patch1 to patch2.

interdiff: Use -N on patch all the time

c3bbb8a

So that patch never tries to prompt and ask for something when running interdiff on an interactive terminal.

interdiff: Stop modifying file strings in-place

f5ea809

This is a slippery slope and tricky to keep track of to ensure that the file string buffer is sufficiently large. Just use dynamic allocations.

interdiff: Separate delta and context differences in fuzzy mode

089f677

interdiff: Show files only in one patch under fuzzy mode headers

2b32e73

interdiff: Add get_fuzz() helper function

49b397b

Extract the fuzz value calculation from apply_patch() into a standalone helper so it can be reused by split_patch_hunks(). Assisted-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

kerneltoast added 3 commits February 20, 2026 00:00

tests: Add fuzzy mode tests for interdiff

84a6fe1

Add tests fuzzy1 through fuzzy9 exercising the interdiff --fuzzy mode with various patch combinations including relocations, hunk splitting, reject handling, and real-world kernel backport diffs.

kerneltoast force-pushed the sultan/new-fuzzy-algo branch from 5017f3c to 84a6fe1 Compare February 20, 2026 08:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement an advanced fuzzy diffing feature for interdiff#156

Implement an advanced fuzzy diffing feature for interdiff#156
kerneltoast wants to merge 20 commits intotwaugh:masterfrom
kerneltoast:sultan/new-fuzzy-algo

kerneltoast commented Nov 14, 2025

Uh oh!

codecov bot commented Nov 14, 2025 •

edited

Loading

Uh oh!

kerneltoast commented Nov 14, 2025

Uh oh!

kerneltoast commented Nov 14, 2025

Uh oh!

kerneltoast commented Nov 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

kerneltoast commented Nov 14, 2025

Description

Testing

Uh oh!

codecov bot commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

kerneltoast commented Nov 14, 2025

Uh oh!

kerneltoast commented Nov 14, 2025

Uh oh!

kerneltoast commented Nov 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

codecov bot commented Nov 14, 2025 •

edited

Loading