Add convenience feature for word diff #1

pascalkuthe · 2022-10-26T16:55:07Z

Performing a word diff over a full file can be fairly slow on large files.
A better approach is to perform a line diff first and and then perform the word diff on the found changes.
While this is already possible with imara-diff is requires quite a bit of legwork and can be tricky to get right.
It would be nice if this could be included in the library directly.
This has multiple steps for an implementation:

Determine the output format. A different trait or force collecting into a Vec?
Implement a TokenSource for words
Implement a Sink that automatically computes a word diff
Potentially implement a heuristic to detect and ignore

The diff algorithm in git only operates on lines. It is worth looking into what exactly they use to produce a colored word diff from the line diff.
Perhaps a different algorithm is a better fit?

The text was updated successfully, but these errors were encountered:

jlama · 2023-09-28T16:30:46Z

FYI git does word diffing by feeding the same algorithm with one word per line.

pascalkuthe added the enhancement New feature or request label Oct 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add convenience feature for word diff #1

Add convenience feature for word diff #1

pascalkuthe commented Oct 26, 2022

jlama commented Sep 28, 2023

Add convenience feature for word diff #1

Add convenience feature for word diff #1

Comments

pascalkuthe commented Oct 26, 2022

jlama commented Sep 28, 2023