BUG qsv diff produces different results for the same command #2443

Open
datatraveller1 opened this issue Jan 14, 2025 · 1 comment

datatraveller1 commented Jan 14, 2025

This is an interesting issue. Have you noticed that successive invocations of the same command with qsv diff give different results? The results are usually correct, but sometimes wrong.

qsv diff --key=art_no a.csv b.csv -o diff.csv

If you run this command two or more times, diff.csv often (but not always) ends up with different content from one run to the next.

I have not yet succeeded in creating a small test file without confidential data, but perhaps you can already do something with this information.
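Until a shareable test file exists, one way to quantify how often the output changes is to run the exact command above repeatedly and count the distinct outputs. The following is only a minimal sketch: the helper names `run_qsv_diff` and `tally_outputs` are hypothetical, and it assumes `qsv` is on the PATH and that a.csv and b.csv are in the current directory.

```python
import hashlib
import subprocess
from collections import Counter
from pathlib import Path
from typing import Callable


def run_qsv_diff() -> bytes:
    """Run the command from this report and return the bytes written to diff.csv.

    Assumes qsv is installed and a.csv / b.csv exist in the working directory.
    """
    subprocess.run(
        ["qsv", "diff", "--key=art_no", "a.csv", "b.csv", "-o", "diff.csv"],
        check=True,
    )
    return Path("diff.csv").read_bytes()


def tally_outputs(produce: Callable[[], bytes], runs: int = 20) -> Counter:
    """Call `produce` `runs` times and count each distinct output by SHA-256 digest."""
    return Counter(hashlib.sha256(produce()).hexdigest() for _ in range(runs))


# Usage (not executed here, because it needs qsv and the input files):
#   for digest, count in tally_outputs(run_qsv_diff).most_common():
#       print(count, digest)
```

A single digest in the tally means every run produced byte-identical output; more than one digest reproduces the behavior described here and shows roughly how frequent each variant is, without exposing any of the confidential data.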

@jqnatividad
Collaborator

Thanks for the heads-up, @datatraveller1. I know diff uses parallelized hashing, which may account for this non-deterministic behavior.

Copying in @janriemer.

He contributed the diff command, which takes advantage of his csv-diff crate...

Hopefully, he can quickly identify the reason why diff sometimes produces different results...
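To help narrow this down, it may be worth checking whether two differing diff.csv files contain the same rows in a different order, or genuinely different rows. This is only a sketch under an unconfirmed assumption: nothing in this thread establishes that row order is what varies, and the function names `row_multiset` and `same_rows_ignoring_order` are hypothetical.

```python
import csv
import io
from collections import Counter


def row_multiset(csv_text: str) -> Counter:
    """Count how many times each row (as a tuple of fields) occurs in the CSV text."""
    return Counter(tuple(row) for row in csv.reader(io.StringIO(csv_text)))


def same_rows_ignoring_order(csv_a: str, csv_b: str) -> bool:
    """True if both CSV texts contain exactly the same rows, regardless of order."""
    return row_multiset(csv_a) == row_multiset(csv_b)


# Usage with two saved outputs (file names are placeholders):
#   a = open("diff_run1.csv", newline="").read()
#   b = open("diff_run2.csv", newline="").read()
#   print(same_rows_ignoring_order(a, b))
```

If this returns True for two outputs that differ byte-for-byte, the variation is limited to row ordering; if it returns False, the set of reported differences itself is changing between runs, which would be the more serious case.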
