Improve parquet reader very-long string performance #17773
Conversation
Could you post the impact of the change on the benchmarks? Not required to merge IMO, but it's nice to keep such results available long-term.
Looks good. I really like the simplification in calc_threads_per_string_log2.
Done
This reverts commit f1542d3.
@pmattione-nvidia I cancelled the most recent workflow to free up resources and unblock all of cudf CI for this PR: #17771. I'll rerun once #17771 is merged.
/merge
The previous strings PR significantly reduced parquet reader performance for very long strings (lengths of ~1024 and longer). This PR fixes the regression by capping each memcpy at 8 bytes at a time (the length that yielded the best performance). In addition, up to all of the threads in a block can now cooperate on the same string, rather than being limited to the threads of a single warp.
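To illustrate the idea, here is a minimal standalone CUDA sketch, not the actual cudf kernel: every thread in the block strides across one long string and copies at most 8 bytes per memcpy. The kernel name, constant, and launch configuration below are assumptions for illustration only.

```cuda
// Illustrative sketch of block-wide copying of one long string, with each
// per-thread memcpy capped at 8 bytes. Not the cudf implementation.
#include <cuda_runtime.h>
#include <cstdio>

constexpr int MAX_COPY_BYTES = 8;  // cap each per-thread memcpy at 8 bytes

__global__ void copy_long_string(char* dst, const char* src, size_t len)
{
  // All threads in the block stride across the string in 8-byte chunks.
  size_t const start  = static_cast<size_t>(threadIdx.x) * MAX_COPY_BYTES;
  size_t const stride = static_cast<size_t>(blockDim.x) * MAX_COPY_BYTES;
  for (size_t pos = start; pos < len; pos += stride) {
    size_t const remaining = len - pos;
    size_t const n = remaining < MAX_COPY_BYTES ? remaining : MAX_COPY_BYTES;
    memcpy(dst + pos, src + pos, n);  // device-side memcpy of at most 8 bytes
  }
}

int main()
{
  constexpr size_t len = 64 * 1024;  // a "very long" string
  char *d_src, *d_dst;
  cudaMalloc(&d_src, len);
  cudaMalloc(&d_dst, len);
  cudaMemset(d_src, 'a', len);

  copy_long_string<<<1, 256>>>(d_dst, d_src, len);  // one block handles one string
  cudaDeviceSynchronize();

  char host[16] = {};
  cudaMemcpy(host, d_dst, 8, cudaMemcpyDeviceToHost);
  printf("first bytes: %.8s\n", host);

  cudaFree(d_src);
  cudaFree(d_dst);
  return 0;
}
```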
PERFORMANCE:
- Short strings: unchanged
- Length 1024: 25% faster
- Longer lengths (up to 64k): up to 90% faster, the same as before the strings PR
Checklist