feat: parallelize rows fetcher. #12590

Merged: 6 commits, Aug 28, 2023

RinChanNOWWW
Contributor

@RinChanNOWWW RinChanNOWWW commented Aug 25, 2023

I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/

Summary

  • Make some methods static so they can be called in parallel.
  • Parallelize parquet rows fetcher.
  • Parallelize native rows fetcher.
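
The idea behind these three points can be sketched roughly as follows. This is only an illustrative sketch using the standard library, not the code in this PR: `Part`, `RowsFetcher`, `fetch_from_part`, and `fetch_rows_parallel` are hypothetical names, rows are modeled as plain strings, and the per-thread `buf` merely stands in for the uncompressed buffer each worker would hold. The associated function takes no `&self`, so workers can call it without sharing a fetcher instance.

```rust
use std::collections::BTreeMap;
use std::thread;

/// Hypothetical stand-in for one stored part (e.g. one parquet or native file).
struct Part {
    rows: Vec<String>,
}

/// Hypothetical fetcher type; it carries no state in this sketch.
struct RowsFetcher;

impl RowsFetcher {
    /// Associated ("static") function: no `&self`, so any thread can call it.
    /// `buf` is a scratch buffer owned by the calling thread.
    /// Panics if a row index is out of range for the part.
    fn fetch_from_part(part: &Part, row_idxs: &[usize], buf: &mut Vec<u8>) -> Vec<String> {
        let mut out = Vec::with_capacity(row_idxs.len());
        for &i in row_idxs {
            // Pretend to "decompress" the row into the scratch buffer.
            buf.clear();
            buf.extend_from_slice(part.rows[i].as_bytes());
            out.push(String::from_utf8_lossy(buf).into_owned());
        }
        out
    }
}

/// Fetches rows identified by (part index, row index) pairs, one worker
/// thread per touched part, and returns them in the requested order.
fn fetch_rows_parallel(parts: &[Part], row_ids: &[(usize, usize)]) -> Vec<String> {
    // Group requests by part, remembering each request's output position.
    let mut by_part: BTreeMap<usize, Vec<(usize, usize)>> = BTreeMap::new();
    for (pos, &(part_id, row_idx)) in row_ids.iter().enumerate() {
        by_part.entry(part_id).or_default().push((pos, row_idx));
    }

    // Scoped threads may borrow `parts` and `by_part` without `Arc`.
    let fetched = thread::scope(|s| {
        let handles: Vec<_> = by_part
            .iter()
            .map(|(&part_id, items)| {
                let part = &parts[part_id];
                s.spawn(move || {
                    // Each thread holds its own buffer, so no locking is needed.
                    let mut buf = Vec::new();
                    let positions: Vec<usize> = items.iter().map(|&(p, _)| p).collect();
                    let idxs: Vec<usize> = items.iter().map(|&(_, r)| r).collect();
                    let rows = RowsFetcher::fetch_from_part(part, &idxs, &mut buf);
                    (positions, rows)
                })
            })
            .collect();
        handles
            .into_iter()
            .map(|h| h.join().expect("worker thread panicked"))
            .collect::<Vec<_>>()
    });

    // Scatter the fetched rows back into request order.
    let mut result = vec![String::new(); row_ids.len()];
    for (positions, rows) in fetched {
        for (pos, row) in positions.into_iter().zip(rows) {
            result[pos] = row;
        }
    }
    result
}
```

Calling `fetch_rows_parallel(&parts, &[(1, 2), (0, 0)])` would fetch row 2 of part 1 and row 0 of part 0 on separate threads and return them in that order. A real implementation would likely bound the number of workers and use asynchronous I/O instead of spawning one thread per part.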

@BohuTANG
Member

Cool. It would be great if you could share the benchmarking results if they are ready.

@RinChanNOWWW
Contributor Author

RinChanNOWWW commented Aug 28, 2023

select * from hits order by userid desc limit 1000 ignore_result;

Main: 27s
This PR: 1.7s

@RinChanNOWWW RinChanNOWWW marked this pull request as ready for review August 28, 2023 02:26
@github-actions github-actions bot added the pr-feature this PR introduces a new feature to the codebase label Aug 28, 2023
@RinChanNOWWW RinChanNOWWW added the ci-benchmark Benchmark: run all test label Aug 28, 2023
@github-actions
Contributor

Docker Image for PR

  • tag: pr-12590-a88be20

note: this image tag is only available for internal use,
please check the internal doc for more details.

@BohuTANG BohuTANG merged commit 0c5dd54 into databendlabs:main Aug 28, 2023
61 of 62 checks passed
@RinChanNOWWW RinChanNOWWW deleted the parallel-rows-fetch branch August 29, 2023 01:51
RinChanNOWWW added a commit that referenced this pull request Aug 29, 2023
andylokandy pushed a commit to andylokandy/databend that referenced this pull request Nov 27, 2023
* Parallelize parquet rows fetcher.

* Each thread holds one uncompressed buffer.

* Refactor codes.

* Parallelize native rows fetcher.

* Refactor codes.