Performance of cursor.next() could be improved with typedarray #117
Comments
So a few things.
If you don't need any of the additional features this library has, you might find https://github.com/kylebarron/parquet-wasm to be faster.
- Buffer.slice -> Buffer.subarray (and correct test that wasn't using buffers)
- new Buffer(array) -> Buffer.from(array)
- Fix issue with `npm run serve`

Via looking into #117, as `subarray` is slightly faster in the browser shim.
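For readers unfamiliar with the two replacements listed above, here is a minimal sketch of what they look like in Node. The byte values and variable names are made up for illustration and are not taken from the library's source.

```javascript
const { Buffer } = require('buffer');

// Buffer.from(array) replaces the deprecated `new Buffer(array)` constructor.
// It copies the array contents into a freshly allocated buffer.
const page = Buffer.from([1, 2, 3, 4, 5, 6, 7, 8]);

// subarray(start, end) returns a view over the same memory; no bytes are copied.
const view = page.subarray(2, 6);

// Writing through the view is visible in the original buffer,
// which shows that the two share the underlying storage.
view[0] = 99;

console.log(view.length); // number of bytes in the view
console.log(page[2]);     // the byte written through the view
```

Because `subarray` only creates a view, it avoids a copy per slice; whether that matters in practice depends on how often the reader slices page data.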
I did some work for TPCH sf1 lineitem.parquet with specified columns: If |
Hi,
I'm trying to read a parquet file in the browser, and it seems to take a lot longer than it does in Python. Testing with the largest parquet file in this repo, test/test-files/customer.impala.parquet, in Python:
outputs:
Whereas in the browser, using this test HTML/JS:
The console outputs:
That is ~10x slower than Python.
Any ideas on how to improve browser read performance?
The bulk of the time seems to be spent reading the first row.
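One way to check that observation is to time the first `cursor.next()` call separately from the rest. The sketch below assumes only that the cursor has an async `next()` that resolves to a row, or to a falsy value when the rows run out. `timeCursor` and `makeFakeCursor` are hypothetical helpers written for this illustration, not part of the library.

```javascript
const { performance } = require('perf_hooks'); // in the browser, `performance` is a global

// Times the first next() call separately from all remaining calls.
async function timeCursor(cursor) {
  const firstStart = performance.now();
  let row = await cursor.next();
  const firstMs = performance.now() - firstStart;

  let rows = 0;
  const restStart = performance.now();
  while (row) {
    rows += 1;
    row = await cursor.next();
  }
  const restMs = performance.now() - restStart;

  return { rows, firstMs, restMs };
}

// Hypothetical stand-in cursor over an in-memory array, used only to
// exercise timeCursor without a real parquet file.
function makeFakeCursor(data) {
  let i = 0;
  return { next: async () => (i < data.length ? data[i++] : null) };
}

timeCursor(makeFakeCursor([{ id: 1 }, { id: 2 }, { id: 3 }])).then((result) => {
  console.log(result.rows, 'rows; first:', result.firstMs, 'ms; rest:', result.restMs, 'ms');
});
```

With a real reader, a large `firstMs` relative to `restMs` would suggest the cost is in decoding a whole row group up front rather than in per-row iteration.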