-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
parquet format support #106
Comments
Hi @weibingo, no plan currently but this would be a welcome PR. In the meantime, you would have to manually de/serialize the output of the raw |
For reading a Pandas dataframe in parquet format from HDFS, currently I use a BytesIO object to read the parquet file into a bytes buffer completely first and pass this to pandas afterwards.
If I try to pass the
I got the following error:
Is there a way to read the parquet file into Pandas directly without reading it completely to a BytesIO object first? |
Hi @mtth, is there any plan to support parquet data format?
parquet data has schema by self . so can read parquet to pandas directly, write is same .
python parquet module: fastparquet , pyarrow
The text was updated successfully, but these errors were encountered: