-
Notifications
You must be signed in to change notification settings - Fork 143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parsing fails with long strings #1009
Comments
yeah, this is tricky one -- some discusson about it here: #935 and JuliaData/Parsers.jl#98 |
Ugh, this is really bad and it happens even without enormously long lines, just big files... Here's the Census 2020 ACS household data https://www2.census.gov/programs-surveys/acs/experimental/2020/data/pums/1-Year/csv_hus.zip unzip it and try to read the second large file:
You'll get one of these parse errors. Lines in this file are like a few thousand characters, not hundreds of thousands of characters. But there are 645744 lines in the file. Is there a workaround here? |
@dlakelan, it sounds to me like there might be some bad quoting in your file. The limits when you would hit this bug are:
If there was a cell, however, that started with |
FWIW I can't reproduce it on that file:
|
I also cannot reproduce this on that file on mac, same versions as the Windows test above. |
Ok, Sure enough, a fresh download and the file loads... Computers are weird. Thanks for you guys helping with this! |
Glad it's sorted! There's #935 open for the "really long strings" issue, so will close this one. |
Replication
Using this test file saved as "test.csv"
test.csv
Run the following to try to read it:
Which gives the following error:
The text was updated successfully, but these errors were encountered: