-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make sure a truncated file always uses EOFError
#228
Comments
Actually, there's more to it: since generator is used to iterate over the file, the iteration would stop if a file is corrupted (i.e. not truncated, but some record in the middle is erroneous). I'm not 100% sure whether it may happen, that one record in simtel file got corrupted, but the next is not, but in such case the iteration will stop and the remainder of the file won't be processed. p.s. in current implementation it is also not possible to catch an exception like this:
due to the fact that exception will be raised in generator. |
This is not really possible, as simtel array files are a streaming data format. If something is broken, everything after that point will be garbage / impossible to judge if correct. You won't be able to find back to something working.
That's how iterators work. If you want to be safe against that, do it like this:
|
Are there no "sync markers"? Some event-wise binary formats allow re-syncing after corrupt data by seeking to the next start-of-event marker and re-starting the stream. Though I'm less worried about that for simulations, and more for observations... |
Yes, there are. But we never encountered such a case of a "broken in the middle" file. It was always until now truncated files where processing stopped due to some error |
Yeah, I think for simulations it's not worth worrying about as the files are smallish and can be re-generated. I will however propose a requirement for DL0... |
Right now, a truncated file can raise many different types of exceptions, e.g.
ValueError
whennp.frombuffer
gets a too short buffer.We should make sure that the same root cause always results in the same exception.
The text was updated successfully, but these errors were encountered: