-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Intermittent segfaults on Ubuntu 14.04 #14
Comments
Hello, @ipmb, sorry for taking so long to reply. Is this still an issue? Can you supply us with a bit more information such as what kind of operations and/or filters are involved? Also some important aspect of your configuration you think could help shedding a light on this? |
Yeah, I had to stop using it because of the issue and I'm no longer working on the project, but I'd love to figure it out because I'm sure I'll use thumbor again in the future. The configuration was totally stock iirc. I was using the default optimizers and filters and pulling source images from S3 over HTTP. My guess is that it would be easy to replicate with the packages I mentioned on Ubuntu 14.04. |
We used to have this problem in our facial detector. That's a broken image example: |
These weren't broken images. I could repeat the same request and it would work fine. The problem seemed related to load/concurrency. |
If the images weren't broken, tornado could be finishing the request in the middle of it and feeding a broken image to the engine. |
We might be dealing with a broken pipe or a timeout the wrong way. |
I think these problems are related: thumbor/thumbor#208 |
A small percentage (0.15%) of our requests result in the 599 timeout error and we're also using S3 to store the source images so that would make sense. |
@ipmb Do you mind testing on this new version ? https://github.com/thumbor/thumbor/releases/tag/5.0.0rc2 |
Unfortunately I'm not working on the project anymore, but that might change in the future. I'll post results when/if that happens. |
I might be affected as well, using Ubuntu 14.04 and a Syslog:
Are the exit codes 139 and 2 something designed into Thumbor? Or might they come from elsewhere? IIRC, errno = 2 is ENOENT? Why does thumbor exit in the first place, and why does it have trouble starting up following that? If I restart it a few moments later, everything is fine. What can I do to help here? |
Not sure if this is the right place to raise this, but I've been unable to get the opencv_engine running reliably due to intermittent segfaults in the cv module taking down the whole thumbor process. It's nothing related to a specific file because I can retry and it works fine. I noticed this during load testing so it may be exacerbated during high load.
It took me a while to find the issue, there was nothing relevant in the thumbor logs, but I noticed 502s and dropped/prematurely closed connections in Nginx. Here's the relevant snippet from syslog:
I tried with both the
python-opencv
package and building 2.4.10 from source, but saw the same results. Relevant Python packages:The text was updated successfully, but these errors were encountered: