Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unreproducible crash when running remotely against AOSS. #132

Open
alexdunnjpl opened this issue Aug 26, 2024 · 0 comments
Open

Unreproducible crash when running remotely against AOSS. #132

alexdunnjpl opened this issue Aug 26, 2024 · 0 comments
Assignees
Labels
bug Something isn't working needs:triage open.1.3.0

Comments

@alexdunnjpl
Copy link
Contributor

alexdunnjpl commented Aug 26, 2024

Checked for duplicates

No - I haven't checked

🐛 Describe the bug

When running sweepers on MCP EC2 from a local IDE via remote execution, a crash was observed. The crash occurred during repairkit, targeting atm (probably by coincidence - this happened to be the first sweeper step.

The first (20k doc) page of updates completed successfully, then the next (500 doc) page of the doc fetch query would coincide with a fatal crash, exit code 1. There was no apparent error in the stdout/log, or any other indication of the cause.

The contents of the request response for the update was inexplicably printed to either stdout or the log (it's unclear which). This contents showed no errors for the 20k updates.

Disabling the tqdm calls (related note about tqdm issues) initially appeared to resolve the problem, but re-enabling it did not result in reproduction of the bug and subsequent execution was successful. The bug was reproducible ~10 times before disappearing.

No path forward unless/until it reappears, just logging the details for future reference.

🕵️ Expected behavior

I expected successful execution, or for a fatal error to produce a stack trace or other descriptive content.

📜 To Reproduce

Unreproducible

🖥 Environment Info

Apple M2 Silicon
Pycharm 2023.1.2
Remote execution on mcp-prod1

📚 Version of Software Used

No response

🩺 Test Data / Additional context

No response

🦄 Related requirements

🦄 #xyz

⚙️ Engineering Details

No response

🎉 Integration & Test

No response

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working needs:triage open.1.3.0
Projects
Status: ToDo
Development

No branches or pull requests

8 participants
@jordanpadams @pdsen-ci @alexdunnjpl and others