Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replicate completion response with status=processing does not retry #7900

Open
bjhargrave opened this issue Jan 21, 2025 · 0 comments · May be fixed by #7901
Open

Replicate completion response with status=processing does not retry #7900

bjhargrave opened this issue Jan 21, 2025 · 0 comments · May be fixed by #7901

Comments

@bjhargrave
Copy link

Despite the comment which says the result may be status=processing and being in a retry loop, the code never checks for processing and assumes the first response is successful. Which results in the transform_response method raising an exception.

The code for completions should retry the request when the response has status=processing.

PR to follow.

bjhargrave added a commit to bjhargrave/litellm that referenced this issue Jan 21, 2025
We use the DEFAULT_REPLICATE_ constants for retry count and initial
delay. If the completion response returns status=processing, we
loop to retry.

Fixes BerriAI#7900

Signed-off-by: BJ Hargrave <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant