Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: global token cap #1135

Open
edkins opened this issue Jan 17, 2025 · 0 comments
Open

Feature request: global token cap #1135

edkins opened this issue Jan 17, 2025 · 0 comments

Comments

@edkins
Copy link

edkins commented Jan 17, 2025

It would be great when running inspect eval or inspect eval-set to be able to set a global cap on the total_tokens consumed (input tokens + output tokens). This is for budgeting reasons: sometimes an experiment will turn out to be unexpectedly expensive due to a large number of samples or tokens used per sample, and we'd like to be able to stop the expensive experiments automatically.

The ideal behaviour would be to stop all currently running task instances when the global cap is reached, and not to schedule any more, so that the total tokens used stays under budget. Any completed task instances/samples should be recorded as normal so a partial log is still generated (similar to when a run is aborted due to errors).

Being able to set a monetary budget in dollars would be an alternative. I'd consider the features roughly equivalent since cost roughly tracks tokens for a given model (a monetary budget may be more directly useful but harder to implement).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant