Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Choose evaluation example from test set #1804

Merged
merged 4 commits into from
Oct 24, 2024
Merged

Choose evaluation example from test set #1804

merged 4 commits into from
Oct 24, 2024

Conversation

rasbt
Copy link
Collaborator

@rasbt rasbt commented Oct 24, 2024

Instead of using the "Recommend what movie to watch" instruction in all finetuning scripts, we now select an example from the test set for the evaluation print during finetuning.

The default can be overridden via the following setting:

litgpt finetune ... --eval.evaluate_example "random" | "first" | 99

Fixes #796
Fixes #1443

@rasbt rasbt requested a review from lantiga as a code owner October 24, 2024 17:03
litgpt/args.py Outdated Show resolved Hide resolved
@rasbt rasbt merged commit 3ca2311 into main Oct 24, 2024
8 of 9 checks passed
@rasbt rasbt deleted the evaluate-example branch October 24, 2024 18:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

validation output during finetuning Harcoded incorrect (and repeated) validation example
1 participant