Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add MMLU-Pro #3018

Open
yifanmai opened this issue Sep 24, 2024 · 0 comments
Open

Add MMLU-Pro #3018

yifanmai opened this issue Sep 24, 2024 · 0 comments
Assignees
Labels
additions New models or scenarios good first issue Good for newcomers scenarios

Comments

@yifanmai
Copy link
Collaborator

yifanmai commented Sep 24, 2024

https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro

Should be similar to original MMLU: see mmlu_scenario.py for the original MMLU and air_bench_scenario.py for how to use load_dataset() with Hugging Face datasets.

Edit: Also look at simple_scenarios.py and test_simple_scenarios.py for an example of MCQA.

Edit 2: Also see this doc.

Edit 3: To create the run spec function, take this function in lite_run_specs.py:

@run_spec_function("mmlu")
def get_mmlu_spec(subject: str, method: str = ADAPT_MULTIPLE_CHOICE_JOINT) -> RunSpec:

and modify it so mmlu becomes mmlu-pro, then you should be able to do helm-run.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
additions New models or scenarios good first issue Good for newcomers scenarios
Projects
None yet
Development

No branches or pull requests

2 participants