Add MMLU-Pro #3018

yifanmai · 2024-09-24T22:59:08Z

https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro

Should be similar to original MMLU: see mmlu_scenario.py for the original MMLU and air_bench_scenario.py for how to use load_dataset() with Hugging Face datasets.

Edit: Also look at simple_scenarios.py and test_simple_scenarios.py for an example of MCQA.
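A rough sketch of the data-loading half, in case it helps whoever picks this up. It is not the HELM implementation: `Choice`, `MCQAExample`, `row_to_example` and `load_examples` are hypothetical stand-ins for HELM's own instance and reference classes, and the field names (`question`, `options`, `answer_index`, `category`) and the `test` split are assumptions taken from the dataset card, so check them against the dataset before relying on this.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class Choice:
    """Hypothetical stand-in for a HELM reference: one answer option."""
    text: str
    is_correct: bool


@dataclass(frozen=True)
class MCQAExample:
    """Hypothetical stand-in for a HELM instance: one multiple-choice question."""
    question: str
    choices: List[Choice]
    category: str


def row_to_example(row: Dict[str, Any]) -> MCQAExample:
    """Convert one dataset row into an MCQA example.

    Assumes the row has `question`, `options`, `answer_index` and `category`.
    """
    answer_index = row["answer_index"]
    choices = [
        Choice(text=option, is_correct=(index == answer_index))
        for index, option in enumerate(row["options"])
    ]
    return MCQAExample(question=row["question"], choices=choices, category=row["category"])


def load_examples(subject: str, split: str = "test") -> List[MCQAExample]:
    """Load MMLU-Pro from Hugging Face and keep the rows for one subject."""
    # Imported lazily so the conversion logic above can be tested offline.
    from datasets import load_dataset

    dataset = load_dataset("TIGER-Lab/MMLU-Pro", split=split)
    return [row_to_example(row) for row in dataset if row["category"] == subject]
```

Keeping the row conversion separate from the download means it can be unit tested with a hand-written row, in the spirit of test_simple_scenarios.py. In the actual scenario the stand-in classes would be replaced by the ones mmlu_scenario.py already uses.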

Edit 2: Also see this doc.

Edit 3: To create the run spec function, take this function in lite_run_specs.py:

@run_spec_function("mmlu")
def get_mmlu_spec(subject: str, method: str = ADAPT_MULTIPLE_CHOICE_JOINT) -> RunSpec:

and modify it so that mmlu becomes mmlu-pro. You should then be able to run the scenario with helm-run.


yifanmai added good first issue Good for newcomers scenarios additions New models or scenarios labels Sep 24, 2024

yifanmai assigned yifanmai and siyagoel and unassigned yifanmai Oct 3, 2024

yifanmai mentioned this issue Oct 22, 2024

Support few-shot chain-of-thought in GPQA / MMLU #3088

Open
