Add custom task (bac-fr) for evaluation of models in french #518

mdiazmel · 2025-01-27T08:04:53Z

This PR proposes a custom task for evaluating models on bac-fr, a dataset of questions extracted from the French BAC exam.
The dataset has the following columns:

Instruction	Question	Réponse	Réponse étendue	Choix	Choix correct	Matière	Année	Sujet

We still need to define which metrics will be used and how the prompt will be formulated (@clefourrier).
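To make the prompt question more concrete, here is a minimal sketch of how one row could be turned into a prompt. It assumes the dataset is tab-separated with the header quoted above; the `build_prompt` helper, the trailing "Réponse :" cue, and the sample row are hypothetical and only serve as illustration. The actual format of the `Choix` column is not specified here, so it is treated as free text.

```python
import csv
import io

# Column names copied from the dataset header quoted in the description.
COLUMNS = [
    "Instruction", "Question", "Réponse", "Réponse étendue",
    "Choix", "Choix correct", "Matière", "Année", "Sujet",
]


def build_prompt(row: dict) -> str:
    """Join instruction, question and (if present) choices into one prompt.

    Hypothetical helper: the real prompt function may be laid out differently.
    """
    parts = [row["Instruction"].strip(), row["Question"].strip()]
    choices = (row.get("Choix") or "").strip()
    if choices:
        parts.append(choices)
    parts.append("Réponse :")  # assumed answer cue, not taken from the PR
    return "\n".join(p for p in parts if p)


# Made-up sample row, for illustration only (not real bac-fr data).
sample_values = [
    "Réponds par la lettre correcte.", "Combien font 2 + 2 ?", "4", "",
    "A. 3 B. 4 C. 5", "B", "Mathématiques", "2024", "Calcul",
]
sample_tsv = "\t".join(COLUMNS) + "\n" + "\t".join(sample_values)

rows = list(csv.DictReader(io.StringIO(sample_tsv), delimiter="\t"))
prompt = build_prompt(rows[0])
print(prompt)
```

Keeping the choices optional lets the same function serve both open and multiple-choice questions, which seems useful since the header carries both `Réponse` and `Choix correct`.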


clefourrier · 2025-01-27T09:12:42Z

community_tasks/french_evals.py

+    few_shots_split=None,
+    few_shots_select="random_sampling",
+    generation_size=1,
+    metric=[],  # To be defined


If we feel the instructions constrain the answer enough, we could go for an exact match. We could also look at @hynky1999's parser for math equations.

Is this a math benchmark?

It includes math questions

(but not math only, I'd say 1/3 to 1/2 are math questions?)

I would say 2/3 math questions!
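The exact-match idea raised in this thread could be sketched roughly as follows. This is a standalone illustration, not the metric that ends up in `french_evals.py`: the `normalize` and `exact_match` names are hypothetical, and the normalization choices (Unicode NFKC, case folding, whitespace collapsing) are assumptions. Math answers would likely still need a dedicated parser, as suggested above.

```python
import unicodedata


def normalize(text: str) -> str:
    """Normalize an answer before comparison (assumed rules, see lead-in)."""
    # NFKC makes composed and decomposed accents (e.g. "é") compare equal.
    text = unicodedata.normalize("NFKC", text).casefold()
    # Collapse runs of whitespace and trim both ends.
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> int:
    """Return 1 if the normalized strings are identical, else 0."""
    return int(normalize(prediction) == normalize(gold))


def accuracy(predictions: list[str], golds: list[str]) -> float:
    """Mean exact match over a list of prediction/gold pairs."""
    if not golds:
        return 0.0
    scores = [exact_match(p, g) for p, g in zip(predictions, golds)]
    return sum(scores) / len(scores)
```

For example, `exact_match("  B ", "b")` counts as a match, while `exact_match("A", "B")` does not.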

Co-authored-by: Clémentine Fourrier <[email protected]>

mdiazmel · 2025-01-30T13:42:34Z

Tested against the current state of the bac-fr dataset.
Ready to review! @clefourrier

Add custom task (bac-fr) for evaluation of models in french

73bc6ae

clefourrier reviewed Jan 27, 2025

mdiazmel and others added 5 commits January 27, 2025 13:52

Update prompt function for the bac-fr task

c592de7

Co-authored-by: Clémentine Fourrier <[email protected]>

Add metrics for the evaluation of bac-fr

6b4de76

Fix function name (as_list)

c791903

Fix prompt for multichoice

fa123e8

Merge branch 'main' into main

b31f854

mdiazmel marked this pull request as ready for review January 30, 2025 13:32

Merge branch 'main' into main

93a3452
