
# Ichigo v0.5 Benchmark Results #164

**Open** · bachvudinh opened this issue Jan 21, 2025 · 1 comment
bachvudinh commented Jan 21, 2025

- **Instruction-Following Evaluation**

| Model Name | MMLU | MMLU Pro | VMLU | Alpaca (GPT-4 judge) | OpenHermes (GPT-4 judge) | ASR (WER) |
| --- | --- | --- | --- | --- | --- | --- |
| meta-llama3.1-8B-instruct | 69.40 | - | 50.69 | - | - | - |
| Ichigo v0.5 checkpoint 4000 | 60.61 | - | - | - | - | - |
| Ichigo v0.5 end epoch | 62.27 | - | 43.22 | 2.93 | 3.28 | - |
| Ichigo-v0.4 | 64.66 | - | - | 3.5 | 3.52 | - |
- **Voice Bench**

| Rank | Model | AlpacaEval | CommonEval | SD-QA | MMSU | OpenBookQA | IFEval | AdvBench | Overall |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | Ichigo-v0.4 | 3.79 | 3.17 | 36.53 | 25.63 | 26.59 | 21.59 | 57.50 | 43.86 |
| 2 | Ichigo-v0.5 | 3.86 | 2.51 | 35 | - | 26.15 | - | 62.88 | - |
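For context on the ASR (WER) column above (left unfilled in this run): word error rate is conventionally computed as word-level edit distance divided by reference length. A minimal sketch, assuming the standard Levenshtein formulation; the function name and example strings below are illustrative, not taken from this benchmark:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # Dynamic-programming table for Levenshtein distance over words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # deleting all reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # inserting all hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
    return d[len(ref)][len(hyp)] / len(ref)

# One dropped word out of a six-word reference → WER of 1/6.
print(wer("the cat sat on the mat", "the cat sat on mat"))
```

In practice, ASR evaluations usually normalize text (lowercasing, stripping punctuation) before scoring, which this sketch omits.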
@github-project-automation github-project-automation bot moved this to Investigating in Menlo Jan 21, 2025
hahuyhoang411 (Contributor) commented:

Can we also add the original scores of Ichigo v0.4 for a better comparison?

@bachvudinh bachvudinh self-assigned this Jan 23, 2025
@Yip-Jia-Qi Yip-Jia-Qi added this to the publication milestone Feb 6, 2025