Skip to content

👑 Biochemistry

Rank Model Fraction Correct
1 Claude-3.5 (Sonnet) 0.8
2 GPT-4o 0.783
3 Claude-3 (Opus) 0.765
4 Llama-3-70B-Instruct 0.657
5 GPT-4 0.629
6 GPT-3.5 Turbo 0.6
7 Claude-2-Zero-T 0.6
8 Claude-2 0.6
9 Phi-3-Medium-4k-Instruct 0.557
10 Llama-3-8B-Instruct 0.557
11 Gemini-Pro 0.557
12 Command-R+ 0.529
13 Mistral-8x7b-Instruct 0.443
14 Galatica-120b 0.443
15 Gemma-7b-Instruct 0.429

Leaderboard Plot

The following plot shows the leaderboard of the models based on the fraction of correctly answered questions. This fraction is calculated as the number of correct answers divided by the total number of answers. The leaderboard is sorted in descending order of the fraction correct.