Skip to content

👑 Materials Science

Rank Model Fraction Correct
1 Claude-3.5 (Sonnet) 0.739
2 GPT-4o 0.682
3 Command-R+ 0.609
4 Phi-3-Medium-4k-Instruct 0.565
5 GPT-4 0.565
6 Claude-3 (Opus) 0.565
7 Llama-3-70B-Instruct 0.522
8 Galatica-120b 0.478
9 Claude-2-Zero-T 0.435
10 Claude-2 0.435
11 Gemini-Pro 0.435
12 Gemma-7b-Instruct 0.435
13 Llama-3-8B-Instruct 0.391
14 Mistral-8x7b-Instruct 0.391
15 GPT-3.5 Turbo 0.391

Leaderboard Plot

The following plot shows the leaderboard of the models based on the fraction of correctly answered questions. This fraction is calculated as the number of correct answers divided by the total number of answers. The leaderboard is sorted in descending order of the fraction correct.