Extraction Performance
Rank | Model | Score |
---|---|---|
1 | Claude-3.5-Sonnet | 0.686 |
2 | GPT-4o | 0.536 |
3 | Gemini-1.5-Pro | 0.454 |
4 | Llama 3.2 90B Vision | 0.300 |
5 | Baseline | 0.278 |
Sub-Task Performance
Performance across individual sub-tasks in this domain.
Rank | Model | Score |
---|---|---|
1 | Claude-3.5-Sonnet | 0.686 |
2 | GPT-4o | 0.536 |
3 | Gemini-1.5-Pro | 0.454 |
4 | Llama 3.2 90B Vision | 0.300 |
5 | Baseline | 0.278 |