TriviaQA
TriviaQA β reading comprehension benchmark with trivia questions, requiring models to find and reason over evidence from provided documents.
31
Models Tested
87.6
Top Score
75.4
Average Score
Rankings
| # | Model | Score | Bar |
|---|---|---|---|
| 1 | 87.6 | ||
| 2 | 87.5 | ||
| 3 | 86.0 | ||
| 4 | 84.8 | ||
| 5 | 84.8 | ||
| 6 | 84.8 | ||
| 7 | 84.8 | ||
| 8 | 84.6 | ||
| 9 | 83.8 | ||
| 10 | 82.9 | ||
| 11 | 82.7 | ||
| 12 | 82.2 | ||
| 13 | 80.0 | ||
| 14 | T Falcon-40BTII | 79.9 | |
| 15 | 79.6 | ||
| 16 | 78.9 | ||
| 17 | 77.9 | ||
| 18 | 75.2 | ||
| 19 | 73.9 | ||
| 20 | 73.7 | ||
| 21 | U MPT-30Bunknown | 73.6 | |
| 22 | 72.3 | ||
| 23 | 71.9 | ||
| 24 | 71.0 | ||
| 25 | 67.7 | ||
| 26 | T Falcon-7BTII | 64.6 | |
| 27 | 64.0 | ||
| 28 | U MPT-7Bunknown | 61.6 | |
| 29 | 58.1 | ||
| 30 | 53.2 | ||
| 31 | 45.2 |