OpenBookQA
OpenBookQA β science questions that require combining a given core fact with broad common knowledge, mimicking an open-book exam setting.
27
Models Tested
84.0
Top Score
48.4
Average Score
Rankings
| # | Model | Score | Bar |
|---|---|---|---|
| 1 | 84.0 | ||
| 2 | 84.0 | ||
| 3 | 83.2 | ||
| 4 | 81.1 | ||
| 5 | 76.8 | ||
| 6 | 73.1 | ||
| 7 | 71.5 | ||
| 8 | 64.8 | ||
| 9 | T Falcon-180BTII | 52.3 | |
| 10 | 46.9 | ||
| 11 | 46.9 | ||
| 12 | 44.8 | ||
| 13 | 44.8 | ||
| 14 | 44.3 | ||
| 15 | 42.9 | ||
| 16 | 42.7 | ||
| 17 | T Falcon-40BTII | 42.1 | |
| 18 | 41.9 | ||
| 19 | U MPT-30Bunknown | 36.0 | |
| 20 | T Falcon-7BTII | 35.5 | |
| 21 | U MPT-7Bunknown | 35.2 | |
| 22 | 32.3 | ||
| 23 | 30.1 | ||
| 24 | U XGen-7Bunknown | 20.3 | |
| 25 | U Dolly 2.0-12bunknown | 18.9 | |
| 26 | 16.3 | ||
| 27 | 14.4 |