LAMBADA
LAMBADA β measures the ability to predict the final word of a passage, requiring broad contextual understanding across long text spans.
16
Models Tested
79.8
Top Score
74.5
Average Score
Rankings
| # | Model | Score | Bar |
|---|---|---|---|
| 1 | T Falcon-180BTII | 79.8 | |
| 2 | 78.9 | ||
| 3 | 77.7 | ||
| 4 | T Falcon-40BTII | 77.3 | |
| 5 | 77.2 | ||
| 6 | 76.5 | ||
| 7 | 75.2 | ||
| 8 | T Falcon-7BTII | 74.9 | |
| 9 | U Baichuan2-13Bunknown | 74.0 | |
| 10 | U Baichuan 2-7Bunknown | 73.3 | |
| 11 | 73.3 | ||
| 12 | 73.3 | ||
| 13 | U Stable Beluga 2unknown | 71.3 | |
| 14 | 71.1 | ||
| 15 | U MPT-7Bunknown | 70.0 | |
| 16 | 67.9 |