VideoMME
VideoMME is a multimodal benchmark that tests video understanding across diverse domains, requiring temporal reasoning and cross-frame comprehension.
Models Tested: 11
Top Score: 66.7
Average Score: 59.7
Rankings
| # | Model | Score |
|---|---|---|
| 1 | | 66.7 |
| 2 | | 64.7 |
| 3 | | 62.5 |
| 4 | | 62.5 |
| 5 | | 62.5 |
| 6 | | 62.5 |
| 7 | | 62.5 |
| 8 | | 60.4 |
| 9 | | 53.1 |
| 10 | | 53.1 |
| 11 | | 46.7 |
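
The summary figures at the top of the page can be re-derived from the scores listed in the Rankings table. The snippet below is a minimal sketch of that arithmetic; the `summarize` helper and its dictionary keys are hypothetical names chosen for illustration and are not part of any VideoMME tooling. It assumes the average is a plain arithmetic mean rounded to one decimal place.

```python
# Scores copied from the Rankings table, in rank order.
scores = [66.7, 64.7, 62.5, 62.5, 62.5, 62.5, 62.5, 60.4, 53.1, 53.1, 46.7]


def summarize(values):
    """Return the count, maximum, and mean (rounded to 1 decimal) of the scores.

    Hypothetical helper for illustration only.
    """
    return {
        "models_tested": len(values),
        "top_score": max(values),
        "average_score": round(sum(values) / len(values), 1),
    }


summary = summarize(scores)
print(summary)
```

Under these assumptions the computed count, top score, and average match the figures reported above (11, 66.7, and 59.7).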