Multimodal
Vision understanding, image analysis, and cross-modal reasoning: processing both text and visual inputs.
Models Ranked: 11
Top Score: 66.7
Average Score: 59.7
Benchmarks: 1
Benchmarks in This Skill
Rankings
| # | Model | Avg Score |
|---|---|---|
| 1 | | 66.7 |
| 2 | | 64.7 |
| 3 | | 62.5 |
| 4 | | 62.5 |
| 5 | | 62.5 |
| 6 | | 62.5 |
| 7 | | 62.5 |
| 8 | | 60.4 |
| 9 | | 53.1 |
| 10 | | 53.1 |
| 11 | | 46.7 |
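
The summary figures at the top of the page can be reproduced from the ranking rows. The sketch below assumes that Top Score is the maximum of the listed Avg Score values and that Average Score is their unweighted mean rounded to one decimal place; the page does not state how either figure is computed, and the variable names are illustrative only.

```python
# Avg Score values copied from the Rankings table, in rank order.
avg_scores = [66.7, 64.7, 62.5, 62.5, 62.5, 62.5, 62.5, 60.4, 53.1, 53.1, 46.7]

# Assumption: "Models Ranked" is simply the number of rows.
models_ranked = len(avg_scores)

# Assumption: "Top Score" is the highest Avg Score in the table.
top_score = max(avg_scores)

# Assumption: "Average Score" is the unweighted mean, rounded to one decimal.
average_score = round(sum(avg_scores) / models_ranked, 1)

print(models_ranked, top_score, average_score)
```

Under these assumptions the computed values agree with the 11 models, 66.7 top score, and 59.7 average shown in the page summary.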