CadEval
CadEval β evaluates the ability to generate and reason about Computer-Aided Design code, testing spatial reasoning and engineering knowledge.
15
Models Tested
74.0
Top Score
39.6
Average Score
Rankings
| # | Model | Score | Bar |
|---|---|---|---|
| 1 | 74.0 | ||
| 2 | 62.0 | ||
| 3 | 56.0 | ||
| 4 | 54.0 | ||
| 5 | 54.0 | ||
| 6 | 54.0 | ||
| 7 | 42.0 | ||
| 8 | 34.0 | ||
| 9 | 32.0 | ||
| 10 | 26.0 | ||
| 11 | 26.0 | ||
| 12 | 26.0 | ||
| 13 | 26.0 | ||
| 14 | 16.0 | ||
| 15 | 12.0 |