API
Benchmarks/CadEval

CadEval

CadEval β€” evaluates the ability to generate and reason about Computer-Aided Design code, testing spatial reasoning and engineering knowledge.

15
Models Tested
74.0
Top Score
39.6
Average Score