API

PIQA

PIQA (Physical Interaction QA) β€” tests intuitive physical reasoning by asking models to select the correct approach for everyday physical tasks.

36
Models Tested
77.4
Top Score
63.2
Average Score