Benchmark · Agent · Competitive

The Agent Company

The Agent Company tests AI agents on realistic corporate tasks such as email management, code review, data analysis, and cross-tool workflows.

Updated: 2025-09-29
Models tested: 13
Top score: 42.9 (DeepSeek V3.2 Exp)
Median: 11.4 (min 1.1)
Top-5 spread: σ 5.3 (wide open)

Best score over time · one chart, every benchmark

[Chart: The Agent Company · 10 models · frontier running max. Y-axis: score, 0 to 100, higher is better. X-axis: release date, Jul 24 to Sep 25. Source: benchgecko.ai/benchmark/the-agent-company]
Frontier on The Agent Company rose from 7.4 to 42.9 in 15 months · +35.5 points · latest leader DeepSeek V3.2 Exp from DeepSeek.
Pink dots = frontier records · 7 total
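
The "frontier running max" in the chart can be read as the best score seen so far, ordered by model release date, with a frontier record being any model that beats every earlier release. The sketch below illustrates that reading; it is not the site's actual code. The function name `frontier_records`, the model names, the release dates, and the intermediate scores are all hypothetical. Only the endpoints 7.4 and 42.9 come from the page.

```python
from datetime import date


def frontier_records(results):
    """Return the (release_date, model, score) entries that set a new best score.

    Entries are processed in release-date order; an entry is a record only
    if its score is strictly higher than every earlier score.
    """
    records = []
    best = float("-inf")
    for released, model, score in sorted(results, key=lambda r: r[0]):
        if score > best:
            best = score
            records.append((released, model, score))
    return records


# Hypothetical data for illustration: names, dates, and the middle scores
# are made up. Only 7.4 and 42.9 appear on the page.
results = [
    (date(2024, 7, 1), "model-a", 7.4),
    (date(2024, 11, 1), "model-b", 5.0),   # below the running max, not a record
    (date(2025, 2, 1), "model-c", 20.0),
    (date(2025, 9, 29), "model-d", 42.9),
]

records = frontier_records(results)

# Gain between the first and the latest frontier record.
gain = round(records[-1][2] - records[0][2], 1)

for released, model, score in records:
    print(released.isoformat(), model, score)
print("frontier gain:", gain)
```

With the page's endpoints, the gain works out to 42.9 − 7.4 = 35.5 points, matching the figure quoted above the chart.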
Details
Category: Agent
Max score: 100
Models: 13
Updated: 2025-09-29

Same category · related evaluations