Benchmark · KnowledgeCompetitive

Aider · Code Editing

Updated 2025-04-15
Models tested
27
Top score
84.2
Claude 3.5 Sonnet
Median
60.2
min 14.3
Top-5 spread
σ 5.4
wide open

Best score over time · one chart, every benchmark

AIDER · CODE EDITING15 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Jul 24Sep 24Nov 24Feb 25Apr 25RELEASE DATE →benchgecko.ai/benchmark/aider-edit · frontier
Only 15 models have been tested on Aider · Code Editing · not enough history to compute a frontier yet.
Pink dots = frontier records · 1 totalClick to open model page

Same category · related evaluations