LIVE268개 제공업체의 976개 AI 모델 추적 중.

BenchGecko베타

모델976·제공업체268·벤치마크128·기업71·에이전트165·1위Qwen3 VL 235B A22B Instruct · 1415.8%·업데이트방금·데이터 포인트2,902·MCP 서버4,923

리더보드/DeepSeek V3

DeepSeek V3

오픈소스

제공 DeepSeek · 출시일 2024-12-26

59.0

평균 점수

$0.32/1M

입력 가격

$0.89/1M

출력 가격

164K tokens (~82 books)

컨텍스트 윈도우

text

유형

Tested on 22 benchmarks with 59.0% average. Top scores: Chatbot Arena Elo — Overall (1358.2%), ARC AI2 (93.7%), HellaSwag (85.2%).

벤치마크 점수

벤치마크	카테고리	점수	Bar
Chatbot Arena Elo — Overall	arena	1358.2
ARC AI2	knowledge	93.7
HellaSwag	knowledge	85.2
BBH	reasoning	83.3
HELM — IFEval	language	83.2
HELM — WildBench	reasoning	83.1
MMLU	knowledge	82.9
TriviaQA	knowledge	82.9
Lech Mazur Writing	knowledge	77.0
HELM — MMLU-Pro	knowledge	72.3
Winogrande	knowledge	70.4
PIQA	knowledge	69.4
MATH level 5	math	64.8
HELM — GPQA	knowledge	53.8
Fiction.LiveBench	knowledge	50.0
Aider polyglot	coding	48.4
GPQA diamond	knowledge	42.0
HELM — Omni-MATH	math	40.3
WeirdML	coding	36.1
OTIS Mock AIME 2024-2025	math	15.8
SimpleBench	reasoning	2.7
FrontierMath-2025-02-28-Private	math	1.7

유사 모델

Gemini 2.5 Flash Lite

Google DeepMind

phi-3-medium 14B

DeepSeek DeepSeek V3 타임라인

DeepSeek V3Dec 2024

$0.32/M in164Kctx22 benchmarks

DeepSeek V3 0324Mar 2025

$0.20/M in(-0.12)164Kctx2 benchmarks