베타
리더보드/Claude 3.5 Haiku
Anthropic logo

Claude 3.5 Haiku

제공 Anthropic · 출시일 2024-11-04

37.2
평균 점수
$0.80/1M
입력 가격
$4.00/1M
출력 가격
200K tokens (~100 books)
컨텍스트 윈도우
multimodal
유형

Tested on 17 benchmarks with 37.2% average. Top scores: HELM — IFEval (79.2%), HELM — WildBench (76.0%), Lech Mazur Writing (73.5%).

벤치마크카테고리점수Bar
HELM — IFEvallanguage79.2
HELM — WildBenchreasoning76.0
Lech Mazur Writingknowledge73.5
MMLUknowledge65.7
HELM — MMLU-Proknowledge60.5
MATH level 5math46.4
HELM — GPQAknowledge36.3
GeoBenchknowledge34.0
CadEvalcoding32.0
WeirdMLcoding30.7
Aider polyglotcoding28.0
HELM — Omni-MATHmath22.4
Balrogknowledge19.3
GPQA diamondknowledge17.5
SimpleQA Verifiedknowledge6.7
OTIS Mock AIME 2024-2025math4.2
FrontierMath-2025-02-28-Privatemath0.3