How much does Claude 3.7 Sonnet cost?

Claude 3.7 Sonnet costs $3.00 per million input tokens and $15.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.021 per message.

What benchmarks has Claude 3.7 Sonnet been tested on?

Claude 3.7 Sonnet has been evaluated on 26 benchmarks. Top scores: MATH level 5: 91.2, HELM — IFEval: 83.4, Fiction.LiveBench: 83.3.

Is Claude 3.7 Sonnet open source?

No, Claude 3.7 Sonnet is a proprietary model by Anthropic.

How does Claude 3.7 Sonnet compare to Mistral 7B V0.1?

Claude 3.7 Sonnet has an average score of 48.9 while Mistral 7B V0.1 scores 48.9. Mistral 7B V0.1 slightly outperforms Claude 3.7 Sonnet overall. See full comparison →

Home/Models/Claude 3.7 Sonnet

Claude 3.7 Sonnet

Name: Claude 3.7 Sonnet
Price: 3 USD
Author: Anthropic

by Anthropic · Released Feb 2025

Multimodal

48.9

avg score

Rank #113

Compare

Better than 52% of all models

Context

200K tokens (~100 books)

Input $/1M

$3.00

Output $/1M

$15.00

Type

multimodal

License

Proprietary

Benchmarks

26 tested

Data updated today

About

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...

Tested on 26 benchmarks with 47.7% average. Top scores: MATH level 5 (91.2%), HELM — IFEval (83.4%), Fiction.LiveBench (83.3%).

Looking for similar performance at lower cost?
R1 scores 48.0 (98% as good) at $0.70/1M input · 77% cheaper

Capabilities

coding

42.7

#92 globally

reasoning

36.6

#73 globally

math

46.5

#86 globally

knowledge

55.6

#75 globally

agentic

33.3

#14 globally

language

83.4

#40 globally

Benchmark Scores

Compare All

Tested on 26 benchmarks · Ranked across 6 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

64.9—

SWE-Bench verified

Real-world software engineering tasks from GitHub issues. Models must diagnose bugs and write patches that pass test suites. Human-verified subset of SWE-bench.

61.0—

CadEval

Computer-aided design evaluation. Tests understanding of CAD concepts, 3D modeling, and engineering design principles.

54.0—

reasoningCompare reasoning →

HELM — WildBench

Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.

81.4—

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

35.7—

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

28.6—

mathCompare math →

MATH level 5

Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

91.2—

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

57.7—

HELM — Omni-MATH

Stanford HELM evaluation of mathematical reasoning across diverse problem types.

33.0—

Quick compare:

vs Mistral 7B V0.1

vs Phi 4 Mini Instruct

vs Falcon-180B

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Anthropic Claude Sonnet

Claude 3 SonnetJan 2024

28.3

N/AN/Actx6 benchmarks

Claude 3.5 SonnetJan 2024

42.3+14.0

N/AN/Actx25 benchmarks

Claude 3.7 SonnetFeb 2025

47.7+5.4

$3.00/M in200Kctx26 benchmarks

Claude 3.7 Sonnet (thinking)Feb 2025

$3.00/M in200Kctx

Claude Sonnet 4May 2025

44.6+44.6

$3.00/M in1.0Mctx(+800K)27 benchmarks

Claude Sonnet 4.5Sep 2025

42.1-2.5

$3.00/M in1.0Mctx21 benchmarks

Claude Sonnet 4.6Feb 2026

47.6+5.5

$3.00/M in1.0Mctx18 benchmarks

See the full Claude Sonnet family →

Similar Models

Links

Info

Anthropic Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@Anthropic

BenchGecko API

claude-3-7-sonnet

Specifications

Typemultimodal
Context200K tokens (~100 books)
ReleasedFeb 2025
LicenseProprietary
StatusActive
Cost / Message~$0.021

Available On

Anthropic$3.00

Frequently Asked Questions

Claude 3.7 Sonnet is a proprietary multimodal AI model by Anthropic, released in February 2025. It has an average benchmark score of 48.9. Context window: 200K tokens.

Benchmarks

MATH level 5 HELM — IFEval Fiction.LiveBench HELM — WildBench Lech Mazur Writing

Anthropic · Provider Anthropic · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

Claude 3.7 Sonnet

Frequently Asked Questions

Related Models

Benchmarks

Related Pages