Better than 94% of all models
Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text
License: Proprietary
Benchmarks: 4 tested
About
Tested on 4 benchmarks with 78.0% average. Top scores: GSM8K (86.7%), ARC AI2 (81.7%), TriviaQA (78.9%).
Capabilities
math: 86.7 (#10 globally)
knowledge: 75.1 (#8 globally)
Benchmark Scores
Tested on 4 benchmarks · Ranked across 2 categories
Score Distribution (all 233 models): chart on a 0 to 100 score axis, with this model's position marked.
math
GSM8K: 86.7
Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.
knowledge
ARC AI2: 81.7
AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.
TriviaQA: 78.9
Trivia questions sourced from trivia enthusiasts and quiz websites. Tests breadth of general knowledge.
MMLU: 64.5
Massive Multitask Language Understanding. 57 subjects from STEM, humanities, and social sciences. One of the most widely cited knowledge benchmarks.
Score bands: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
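As a rough illustration of how the score bands and the overall average relate to the four listed scores, here is a minimal Python sketch. The `band` function and the `SCORES` dictionary are hypothetical names introduced for this example, not part of any BenchGecko API. The legend leaves the boundary values (85, 70, 50) ambiguous, so the sketch assumes they fall into the higher band.

```python
from statistics import mean

# Scores exactly as listed on this page.
SCORES = {"GSM8K": 86.7, "ARC AI2": 81.7, "TriviaQA": 78.9, "MMLU": 64.5}


def band(score: float) -> str:
    """Map a 0-100 score to a legend band.

    Assumption: boundary values (85, 70, 50) belong to the higher band;
    the page's legend does not say which side they fall on.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Unweighted mean of the four benchmark scores.
average = mean(SCORES.values())

# Band for each benchmark, keyed by benchmark name.
bands = {name: band(score) for name, score in SCORES.items()}

if __name__ == "__main__":
    print(f"average: {average:.2f}")
    for name, label in bands.items():
        print(f"{name}: {SCORES[name]} ({label})")
```

Under these assumptions the unweighted mean is roughly 77.95, in line with the 78.0% average quoted in the About section. Whether the site weights benchmarks equally is not stated on the page.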
claude-instant
Specifications
- Type: text
- Context: N/A
- Released: Jan 2024
- License: Proprietary
- Status: benchmark-only
Frequently Asked Questions
Claude Instant is a proprietary text AI model by Anthropic, released in January 2024. Across the 4 benchmarks it was tested on, it has an average score of 78.0%.