Better than 21% of all models
Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: text
License: Proprietary
Benchmarks: 7 tested
About
Tested on 7 benchmarks with 32.5% average. Top scores: PIQA (54.8%), HellaSwag (43.7%), Winogrande (41.6%).
Capabilities
reasoning: 24.1 (#119 globally)
math: 28.1 (#167 globally)
knowledge: 35.0 (#197 globally)
Benchmark Scores
Tested on 7 benchmarks · Ranked across 3 categories
Score Distribution (all 274 models): chart on a 0 to 100 score axis, with this model's position marked.
reasoning
BBH: 24.1
BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.
math
GSM8K: 28.1
Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.
knowledge
PIQA: 54.8
Physical Interaction: Question Answering. Tests understanding of everyday physical interactions and commonsense physics.
HellaSwag: 43.7
Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.
Winogrande: 41.6
Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.
Score bands: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
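As a rough illustration of how the score bands apply to the scores listed on this page, here is a minimal Python sketch. The function name `score_band` is hypothetical, and the boundary handling is an assumption: the legend's ranges overlap at 70 and 85, so the sketch assigns a boundary value to the higher band. Only the five benchmark scores shown on the page are included; the remaining two of the seven tested benchmarks are not listed.

```python
def score_band(score: float) -> str:
    """Map a 0-100 benchmark score to a legend band.

    Assumption: boundary values (50, 70, 85) fall into the higher band,
    because the legend does not say which side they belong to.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# The five scores shown on this page (two tested benchmarks are not listed).
listed_scores = {
    "BBH": 24.1,
    "GSM8K": 28.1,
    "PIQA": 54.8,
    "HellaSwag": 43.7,
    "Winogrande": 41.6,
}

bands = {name: score_band(value) for name, value in listed_scores.items()}

for name, band in bands.items():
    print(f"{name}: {listed_scores[name]} -> {band}")
```

Under these assumptions, PIQA is the only listed benchmark that reaches the Average band; the other four fall in Below.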
vicuna-13b-v1-1
Specifications
- Type: text
- Context: N/A
- Released: Jan 2024
- License: Proprietary
- Status: benchmark-only
Available On
Unknown · TBD
Frequently Asked Questions
vicuna-13b-v1.1 is a proprietary text AI model by Unknown, released in January 2024. It has an average benchmark score of 30.2.