Home/Models/vicuna-13b-v1.1
U

vicuna-13b-v1.1

by Unknown · Released Jan 2024

30.2
avg score
Rank #216
Compare
Better than 21% of all models
Context
N/A
Input $/1M
TBD
Output $/1M
TBD
Type
text
License
Proprietary
Benchmarks
7 tested
Data updated today
About

Tested on 7 benchmarks with 32.5% average. Top scores: PIQA (54.8%), HellaSwag (43.7%), Winogrande (41.6%).

Capabilities
reasoning
24.1
#119 globally
math
28.1
#167 globally
knowledge
35.0
#197 globally
Benchmark Scores
Compare All
Tested on 7 benchmarks · Ranked across 3 categories
Score Distribution (all 274 models)
0255075100
▲ You are here
BBH

BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.

24.1
GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

28.1
PIQA

Physical Intuition QA. Tests understanding of everyday physical interactions and commonsense physics.

54.8
HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

43.7
Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

41.6
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
vicuna-13b-v1-1
Specifications
  • Typetext
  • ContextN/A
  • ReleasedJan 2024
  • LicenseProprietary
  • Statusbenchmark-only
Available On
U
UnknownTBD
Share & Export
Tweet
vicuna-13b-v1.1 is a proprietary text AI model by Unknown, released in January 2024. It has an average benchmark score of 30.2.