How much does vicuna-13b-v1.1 cost?

vicuna-13b-v1.1 pricing information is not yet available.

What benchmarks has vicuna-13b-v1.1 been tested on?

vicuna-13b-v1.1 has been evaluated on 7 benchmarks. Top scores: PIQA: 54.8, HellaSwag: 43.7, Winogrande: 41.6.

Is vicuna-13b-v1.1 open source?

No, vicuna-13b-v1.1 is a proprietary model by Unknown.

How does vicuna-13b-v1.1 compare to open_llama_7b?

vicuna-13b-v1.1 has an average score of 30.2 while open_llama_7b scores 30.0. vicuna-13b-v1.1 outperforms open_llama_7b overall. See full comparison →

Home/Models/vicuna-13b-v1.1

vicuna-13b-v1.1

Name: vicuna-13b-v1.1
Author: Unknown

by Unknown · Released Jan 2024

30.2

avg score

Rank #216

Compare

Better than 21% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text

License

Proprietary

Benchmarks

7 tested

Data updated today

About

Tested on 7 benchmarks with 32.5% average. Top scores: PIQA (54.8%), HellaSwag (43.7%), Winogrande (41.6%).

Capabilities

reasoning

24.1

#119 globally

math

28.1

#167 globally

knowledge

35.0

#197 globally

Benchmark Scores

Compare All

Tested on 7 benchmarks · Ranked across 3 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

reasoningCompare reasoning →

BBH

BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.

24.1—

mathCompare math →

GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

28.1—

knowledgeCompare knowledge →

PIQA

Physical Intuition QA. Tests understanding of everyday physical interactions and commonsense physics.

54.8—

HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

43.7—

Winogrande

Commonsense coreference resolution. Tests understanding of pronoun references in ambiguous sentences.

41.6—

Quick compare:

vs open_llama_7b

vs Claude 3 Haiku

vs Llama 3 70B Instruct

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

open_llama_7b

Frequently Asked Questions

vicuna-13b-v1.1 is a proprietary text AI model by Unknown, released in January 2024. It has an average benchmark score of 30.2.

Benchmarks

PIQA HellaSwag Winogrande GSM8K ARC AI2

Unknown · Provider Unknown · Economy All Models Compare Models Pricing Developers · API

vicuna-13b-v1.1

Frequently Asked Questions

Related Models

Benchmarks

Related Pages