84.4 avg score
Rank #16 · Better than 93% of all models
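The percentile claim follows directly from the rank. A minimal sketch, assuming the site computes it as the share of the 233 ranked models that place below this one:

```python
# Percentile from rank: share of ranked models placing below this one.
TOTAL_MODELS = 233
RANK = 16

models_below = TOTAL_MODELS - RANK        # 217 models rank lower
percentile = models_below / TOTAL_MODELS  # ~0.931

print(f"Better than {percentile:.0%} of all models")  # Better than 93% of all models
```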
Context: N/A · Input $/1M: TBD · Output $/1M: TBD
Type: text · License: Open Source · Benchmarks: 7 tested
Data updated today
About
Tested on 7 benchmarks with an 84.4 average. Top scores: ARC AI2 (89.6%), HellaSwag (82.8%), TriviaQA (80.0%).
Capabilities
- Reasoning: 71.7 (#22 globally)
- Knowledge: 77.3 (#6 globally)
Benchmark Scores
Tested on 7 benchmarks · Ranked across 2 categories
Score Distribution: chart of all 233 models, with this model's position marked.
Reasoning
BBH: 71.7
BIG-Bench Hard: 23 challenging tasks from BIG-Bench on which prior language models fell below average human performance.
Knowledge
ARC AI2: 89.6
AI2 Reasoning Challenge: grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.
HellaSwag: 82.8
Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.
TriviaQA: 80.0
Trivia questions sourced from trivia enthusiasts and quiz websites. Tests breadth of general knowledge.
Score bands: Excellent (85+) · Good (70-85) · Average (50-70) · Below (<50)
BenchGecko API slug: deepseek-v2-moe-236b-may-2024
Specifications
- Type: text
- Context: N/A
- Released: May 2024
- License: Open Source
- Status: benchmark-only
Frequently Asked Questions
DeepSeek-V2 (MoE-236B) is an open-source text AI model by DeepSeek, released in May 2024, with an average benchmark score of 84.4.