How much does DeepSeek V3.1 cost?

DeepSeek V3.1 costs $0.15 per million input tokens and $0.75 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.001 per message.

What benchmarks has DeepSeek V3.1 been tested on?

DeepSeek V3.1 has been evaluated on 5 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1417.9, Lech Mazur Writing: 85.2, Fiction.LiveBench: 52.8.

Is DeepSeek V3.1 open source?

Yes, DeepSeek V3.1 is open source.

How does DeepSeek V3.1 compare to LongCat Flash Chat?

DeepSeek V3.1 has an average score of 53.4 while LongCat Flash Chat scores 53.4. LongCat Flash Chat slightly outperforms DeepSeek V3.1 overall. DeepSeek V3.1 costs $0.15/1M input vs LongCat Flash Chat at $0.20/1M input. See full comparison →

Home/Models/DeepSeek V3.1

DeepSeek V3.1

Name: DeepSeek V3.1
Price: 0.15 USD
Author: DeepSeek

by DeepSeek · Released Aug 2025

Open Source

53.4

avg score

Rank #95

Compare

Better than 59% of all models

Context

33K tokens (~16 books)

Input $/1M

$0.15

Output $/1M

$0.75

Type

text

License

Open Source

Benchmarks

5 tested

Data updated today

About

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

Tested on 5 benchmarks with 51.1% average. Top scores: Chatbot Arena Elo — Overall (1417.9%), Lech Mazur Writing (85.2%), Fiction.LiveBench (52.8%).

Looking for similar performance at lower cost?
Phi 4 scores 54.2 (101% as good) at $0.07/1M input · 57% cheaper

Capabilities

coding

38.4

#101 globally

reasoning

28.0

#89 globally

knowledge

69.0

#20 globally

Benchmark Scores

Compare All

Tested on 5 benchmarks · Ranked across 4 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

codingCompare coding →

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

38.4—

reasoningCompare reasoning →

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

28.0—

knowledgeCompare knowledge →

Lech Mazur Writing

Writing quality evaluation by Lech Mazur. Tests prose quality, coherence, and stylistic ability.

85.2—

Fiction.LiveBench

LiveBench fiction analysis. Tests literary comprehension and creative text understanding.

52.8—

Quick compare:

vs LongCat Flash Chat

vs Grok 3 Mini Beta

vs GPT-5 Pro

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · DeepSeek DeepSeek V3.1

DeepSeek V3.1Aug 2025

51.1

$0.15/M in33Kctx5 benchmarks

DeepSeek V3.1 TerminusSep 2025

$0.27/M in(+0.12)164Kctx(+131K)1 benchmark

See the full DeepSeek V3.1 family →

Similar Models

Links

Info

DeepSeek Pricing explorer Developers · API

Research

Documentation

Community

Source Code

BenchGecko API

deepseek-chat-v3-1

Specifications

Typetext
Context33K tokens (~16 books)
ReleasedAug 2025
LicenseOpen Source
StatusActive
Cost / Message~$0.001

Available On

DeepSeek$0.15

Frequently Asked Questions

DeepSeek V3.1 is an open-source text AI model by DeepSeek, released in August 2025. It has an average benchmark score of 53.4. Context window: 33K tokens.

Benchmarks

Chatbot Arena Elo — Overall Lech Mazur Writing Fiction.LiveBench WeirdML SimpleBench

DeepSeek · Provider DeepSeek · Economy All Models Compare Models Pricing Developers · API

DeepSeek V3.1

Frequently Asked Questions

Related Models

Benchmarks

Related Pages