How much does DeepSeek R1 Distill Llama 8B cost?

DeepSeek R1 Distill Llama 8B is open source and can be self-hosted.

What benchmarks has DeepSeek R1 Distill Llama 8B been tested on?

DeepSeek R1 Distill Llama 8B has been evaluated on 11 benchmarks. Top scores: JSQuAD: 80.2, JNLI: 69.4, JCommonsenseQA: 62.4.

Is DeepSeek R1 Distill Llama 8B open source?

Yes, DeepSeek R1 Distill Llama 8B is open source.

How does DeepSeek R1 Distill Llama 8B compare to Claude 3 Opus?

DeepSeek R1 Distill Llama 8B has an average score of 38.4 while Claude 3 Opus scores 38.4. Claude 3 Opus slightly outperforms DeepSeek R1 Distill Llama 8B overall. See full comparison →

Home/Models/DeepSeek R1 Distill Llama 8B

DeepSeek R1 Distill Llama 8B

Name: DeepSeek R1 Distill Llama 8B
Author: DeepSeek

by DeepSeek · Released Jan 2025

Open Source

38.4

avg score

Rank #148

Compare

Better than 36% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text-generation

License

Open Source

Benchmarks

11 tested

Data updated today

About

Deepseek-ai text generation model. 1145K downloads on HuggingFace.

Tested on 11 benchmarks with 33.6% average. Top scores: JSQuAD (80.2%), JNLI (69.4%), JCommonsenseQA (62.4%).

Capabilities

reasoning

0.5

#182 globally

math

22.0

#153 globally

knowledge

6.4

#215 globally

language

54.8

#99 globally

general

5.3

#60 globally

Benchmark Scores

Compare All

Tested on 11 benchmarks · Ranked across 5 categories

Score Distribution (all 231 models)

0255075100

▲ You are here

reasoningCompare reasoning →

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

0.5—

mathCompare math →

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

22.0—

knowledgeCompare knowledge →

MMLU-PRO

HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.

12.1—

GPQA

HuggingFace evaluation of GPQA (Graduate-Level Google-Proof Q&A). PhD-level science questions that cannot be easily searched.

0.7—

Quick compare:

vs Claude 3 Opus

vs DeepSeek R1 Distill Qwen 7B

vs Command R+ (08-2024)

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · DeepSeek DeepSeek R1

DeepSeek R1 Distill Llama 8BJan 2025

33.6

N/AN/Actx11 benchmarks

DeepSeek R1 Distill Qwen 1.5BJan 2025

10.4-23.2

N/AN/Actx6 benchmarks

DeepSeek R1 Distill Qwen 14BJan 2025

56.0+45.6

N/AN/Actx11 benchmarks

DeepSeek R1 Distill Qwen 7BJan 2025

32.7-23.3

N/AN/Actx11 benchmarks

See the full DeepSeek R1 family →

Similar Models

Claude 3 Opus

Anthropic

38.4TBD

DeepSeek R1 Distill Qwen 7B

Links

Info

Research

Documentation

Community

Source Code

BenchGecko API

deepseek-ai-deepseek-r1-distill-llama-8b

Specifications

Typetext-generation
ContextN/A
ReleasedJan 2025
LicenseOpen Source
StatusActive

Available On

DeepSeekTBD

Frequently Asked Questions

DeepSeek R1 Distill Llama 8B is an open-source text-generation AI model by DeepSeek, released in January 2025. It has an average benchmark score of 38.4.

Benchmarks

JSQuAD JNLI JCommonsenseQA LLM-JP — Overall IFEval

DeepSeek · Provider All Models Compare Models

DeepSeek R1 Distill Llama 8B

Frequently Asked Questions

Related Models

Benchmarks

Related Pages