How much does Gemma 2 9B cost?

Gemma 2 9B costs $0.03 per million input tokens and $0.09 per million output tokens. For a typical conversation (~2,000 tokens), that works out to between $0.00006 (all input) and $0.00018 (all output) per message, or roughly a hundredth of a cent.
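The per-message figure follows directly from the listed per-million-token prices. Here is a minimal sketch of that arithmetic; the function name `message_cost` and the 1,500/500 input/output split are illustrative assumptions, not values from the listing.

```python
# Prices from the listing, in USD per one million tokens.
INPUT_USD_PER_MILLION = 0.03
OUTPUT_USD_PER_MILLION = 0.09


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one message at the listed rates."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical split of a ~2,000-token exchange: 1,500 tokens in, 500 out.
cost = message_cost(1_500, 500)
print(f"${cost:.6f}")  # six decimals, so the value does not round to $0.000
```

Printing with six decimal places avoids the rounding that makes very small per-message costs display as zero.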

What benchmarks has Gemma 2 9B been tested on?

Gemma 2 9B has been evaluated on 13 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1265.0, GSM8K: 84.9, IFEval: 74.4.

Is Gemma 2 9B open source?

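The listing classifies Gemma 2 9B as open source. More precisely, its weights are openly available under Google's Gemma Terms of Use, which carry some usage restrictions.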

How does Gemma 2 9B compare to Magnum v4 72B?

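Gemma 2 9B and Magnum v4 72B both show an average score of 51.2. The listing ranks Magnum v4 72B slightly ahead, so any gap between them is smaller than the one-decimal rounding of the displayed scores. On price, Gemma 2 9B costs $0.03 per million input tokens versus $3.00 per million for Magnum v4 72B, about 100 times less.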


Gemma 2 9B

Name: Gemma 2 9B
Price: 0.03 USD per 1M input tokens
Author: Google DeepMind

by Google DeepMind · Released Jun 2024

Open Source

Avg score: 51.2
Rank: #129 (better than 53% of all models)

Context: 8K tokens (roughly 6,000 English words)
Input $/1M: $0.03
Output $/1M: $0.09
Type: text
License: Open Source
Benchmarks: 13 tested


About

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of...

Tested on 13 benchmarks with 36.0% average. Top scores: Chatbot Arena Elo — Overall (1265.0, an Elo rating rather than a percentage), GSM8K (84.9%), IFEval (74.4%).

Capabilities

reasoning: 9.7 (#163 globally)
math: 31.5 (#152 globally)
knowledge: 36.0 (#193 globally)
language: 74.4 (#65 globally)
general: 42.1 (#16 globally)

Benchmark Scores

Tested on 13 benchmarks · Ranked across 6 categories

[Figure: score distribution across all 274 models on a 0 to 100 axis, with this model's position marked]

reasoning

MuSR: 9.7
HuggingFace MuSR (Multistep Soft Reasoning). Tests multi-hop reasoning that requires chaining multiple facts together.

math

GSM8K: 84.9
Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

MATH Level 5: 21.0
Competition-level math from AMC, AIME, and olympiad problems. Level 5 is the hardest tier, requiring creative problem-solving.

MATH Level 5 (HuggingFace): 19.5
HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning.

knowledge

PIQA: 67.4
Physical Interaction QA. Tests understanding of everyday physical interactions and commonsense physics.

MMLU: 62.8
Massive Multitask Language Understanding. 57 subjects from STEM, humanities, and social sciences. The most widely cited knowledge benchmark.

MMLU-Pro: 31.9
HuggingFace MMLU-Pro. Harder version of MMLU with 10 answer choices instead of 4 and more challenging questions.


Score bands: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
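The score bands in the legend can be applied mechanically to the benchmark scores listed above. The sketch below assumes each band includes its lower bound (so 85.0 counts as Excellent and 70.0 as Good), which the legend does not specify; `score_band` is an illustrative name, not part of any BenchGecko API.

```python
def score_band(score: float) -> str:
    """Map a 0-100 benchmark score to the legend's band name.

    Assumption: lower bounds are inclusive; the legend leaves this open.
    """
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# Scores taken from the listing above.
scores = {"GSM8K": 84.9, "IFEval": 74.4, "PIQA": 67.4, "MMLU": 62.8, "MMLU-Pro": 31.9}
for name, value in scores.items():
    print(f"{name}: {value} ({score_band(value)})")
```

Elo ratings such as the Chatbot Arena score are on a different scale and should not be passed through this mapping.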

Similar Models

Magnum v4 72B (anthracite-org): 51.2 avg score, $3.00/1M input
Phi 3 Mini 4k Instruct (Microsoft): 51.1 avg score, price TBD
Phi 3.5 Mini Instruct (Microsoft): 51.5 avg score, price TBD


BenchGecko API

gemma-2-9b-it

Specifications

Type: text
Context: 8K tokens (roughly 6,000 English words)
Released: Jun 2024
License: Open Source
Status: Active
Cost / Message: about $0.0001 for ~2,000 tokens

Available On

Google DeepMind: $0.03 per 1M input tokens

Frequently Asked Questions

Gemma 2 9B is an open-source text AI model by Google DeepMind, released in June 2024. It has an average benchmark score of 51.2. Context window: 8K tokens.
