Beta
Home/Models/Gemma 4 31B
Google DeepMind logo

Gemma 4 31B

by Google DeepMind · Released Apr 2026

Open SourceMultimodal
68.2
avg score
Rank #42
Compare
Better than 82% of all models
Context
262K tokens (~131 books)
Input $/1M
$0.13
Output $/1M
$0.38
Type
multimodal
License
Open Source
Benchmarks
8 tested
Data updated today
About

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Tested on 8 benchmarks with 61.6% average. Top scores: LiveBench — Mathematics (73.9%), LiveBench — Language (71.3%), LiveBench — If (67.6%).

Capabilities
coding
50.2
#64 globally
reasoning
59.1
#32 globally
math
73.9
#26 globally
knowledge
61.6
#37 globally
language
69.5
#70 globally
Benchmark Scores
Compare All
Tested on 8 benchmarks · Ranked across 5 categories
Score Distribution (all 231 models)
0255075100
▲ You are here
LiveBench — Coding

Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.

60.3
LiveBench — Agentic Coding

LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.

40.0
LiveBench — Reasoning

Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.

59.4
LiveBench — Data Analysis

Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.

58.8
LiveBench — Mathematics

Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.

73.9
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
BenchGecko API
gemma-4-31b-it
Specifications
  • Typemultimodal
  • Context262K tokens (~131 books)
  • ReleasedApr 2026
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.001
Available On
Google DeepMind logoGoogle DeepMind$0.13
Share & Export
Tweet
Gemma 4 31B is an open-source multimodal AI model by Google DeepMind, released in April 2026. It has an average benchmark score of 68.2. Context window: 262K tokens.