Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Tested on 8 benchmarks with 61.6% average. Top scores: LiveBench — Mathematics (73.9%), LiveBench — Language (71.3%), LiveBench — If (67.6%).
Regularly refreshed coding problems that avoid data contamination. New problems added monthly to prevent memorization.
LiveBench coding tasks that require multi-step reasoning and tool use. Tests planning and execution of complex coding workflows.
Regularly refreshed reasoning problems testing logical deduction, spatial reasoning, and analytical thinking.
Fresh data analysis tasks testing ability to interpret tables, charts, and statistical data.
Regularly updated math problems that test numerical reasoning, algebra, calculus, and combinatorics.
- Typemultimodal
- Context262K tokens (~131 books)
- ReleasedApr 2026
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.001