How much does Gemini 3.1 Pro Preview cost?

Gemini 3.1 Pro Preview costs $2.00 per million input tokens and $12.00 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.016 per message.

What benchmarks has Gemini 3.1 Pro Preview been tested on?

Gemini 3.1 Pro Preview has been evaluated on 28 benchmarks. Top scores: Chatbot Arena Elo — Overall: 1486.4, Chatbot Arena Elo — Coding: 1447.0, ARC-AGI: 98.0.

Is Gemini 3.1 Pro Preview open source?

No, Gemini 3.1 Pro Preview is a proprietary model by Google DeepMind.

How does Gemini 3.1 Pro Preview compare to phi-3-small 7.4B?

Gemini 3.1 Pro Preview has an average score of 79.4 while phi-3-small 7.4B scores 79.3. Gemini 3.1 Pro Preview outperforms phi-3-small 7.4B overall. See full comparison →

Home/Models/Gemini 3.1 Pro Preview

Gemini 3.1 Pro Preview

Name: Gemini 3.1 Pro Preview
Price: 2 USD
Author: Google DeepMind

by Google DeepMind · Released Feb 2026

Multimodal1M ContextPreview

79.4

avg score

Rank #24

Compare

Better than 91% of all models

Context

1.0M tokens (~524 books)

Input $/1M

$2.00

Output $/1M

$12.00

Type

multimodal

License

Proprietary

Benchmarks

28 tested

Data updated today

About

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Tested on 28 benchmarks with 56.6% average. Top scores: Chatbot Arena Elo — Overall (1486.4%), Chatbot Arena Elo — Coding (1447.0%), ARC-AGI (98.0%).

Looking for similar performance at lower cost?
MiniMax M3 scores 79.7 (100% as good) at $0.30/1M input · 85% cheaper

Capabilities

coding

62.6

#41 globally

reasoning

83.5

#5 globally

math

47.1

#97 globally

knowledge

53.2

#98 globally

agentic

33.5

#16 globally

speed

75.9

#19 globally

Benchmark Scores

Compare All

Tested on 28 benchmarks · Ranked across 7 categories

Score Distribution (all 274 models)

0255075100

▲ You are here

codingCompare coding →

Terminal Bench

Complex terminal-based engineering tasks. Models must use command-line tools, navigate filesystems, and debug systems through shell interaction.

80.2—

SWE-Bench verified

Real-world software engineering tasks from GitHub issues. Models must diagnose bugs and write patches that pass test suites. Human-verified subset of SWE-bench.

75.6—

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

72.1—

reasoningCompare reasoning →

ARC-AGI

Abstraction and Reasoning Corpus. Tests fluid intelligence through novel visual pattern recognition puzzles. Core measure of general intelligence.

98.0—

ARC-AGI-2

ARC-AGI 2, harder sequel to ARC. More complex abstract reasoning patterns that test generalization ability beyond training data.

77.1—

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

75.5—

mathCompare math →

OTIS Mock AIME 2024-2025

Mock AIME (American Invitational Mathematics Exam) problems from OTIS. Tests mathematical competition performance.

95.6—

FrontierMath-Tiers-1-3-v2-Private

59.6—

FrontierMath-2025-02-28-Private

Original research-level math problems created by professional mathematicians. Problems are unpublished and cannot be memorized.

36.9—

Quick compare:

vs phi-3-small 7.4B

vs MiniMax M3

vs Qwen3.7 Max

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Model Family · Google DeepMind Gemini 3.1 Pro

Gemini 3.1 Pro PreviewFeb 2026

60.6

$2.00/M in1.0Mctx23 benchmarks

Gemini 3.1 Pro Preview Custom ToolsFeb 2026

$2.00/M in1.0Mctx

See the full Gemini 3.1 Pro family →

Similar Models

Links

Info

Google DeepMind Pricing explorer Developers · API

Research

Technical Report

Documentation

API Docs Playground

Community

@Google DeepMind

BenchGecko API

gemini-3-1-pro-preview

Specifications

Typemultimodal
Context1.0M tokens (~524 books)
ReleasedFeb 2026
LicenseProprietary
Statuspreview
Cost / Message~$0.016

Available On

Google DeepMind$2.00

Frequently Asked Questions

Gemini 3.1 Pro Preview is a proprietary multimodal AI model by Google DeepMind, released in February 2026. It has an average benchmark score of 79.4. Context window: 1M tokens.

Benchmarks

Chatbot Arena Elo — Overall Chatbot Arena Elo — Coding ARC-AGI OTIS Mock AIME 2024-2025 GPQA diamond

Google DeepMind · Provider Google DeepMind · Economy All Models Compare Models Pricing Developers · API Context Window · Glossary

Gemini 3.1 Pro Preview

Frequently Asked Questions

Related Models

Benchmarks

Related Pages