How much does Kimi K2 0711 cost?

Kimi K2 0711 costs $0.57 per million input tokens and $2.30 per million output tokens. For a typical conversation (~2,000 tokens), that's approximately $0.003 per message.
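The per-message figure can be reproduced with a small calculation. This is a minimal sketch: the prices come from the listing, but the even split of the ~2,000 tokens into 1,000 input and 1,000 output tokens is an assumption, since the page does not say how it divides them. The function name `message_cost` is hypothetical.

```python
# Prices are taken from the listing; the token split below is an assumption.
INPUT_PRICE_PER_M = 0.57   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 2.30  # USD per 1M output tokens


def message_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one message from its token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Assumed split: 1,000 input tokens + 1,000 output tokens (~2,000 total).
cost = message_cost(1_000, 1_000)
print(f"${cost:.5f}")
```

Under that assumed split the estimate is about $0.0029, which rounds to the quoted $0.003. A message with more output than input tokens would cost more, because output tokens are priced roughly four times higher.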

What benchmarks has Kimi K2 0711 been tested on?

Kimi K2 0711 has been evaluated on 12 benchmarks. Top scores: Lech Mazur Writing: 86.9, HELM — WildBench: 86.2, HELM — IFEval: 85.0.

Is Kimi K2 0711 open source?

Yes, Kimi K2 0711 is open source.

How does Kimi K2 0711 compare to Palmyra X5?

Kimi K2 0711 and Palmyra X5 are both listed with an average score of 57.0, so any gap between them is smaller than the displayed precision; the listing places Palmyra X5 slightly ahead overall. Kimi K2 0711 costs $0.57 per 1M input tokens versus $0.60 per 1M input tokens for Palmyra X5.

Kimi K2 0711

by moonshotai · Released Jul 2025

Open Source

Average score: 57.0
Rank: #102
Better than 63% of all models

Context: 131K tokens (~66 books)
Input $/1M: $0.57
Output $/1M: $2.30
Type: text
License: Open Source
Benchmarks: 12 tested

About

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...

Tested on 12 benchmarks with 56.2% average. Top scores: Lech Mazur Writing (86.9%), HELM — WildBench (86.2%), HELM — IFEval (85.0%).

Looking for similar performance at lower cost?
Qwen2.5 Coder 7B Instruct scores 56.6 (99% as good) at $0.03/1M input · 95% cheaper
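The two percentages in this suggestion follow from the listed scores and input prices. The sketch below shows one plausible way to derive them; the helper names are hypothetical and the formulas are an assumption about how the page computes its figures.

```python
def relative_score(candidate: float, reference: float) -> float:
    """Candidate score expressed as a percentage of the reference score."""
    return 100 * candidate / reference


def percent_cheaper(candidate_price: float, reference_price: float) -> float:
    """Percentage saved by paying the candidate price instead of the reference price."""
    return 100 * (1 - candidate_price / reference_price)


# Qwen2.5 Coder 7B Instruct (56.6, $0.03/1M input) vs Kimi K2 0711 (57.0, $0.57/1M input)
print(round(relative_score(56.6, 57.0)))   # 99
print(round(percent_cheaper(0.03, 0.57)))  # 95
```

The comparison uses input prices only, so the actual saving depends on how many output tokens a workload generates.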

Capabilities

coding: 32.8 (#134 globally)
reasoning: 48.9 (#65 globally)
math: 65.4 (#48 globally)
knowledge: 73.8 (#10 globally)
language: 85.0 (#35 globally)

Benchmark Scores

Tested on 12 benchmarks · Ranked across 5 categories

[Figure: Score Distribution (all 274 models), axis from 0 to 100, with this model's position marked]
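The rank of #102 and the "better than 63%" figure stated earlier are consistent with the 274 models in this distribution. A minimal sketch of the check, assuming the percentage is the share of models ranked below this one (the function name is hypothetical, and the page does not state its exact formula):

```python
def percent_better_than(rank: int, total_models: int) -> float:
    """Share of models ranked below the given rank, as a percentage."""
    return 100 * (total_models - rank) / total_models


# Rank #102 out of 274 models, both taken from the listing.
print(round(percent_better_than(102, 274)))  # 63
```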

coding

Aider polyglot

Multi-language code editing from Aider. Tests editing ability across Python, JavaScript, TypeScript, Java, C++, Go, Rust, and more.

Score: 59.1

WeirdML

Unusual and adversarial machine learning challenges. Tests robustness of reasoning about edge cases in ML systems.

Score: 39.4

Terminal Bench

Complex terminal-based engineering tasks. Models must use command-line tools, navigate filesystems, and debug systems through shell interaction.

Score: 27.8

reasoning

HELM — WildBench

Stanford HELM WildBench evaluation. Tests reasoning on challenging real-world tasks.

Score: 86.2

SimpleBench

Deceptively simple questions that humans find easy but AI models often get wrong. Tests common sense and reasoning gaps.

Score: 11.6

math

HELM — Omni-MATH

Stanford HELM evaluation of mathematical reasoning across diverse problem types.

Score: 65.4

Score legend: Excellent (85+), Good (70-85), Average (50-70), Below (<50)

BenchGecko API

kimi-k2

Specifications

Type: text
Context: 131K tokens (~66 books)
Released: Jul 2025
License: Open Source
Status: Active
Cost / Message: ~$0.003

Available On

moonshotai: $0.57

Frequently Asked Questions

Kimi K2 0711 is an open-source text AI model by moonshotai, released in July 2025. It has an average benchmark score of 57.0. Context window: 131K tokens.

Benchmarks

Lech Mazur Writing, HELM — WildBench, HELM — IFEval, HELM — MMLU-Pro, HELM — Omni-MATH
