Cheapest 200K context LLMs

Every LLM with a context window of 200,000 tokens or more, ranked by input price per 1M tokens.

Models: 40
Cheapest: $-1000000.00
Min context: 200K tokens

What this page is
A 200K context window is the modern default for frontier models. This page lists every priced 200K+ model, cheapest first. For long documents that fit in 200K tokens, this tier offers the best balance of price, recall, and model diversity.

200K+ context models, cheapest first.

| # | Provider | Model | In $/1M | Out $/1M | Type |
|---|----------|-------|---------|----------|------|
| 1 | OpenRouter | Auto Router | $-1000000.00 | $-1000000.00 | Closed |
| 2 | OpenRouter | Pareto Code Router | $-1000000.00 | $-1000000.00 | Closed |
| 3 | OpenRouter | Elephant | $0.00 | $0.00 | Closed |
| 4 | OpenRouter | Free Models Router | $0.00 | $0.00 | Closed |
| 5 | Google DeepMind | Gemma 4 26B A4B (free) | $0.00 | $0.00 | OSS |
| 6 | Google DeepMind | Gemma 4 31B (free) | $0.00 | $0.00 | OSS |
| 7 | Tencent | Hy3 preview (free) | $0.00 | $0.00 | Closed |
| 8 | Google DeepMind | Lyria 3 Clip Preview | $0.00 | $0.00 | Closed |
| 9 | Google DeepMind | Lyria 3 Pro Preview | $0.00 | $0.00 | Closed |
| 10 | NVIDIA | Nemotron 3 Nano 30B A3B (free) | $0.00 | $0.00 | OSS |
| 11 | NVIDIA | Nemotron 3 Nano Omni (free) | $0.00 | $0.00 | Closed |
| 12 | NVIDIA | Nemotron 3 Super (free) | $0.00 | $0.00 | OSS |
| 13 | OpenRouter | Owl Alpha | $0.00 | $0.00 | Closed |
| 14 | Alibaba Qwen | Qwen3 Coder 480B A35B (free) | $0.00 | $0.00 | OSS |
| 15 | Alibaba Qwen | Qwen3 Next 80B A3B Instruct (free) | $0.00 | $0.00 | OSS |
| 16 | Alibaba Qwen | Qwen3.6 Plus (free) | $0.00 | $0.00 | Closed |
| 17 | Alibaba Qwen | Qwen3.6 Plus Preview (free) | $0.00 | $0.00 | OSS |
| 18 | StepFun | Step 3.5 Flash (free) | $0.00 | $0.00 | OSS |
| 19 | OpenAI | GPT-5 Nano | $0.05 | $0.40 | Closed |
| 20 | NVIDIA | Nemotron 3 Nano 30B A3B | $0.05 | $0.20 | OSS |
| 21 | Google DeepMind | Gemma 4 26B A4B | $0.06 | $0.33 | OSS |
| 22 | Z.ai | GLM 4.7 Flash | $0.06 | $0.40 | OSS |
| 23 | Amazon | Nova Lite 1.0 | $0.06 | $0.24 | Closed |
| 24 | Alibaba Qwen | Qwen3.5-Flash | $0.07 | $0.26 | OSS |
| 25 | Alibaba Qwen | Qwen3 235B A22B Instruct 2507 | $0.07 | $0.10 | OSS |
| 26 | Google DeepMind | Gemini 2.0 Flash Lite | $0.07 | $0.30 | Closed |
| 27 | ByteDance | Seed 1.6 Flash | $0.07 | $0.30 | Closed |
| 28 | Meta | Llama 4 Scout | $0.08 | $0.30 | OSS |
| 29 | Xiaomi | MiMo-V2-Flash | $0.09 | $0.29 | OSS |
| 30 | NVIDIA | Nemotron 3 Super | $0.09 | $0.45 | OSS |
| 31 | Alibaba Qwen | Qwen3 30B A3B Instruct 2507 | $0.09 | $0.30 | OSS |
| 32 | Alibaba Qwen | Qwen3 Next 80B A3B Instruct | $0.09 | $1.10 | OSS |
| 33 | Google DeepMind | Gemini 2.0 Flash | $0.10 | $0.40 | Closed |
| 34 | Google DeepMind | Gemini 2.5 Flash Lite | $0.10 | $0.40 | Closed |
| 35 | Google DeepMind | Gemini 2.5 Flash Lite Preview 09-2025 | $0.10 | $0.40 | Closed |
| 36 | OpenAI | GPT-4.1 Nano | $0.10 | $0.40 | Closed |
| 37 | Alibaba Qwen | Qwen3.5-9B | $0.10 | $0.15 | OSS |
| 38 | ByteDance | Seed-2.0-Mini | $0.10 | $0.40 | Closed |
| 39 | StepFun | Step 3.5 Flash | $0.10 | $0.30 | OSS |
| 40 | Alibaba Qwen | Qwen3 Coder Next | $0.12 | $0.80 | OSS |

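
The cheapest-first ordering can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual ranking code: the `Row` type and `rank_cheapest_first` function are hypothetical names, and the `placeholders_last` option rests on the assumption that a negative price such as $-1000000.00 is a placeholder rather than a real price. The sample rows are taken from the table.

```python
from typing import NamedTuple


class Row(NamedTuple):
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens


def rank_cheapest_first(rows, placeholders_last=True):
    """Sort rows by input price, cheapest first.

    Assumption: a negative price is a placeholder, not a real price.
    With placeholders_last=True such rows are moved to the end;
    with False they sort first, as in a plain numeric sort.
    """
    def key(row):
        is_placeholder = placeholders_last and row.input_price < 0
        # Tuples compare left to right: placeholder flag, price, then name.
        return (is_placeholder, row.input_price, row.name)

    return sorted(rows, key=key)


rows = [
    Row("Qwen3 Coder Next", 0.12, 0.80),
    Row("Auto Router", -1000000.00, -1000000.00),
    Row("GPT-5 Nano", 0.05, 0.40),
    Row("Elephant", 0.00, 0.00),
]

for rank, row in enumerate(rank_cheapest_first(rows), start=1):
    print(rank, row.name, row.input_price)
```

Calling `rank_cheapest_first(rows, placeholders_last=False)` gives a plain numeric sort, which puts the negative-priced router entries at the top.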
Cheapest: Auto Router, $-1000000.00 per 1M input tokens

Why the gap

At the 200K tier, premium pricing buys better tail-end retrieval and stronger overall reasoning. For retrieval-heavy RAG workloads, cheap models often match premium ones.

Most expensive: Qwen3 Coder Next, $0.12 per 1M input tokens
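
To see what per-1M prices mean for a single request, here is a rough worked example in Python. The `request_cost` helper is a hypothetical name, and the 2,000-token reply length is an assumption chosen for illustration. The prices are the table's entries for GPT-5 Nano and Qwen3 Coder Next.

```python
def request_cost(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Cost in USD of one request, given prices quoted per 1M tokens."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000


# A prompt that fills the 200K window, with an assumed 2,000-token reply.
nano = request_cost(200_000, 2_000, 0.05, 0.40)        # GPT-5 Nano
coder_next = request_cost(200_000, 2_000, 0.12, 0.80)  # Qwen3 Coder Next

print(f"GPT-5 Nano:       ${nano:.4f}")        # $0.0108
print(f"Qwen3 Coder Next: ${coder_next:.4f}")  # $0.0256
```

With a full 200K prompt, input tokens dominate the bill, which is why this page ranks by input price.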
A 200K window is enough for a full novel, a medium-sized codebase, or 100+ pages of PDFs. Reach for a 1M-token window only when you truly need whole-repo or multi-book context.
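
A quick way to check whether a document is likely to fit is to estimate its token count from its length. The sketch below assumes roughly 4 characters per token, a common rule of thumb for English text and not a real tokenizer, and reserves an assumed 4,000 tokens for the reply. The function names are hypothetical. For an exact count, use the tokenizer of the model you plan to call.

```python
import math

CHARS_PER_TOKEN = 4  # rough rule of thumb for English text; an assumption


def estimate_tokens(text: str) -> int:
    """Estimate the token count of text from its character length."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def fits_in_window(text: str, window: int = 200_000, reserve_for_output: int = 4_000) -> bool:
    """Return True if the estimated prompt plus the reserved reply fits the window."""
    return estimate_tokens(text) + reserve_for_output <= window


document = "word " * 100_000  # 500,000 characters of sample text
print(estimate_tokens(document))  # 125000
print(fits_in_window(document))   # True
```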