Is it cheaper to self-host or use an API?

At low volume, API is cheaper. At very high volume (24/7 high utilization), self-hosting wins. The crossover point is usually around 10 to 100 million tokens per day depending on model size. Our self-host estimates above assume on-demand cloud GPUs.

Where do I download the weights?

HuggingFace is the canonical source for nearly every OSS model. Check the model detail page on BenchGecko for the exact HF ID.

Which OSS models are safe for commercial use?

Apache 2.0 and MIT licensed models (Mistral, most Qwen versions, Gemma) are clean for commercial use. Llama Community License has revenue-threshold restrictions. Always read the specific license before shipping.

Can I fine-tune an OSS model?

Yes, if the license permits (most do). Fine-tuning services on Together, Fireworks, and DeepInfra run $1 to $5 per million training tokens. Deploy the fine-tune back on the same platform or download the adapter.

Licensing · Open source

Open source LLM pricing

Every open-source LLM with license, cheapest API provider, and rough self-host cost. Weights on HuggingFace, API on OpenRouter, DeepInfra, Together, Fireworks, or self-host.

Models212

Cheapest API$0.00

LicensesApache · MIT · Custom

All pricing Pricing home

What this page is

This page lists every open-source LLM with priced API access. For each, we show the license, cheapest API on the market, and a rough self-host cost estimate based on parameter count. Weights are downloadable from HuggingFace. APIs are offered by OpenRouter, DeepInfra, Together, Fireworks, Groq, Cerebras, and others. For very high volume, self-hosting on reserved GPUs is cheaper; for anything under 100M tokens per day, API access usually wins on total cost.

Open source models · ranked by API price

Cheapest input price first.

#	Model	License	Cheapest API	Params	Self-host est	Weights
1	Gemma 3 12B (free)	Gemma License	$0.00/M	-	-	HF
2	Gemma 3 27B (free)	Gemma License	$0.00/M	27B	~$1.50/hr (1x A100)	HF
3	Gemma 3 4B (free)	Gemma License	$0.00/M	-	-	HF
4	Gemma 3n 2B (free)	Gemma License	$0.00/M	-	-	HF
5	Gemma 3n 4B (free)	Gemma License	$0.00/M	-	-	HF
6	Gemma 4 26B A4B (free)	Apache 2.0	$0.00/M	27B	~$1.50/hr (1x A100)	HF
7	Gemma 4 31B (free)	Apache 2.0	$0.00/M	33B	~$1.50/hr (1x A100)	HF
8	GLM 4.5 Air (free)	MIT	$0.00/M	110B	~$12/hr (4x A100)	HF
9	gpt-oss-120b (free)	Apache 2.0	$0.00/M	120B	~$12/hr (4x A100)	HF
10	gpt-oss-20b (free)	Apache 2.0	$0.00/M	22B	~$1.50/hr (1x A100)	HF
11	Hermes 3 405B Instruct (free)	Llama Community	$0.00/M	-	-	HF
12	LFM2.5-1.2B-Instruct (free)	other	$0.00/M	1B	~$0.50/hr (1x L4)	HF
13	LFM2.5-1.2B-Thinking (free)	other	$0.00/M	1B	~$0.50/hr (1x L4)	HF
14	Llama 3.2 3B Instruct (free)	Llama Community	$0.00/M	3B	~$0.50/hr (1x L4)	HF
15	Llama 3.3 70B Instruct (free)	Llama Community	$0.00/M	71B	~$4/hr (2x A100)	HF
16	MiniMax M2.5 (free)	other	$0.00/M	229B	~$30/hr (8x H100)	HF
17	Mistral Small 3.1 24B (free)	Custom OSS	$0.00/M	-	-	-
18	Nemotron 3 Nano 30B A3B (free)	other	$0.00/M	32B	~$1.50/hr (1x A100)	HF
19	Nemotron 3 Super (free)	other	$0.00/M	124B	~$12/hr (4x A100)	HF
20	Nemotron Nano 12B 2 VL (free)	other	$0.00/M	13B	~$1.50/hr (1x A100)	HF
21	Nemotron Nano 9B V2 (free)	other	$0.00/M	9B	~$0.50/hr (1x L4)	HF
22	Qwen3 4B (free)	Custom OSS	$0.00/M	-	-	-
23	Qwen3 Coder 480B A35B (free)	Apache 2.0	$0.00/M	480B	~$30/hr (8x H100)	HF
24	Qwen3 Next 80B A3B Instruct (free)	Apache 2.0	$0.00/M	81B	~$12/hr (4x A100)	HF
25	Qwen3.6 Plus Preview (free)	Custom OSS	$0.00/M	-	-	-
26	Step 3.5 Flash (free)	Custom OSS	$0.00/M	-	-	-
27	Trinity Large Preview (free)	Apache 2.0	$0.00/M	399B	~$30/hr (8x H100)	HF
28	Trinity Mini (free)	Apache 2.0	$0.00/M	-	-	HF
29	Uncensored (free)	Apache 2.0	$0.00/M	24B	~$1.50/hr (1x A100)	HF
30	LFM2-2.6B	Custom OSS	$0.01/M	-	-	-
31	LFM2-8B-A1B	Custom OSS	$0.01/M	-	-	-
32	Granite 4.0 Micro	Apache 2.0	$0.02/M	3B	~$0.50/hr (1x L4)	HF
33	Gemma 3n 4B	Gemma License	$0.02/M	-	-	HF
34	Llama 3.1 8B Instruct	Llama Community	$0.02/M	8B	~$0.50/hr (1x L4)	HF
35	Mistral Nemo	Apache 2.0	$0.02/M	-	-	HF
36	Llama 3.2 1B Instruct	Llama Community	$0.03/M	1B	~$0.50/hr (1x L4)	HF
37	Gemma 2 9B	Gemma License	$0.03/M	9B	~$0.50/hr (1x L4)	HF
38	gpt-oss-20b	Apache 2.0	$0.03/M	22B	~$1.50/hr (1x A100)	HF
39	LFM2-24B-A2B	other	$0.03/M	24B	~$1.50/hr (1x A100)	HF
40	Llama 3 8B Instruct	Llama Community	$0.03/M	8B	~$0.50/hr (1x L4)	HF
41	Qwen2.5 Coder 7B Instruct	Apache 2.0	$0.03/M	8B	~$0.50/hr (1x L4)	HF
42	Qwen-Turbo	Custom OSS	$0.03/M	-	-	-
43	gpt-oss-120b	Apache 2.0	$0.04/M	120B	~$12/hr (4x A100)	HF
44	Gemma 3 12B	Gemma License	$0.04/M	-	-	HF
45	Gemma 3 4B	Gemma License	$0.04/M	-	-	HF
46	Llama 3 8B Lunaris	Llama Community	$0.04/M	8B	~$0.50/hr (1x L4)	HF
47	Nemotron Nano 9B V2	other	$0.04/M	9B	~$0.50/hr (1x L4)	HF
48	Qwen2.5 7B Instruct	Apache 2.0	$0.04/M	8B	~$0.50/hr (1x L4)	HF
49	Trinity Mini	Apache 2.0	$0.04/M	-	-	HF
50	Mistral Small 3	Apache 2.0	$0.05/M	-	-	HF
51	Nemotron 3 Nano 30B A3B	other	$0.05/M	32B	~$1.50/hr (1x A100)	HF
52	Olmo 2 32B Instruct	Apache 2.0	$0.05/M	32B	~$1.50/hr (1x A100)	HF
53	Qwen3 8B	Apache 2.0	$0.05/M	8B	~$0.50/hr (1x L4)	HF
54	Qwen3.5-9B	Apache 2.0	$0.05/M	10B	~$0.50/hr (1x L4)	HF
55	Llama 3.2 3B Instruct	Llama Community	$0.05/M	3B	~$0.50/hr (1x L4)	HF
56	GLM 4.7 Flash	MIT	$0.06/M	31B	~$1.50/hr (1x A100)	HF
57	MythoMax 13B	other	$0.06/M	-	-	HF
58	Qwen3 14B	Apache 2.0	$0.06/M	-	-	HF
59	Phi 4	MIT	$0.07/M	-	-	HF
60	Qwen3.5-Flash	Custom OSS	$0.07/M	-	-	-
61	ERNIE 4.5 21B A3B	Apache 2.0	$0.07/M	-	-	HF
62	ERNIE 4.5 21B A3B Thinking	Apache 2.0	$0.07/M	22B	~$1.50/hr (1x A100)	HF
63	Qwen3 Coder 30B A3B Instruct	Apache 2.0	$0.07/M	31B	~$1.50/hr (1x A100)	HF
64	Qwen3 235B A22B Instruct 2507	Apache 2.0	$0.07/M	-	-	HF
65	gpt-oss-safeguard-20b	Apache 2.0	$0.07/M	-	-	HF
66	Mistral Small 3.2 24B	Apache 2.0	$0.07/M	-	-	HF
67	Gemma 3 27B	Gemma License	$0.08/M	27B	~$1.50/hr (1x A100)	HF
68	Gemma 4 26B A4B	Apache 2.0	$0.08/M	27B	~$1.50/hr (1x A100)	HF
69	Llama 4 Scout	other	$0.08/M	-	-	HF
70	Qwen3 30B A3B	Apache 2.0	$0.08/M	-	-	HF
71	Qwen3 30B A3B Thinking 2507	Apache 2.0	$0.08/M	31B	~$1.50/hr (1x A100)	HF
72	Qwen3 32B	Apache 2.0	$0.08/M	-	-	HF
73	Qwen3 VL 8B Instruct	Apache 2.0	$0.08/M	9B	~$0.50/hr (1x L4)	HF
74	MiMo-V2-Flash	MIT	$0.09/M	310B	~$30/hr (8x H100)	HF
75	Qwen3 30B A3B Instruct 2507	Apache 2.0	$0.09/M	-	-	HF
76	Qwen3 Next 80B A3B Instruct	Apache 2.0	$0.09/M	81B	~$12/hr (4x A100)	HF
77	Tongyi DeepResearch 30B A3B	Apache 2.0	$0.09/M	31B	~$1.50/hr (1x A100)	HF
78	Qwen3 Next 80B A3B Thinking	Apache 2.0	$0.10/M	-	-	HF
79	Devstral Small 1.1	Apache 2.0	$0.10/M	-	-	HF
80	Llama 3.3 70B Instruct	Llama Community	$0.10/M	71B	~$4/hr (2x A100)	HF
81	Llama 3.3 Nemotron Super 49B V1.5	other	$0.10/M	50B	~$4/hr (2x A100)	HF
82	Ministral 3 3B 2512	Apache 2.0	$0.10/M	-	-	HF
83	Mistral Small Creative	Custom OSS	$0.10/M	-	-	-
84	Nemotron 3 Super	other	$0.10/M	124B	~$12/hr (4x A100)	HF
85	Reka Edge	other	$0.10/M	7B	~$0.50/hr (1x L4)	HF
86	Reka Flash 3	Apache 2.0	$0.10/M	21B	~$1.50/hr (1x A100)	HF
87	Step 3.5 Flash	Apache 2.0	$0.10/M	199B	~$12/hr (4x A100)	HF
88	UI-TARS 7B	Apache 2.0	$0.10/M	8B	~$0.50/hr (1x L4)	HF
89	Voxtral Small 24B 2507	Apache 2.0	$0.10/M	24B	~$1.50/hr (1x A100)	HF
90	Qwen3 VL 32B Instruct	Apache 2.0	$0.10/M	33B	~$1.50/hr (1x A100)	HF
91	Mistral 7B Instruct v0.1	Apache 2.0	$0.11/M	-	-	HF
92	Qwen3 VL 8B Thinking	Apache 2.0	$0.12/M	9B	~$0.50/hr (1x L4)	HF
93	MiniMax M2.5	other	$0.12/M	229B	~$30/hr (8x H100)	HF
94	Qwen2.5 72B Instruct	other	$0.12/M	73B	~$4/hr (2x A100)	HF
95	Gemma 4 31B	Apache 2.0	$0.13/M	33B	~$1.50/hr (1x A100)	HF
96	GLM 4.5 Air	MIT	$0.13/M	110B	~$12/hr (4x A100)	HF
97	Hermes 4 70B	Llama Community	$0.13/M	-	-	HF
98	Qwen3 VL 30B A3B Instruct	Apache 2.0	$0.13/M	-	-	HF
99	Qwen3 VL 30B A3B Thinking	Apache 2.0	$0.13/M	31B	~$1.50/hr (1x A100)	HF
100	DeepSeek V3.1 Nex N1	Apache 2.0	$0.14/M	671B	~$60+/hr (multi-node)	HF
101	Qwen VL Plus	Custom OSS	$0.14/M	-	-	-
102	ERNIE 4.5 VL 28B A3B	Apache 2.0	$0.14/M	29B	~$1.50/hr (1x A100)	HF
103	Hermes 2 Pro - Llama-3 8B	Llama Community	$0.14/M	8B	~$0.50/hr (1x L4)	HF
104	Hunyuan A13B Instruct	other	$0.14/M	-	-	HF
105	Qwen3 235B A22B Thinking 2507	Apache 2.0	$0.15/M	-	-	HF
106	DeepSeek V3.1	MIT	$0.15/M	-	-	HF
107	Llama 4 Maverick	other	$0.15/M	402B	~$30/hr (8x H100)	HF
108	Ministral 3 8B 2512	Apache 2.0	$0.15/M	9B	~$0.50/hr (1x L4)	HF
109	Mistral Small 4	Apache 2.0	$0.15/M	119B	~$12/hr (4x A100)	HF
110	Olmo 3 32B Think	Apache 2.0	$0.15/M	1M	~$0.50/hr (1x L4)	HF
111	Olmo 3.1 32B Think	Custom OSS	$0.15/M	-	-	-
112	Qwen3 Coder Next	Apache 2.0	$0.15/M	80B	~$4/hr (2x A100)	HF
113	QwQ 32B	Apache 2.0	$0.15/M	33B	~$1.50/hr (1x A100)	HF
114	Rnj 1 Instruct	Apache 2.0	$0.15/M	8B	~$0.50/hr (1x L4)	HF
115	Qwen3.5-35B-A3B	Apache 2.0	$0.16/M	36B	~$4/hr (2x A100)	HF
116	Rocinante 12B	other	$0.17/M	-	-	HF
117	Llama Guard 4 12B	other	$0.18/M	-	-	HF
118	Qwen3 Coder Flash	Custom OSS	$0.20/M	-	-	-
119	Qwen3.5-27B	Apache 2.0	$0.20/M	28B	~$1.50/hr (1x A100)	HF
120	DeepSeek V3 0324	MIT	$0.20/M	685B	~$60+/hr (multi-node)	HF
121	INTELLECT-3	MIT	$0.20/M	107B	~$12/hr (4x A100)	HF
122	LongCat Flash Chat	MIT	$0.20/M	562B	~$60+/hr (multi-node)	HF
123	MiniMax-01	Custom OSS	$0.20/M	-	-	HF
124	Ministral 3 14B 2512	Apache 2.0	$0.20/M	-	-	HF
125	Nemotron Nano 12B 2 VL	other	$0.20/M	13B	~$1.50/hr (1x A100)	HF
126	Olmo 3.1 32B Instruct	Apache 2.0	$0.20/M	-	-	HF
127	Qwen2.5 VL 32B Instruct	Apache 2.0	$0.20/M	33B	~$1.50/hr (1x A100)	HF
128	Qwen3 VL 235B A22B Instruct	Apache 2.0	$0.20/M	236B	~$30/hr (8x H100)	HF
129	Saba	Custom OSS	$0.20/M	-	-	-
130	DeepSeek V3.1 Terminus	MIT	$0.21/M	-	-	HF
131	Qwen3 Coder 480B A35B	Apache 2.0	$0.22/M	480B	~$30/hr (8x H100)	HF
132	Trinity Large Thinking	Apache 2.0	$0.22/M	399B	~$30/hr (8x H100)	HF
133	Llama 3.2 11B Vision Instruct	Llama Community	$0.24/M	11B	~$1.50/hr (1x A100)	HF
134	MiniMax M2	other	$0.26/M	229B	~$30/hr (8x H100)	HF
135	DeepSeek V3.2	MIT	$0.26/M	685B	~$60+/hr (multi-node)	HF
136	Qwen Plus 0728	Custom OSS	$0.26/M	-	-	-
137	Qwen Plus 0728 (thinking)	Custom OSS	$0.26/M	-	-	-
138	Qwen-Plus	Custom OSS	$0.26/M	-	-	-
139	Qwen3 VL 235B A22B Thinking	Apache 2.0	$0.26/M	236B	~$30/hr (8x H100)	HF
140	Qwen3.5 Plus 2026-02-15	Custom OSS	$0.26/M	-	-	-
141	Qwen3.5-122B-A10B	Apache 2.0	$0.26/M	125B	~$12/hr (4x A100)	HF
142	DeepSeek V3.2 Exp	MIT	$0.27/M	-	-	HF
143	ERNIE 4.5 300B A47B	Apache 2.0	$0.28/M	-	-	HF
144	MiniMax M2.1	other	$0.29/M	229B	~$30/hr (8x H100)	HF
145	R1 Distill Qwen 32B	MIT	$0.29/M	33B	~$1.50/hr (1x A100)	HF
146	Codestral 2508	Custom OSS	$0.30/M	-	-	-
147	Cydonia 24B V4.1	Custom OSS	$0.30/M	24B	~$1.50/hr (1x A100)	HF
148	DeepSeek R1T2 Chimera	MIT	$0.30/M	-	-	HF
149	GLM 4.6V	MIT	$0.30/M	-	-	HF
150	Hermes 3 70B Instruct	Llama Community	$0.30/M	71B	~$4/hr (2x A100)	HF
151	MiniMax M2.7	Custom OSS	$0.30/M	-	-	-
152	DeepSeek V3	Custom OSS	$0.32/M	685B	~$60+/hr (multi-node)	HF
153	Qwen3.6 Plus	Custom OSS	$0.33/M	-	-	-
154	Mistral Small 3.1 24B	Apache 2.0	$0.35/M	-	-	HF
155	Kimi K2.5	other	$0.38/M	1.1T	~$60+/hr (multi-node)	HF
156	GLM 4.6	MIT	$0.39/M	357B	~$30/hr (8x H100)	HF
157	GLM 4.7	MIT	$0.39/M	358B	~$30/hr (8x H100)	HF
158	Qwen3.5 397B A17B	Apache 2.0	$0.39/M	403B	~$30/hr (8x H100)	HF
159	DeepSeek V3.2 Speciale	MIT	$0.40/M	-	-	HF
160	Devstral 2 2512	other	$0.40/M	125B	~$12/hr (4x A100)	HF
161	Devstral Medium	Custom OSS	$0.40/M	-	-	-
162	Kimi K2 0905	other	$0.40/M	1.0T	~$60+/hr (multi-node)	HF
163	Llama 3.1 70B Instruct	Llama Community	$0.40/M	71B	~$4/hr (2x A100)	HF
164	Mistral Medium 3	Custom OSS	$0.40/M	-	-	-
165	Mistral Medium 3.1	Custom OSS	$0.40/M	-	-	-
166	UnslopNemo 12B	Custom OSS	$0.40/M	12B	~$1.50/hr (1x A100)	HF
167	ERNIE 4.5 VL 424B A47B	Apache 2.0	$0.42/M	424B	~$30/hr (8x H100)	HF
168	ReMM SLERP 13B	CC BY	$0.45/M	-	-	HF
169	Qwen3 235B A22B	Apache 2.0	$0.46/M	235B	~$30/hr (8x H100)	HF
170	Llama Guard 3 8B	Llama Community	$0.48/M	8B	~$0.50/hr (1x L4)	HF
171	Mistral Large 3 2512	Custom OSS	$0.50/M	-	-	-
172	R1 0528	MIT	$0.50/M	685B	~$60+/hr (multi-node)	HF
173	Llama 3 70B Instruct	Llama Community	$0.51/M	71B	~$4/hr (2x A100)	HF
174	Qwen VL Max	Custom OSS	$0.52/M	-	-	-
175	Mixtral 8x7B Instruct	Apache 2.0	$0.54/M	47B	~$4/hr (2x A100)	HF
176	Skyfall 36B V2	other	$0.55/M	37B	~$4/hr (2x A100)	HF
177	Kimi K2 0711	other	$0.57/M	1.0T	~$60+/hr (multi-node)	HF
178	GLM 4.5	MIT	$0.60/M	358B	~$30/hr (8x H100)	HF
179	GLM 4.5V	MIT	$0.60/M	108B	~$12/hr (4x A100)	HF
180	Kimi K2 Thinking	other	$0.60/M	1.1T	~$60+/hr (multi-node)	HF
181	Llama 3.1 Nemotron Ultra 253B v1	other	$0.60/M	-	-	HF
182	WizardLM-2 8x22B	Custom OSS	$0.62/M	-	-	-
183	Gemma 2 27B	Gemma License	$0.65/M	27B	~$1.50/hr (1x A100)	HF
184	Llama 3.3 Euryale 70B	Llama Community	$0.65/M	71B	~$4/hr (2x A100)	HF
185	Qwen3 Coder Plus	Custom OSS	$0.65/M	-	-	-
186	Qwen2.5 Coder 32B Instruct	Apache 2.0	$0.66/M	33B	~$1.50/hr (1x A100)	HF
187	Aion-1.0-Mini	Apache 2.0	$0.70/M	-	-	HF
188	R1	MIT	$0.70/M	685B	~$60+/hr (multi-node)	HF
189	R1 Distill Llama 70B	MIT	$0.70/M	-	-	HF
190	GLM 5	MIT	$0.72/M	754B	~$60+/hr (multi-node)	HF
191	Qwen3 Max	Custom OSS	$0.78/M	-	-	-
192	Qwen3 Max Thinking	Custom OSS	$0.78/M	-	-	-
193	CodeLLaMa 7B Instruct Solidity	Llama Community	$0.80/M	-	-	HF
194	Llemma 7b	Llama Community	$0.80/M	-	-	HF
195	Qwen2.5 VL 72B Instruct	other	$0.80/M	73B	~$4/hr (2x A100)	HF
196	Llama 3.1 Euryale 70B v2.2	CC BY	$0.85/M	-	-	HF
197	GLM 5.1	MIT	$0.95/M	754B	~$60+/hr (multi-node)	HF
198	Hermes 3 405B Instruct	Llama Community	$1.00/M	-	-	HF
199	Hermes 4 405B	Llama Community	$1.00/M	406B	~$30/hr (8x H100)	HF
200	Qwen-Max	Custom OSS	$1.04/M	-	-	-
201	Llama 3.1 Nemotron 70B Instruct	Llama Community	$1.20/M	71B	~$4/hr (2x A100)	HF
202	Llama 3 Euryale 70B v2.1	CC BY	$1.48/M	-	-	HF
203	Jamba Large 1.7	other	$2.00/M	-	-	HF
204	Mistral Large	Custom OSS	$2.00/M	-	-	-
205	Mistral Large 2407	Custom OSS	$2.00/M	-	-	-
206	Mistral Large 2411	Custom OSS	$2.00/M	-	-	-
207	Mixtral 8x22B Instruct	Apache 2.0	$2.00/M	141B	~$12/hr (4x A100)	HF
208	Pixtral Large 2411	Custom OSS	$2.00/M	-	-	-
209	Command A	CC BY	$2.50/M	111B	~$12/hr (4x A100)	HF
210	Llama 3.1 70B Hanami x1	CC BY	$3.00/M	71B	~$4/hr (2x A100)	HF
211	Magnum v4 72B	Apache 2.0	$3.00/M	73B	~$4/hr (2x A100)	HF
212	Goliath 120B	Llama Community	$3.75/M	118B	~$12/hr (4x A100)	HF

Top 3 open-source LLMs

Gemma 3 12B (free) · Gemma License · cheapest API at $0.00/M input. Self-host estimate: -.

Gemma 3 27B (free) · Gemma License · cheapest API at $0.00/M input. Self-host estimate: ~$1.50/hr (1x A100).

Gemma 3 4B (free) · Gemma License · cheapest API at $0.00/M input. Self-host estimate: -.

Self-host vs API · when does it pay off?

Low volume

Under 1M tokens/day

Always pick API. A single GPU hour wipes out weeks of API spend at this volume.

Mid volume

10M to 100M tokens/day

Depends on model size. Small models (under 30B) are cheaper via API. Large models (200B+) favor dedicated GPUs.

High volume

1B+ tokens/day

Self-host wins, if utilization stays near 100%. Use reserved instances and bundle across workloads.

The price gap · cheapest vs most expensive

Cheapest

Gemma 3 12B (free)

$0.00/M

$ per 1M input tokens

Why the gap

Within OSS, the price range is driven by parameter count (bigger = more expensive to serve) and provider margins. Smaller dense models and heavily quantized serves sit at the bottom.

Most expensive

Goliath 120B

$3.75/M

$ per 1M input tokens

Frequently asked questions

The weights are publicly downloadable under some license (Apache 2.0, MIT, Llama Community, Qwen License, DeepSeek License, etc.). Not every "open" license is actually OSI-compliant · always read the license before commercial use.

Open source LLM pricing

Open source models · ranked by API price

Top 3 open-source LLMs

Self-host vs API · when does it pay off?

The price gap · cheapest vs most expensive

Frequently asked questions

See also

Licensing

Budget tiers

Compare