Every AI Chip · Tracked
Every AI chip from every manufacturer and every foundry. Specs, power, cloud pricing, known buyers, and live supply tightness · cross-linked to foundries, memory suppliers, racks, datacenters, and the models trained on them.
- Weighted tightness · 68
- Lead time multiplier · 2.09×
- Capex accelerator · 1.85×
- Frontier holdout premium · +60
Tier Ladder
Frontier · Mainstream · Specialized · Legacy · assessed quarterly
- Frontier · Current-generation flagship silicon shipping at scale to hyperscalers. The chips training the biggest models right now.
- Mainstream · Current-gen non-flagship or last-gen flagship still deployed widely. The workhorses of the AI economy.
- Specialized · Wafer-scale, custom hyperscaler silicon, or experimental architectures serving niche workloads.
- Legacy · Older chips being phased out or already past EOL. Still used in research clusters and price-sensitive workloads.
Performance per Watt
FP16 TFLOPs vs TDP · log scale · bubble size = HBM capacity
Up-and-to-the-left wins. Chips above the 4 TFLOPs/W line are the efficiency frontier · that's where the power-budget game is won at hyperscale. Wafer-scale and multi-die superchips sit top-right because they trade watts for raw throughput.
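To make the metric concrete, here's a minimal sketch that computes FP16 TFLOPs per Watt and flags chips above the 4 TFLOPs/W line. The rows are hypothetical stand-ins, not the live BenchGecko dataset:

```python
# Hypothetical spec rows for illustration · not the live BenchGecko data.
chips = [
    {"name": "chip_a", "fp16_tflops": 4500, "tdp_w": 1000, "hbm_gb": 288},
    {"name": "chip_b", "fp16_tflops": 1300, "tdp_w": 750,  "hbm_gb": 192},
    {"name": "chip_c", "fp16_tflops": 990,  "tdp_w": 700,  "hbm_gb": 80},
]

for chip in chips:
    eff = chip["fp16_tflops"] / chip["tdp_w"]  # TFLOPs per Watt
    tag = "· efficiency frontier" if eff >= 4.0 else ""
    print(f'{chip["name"]:8} {eff:5.2f} TFLOPs/W {tag}')
```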
Chip Demand Ladder
How tight is supply right now · hand-curated from order books + earnings calls
Supply notes
- GB200 · Sold out through 2027 · Oracle, Microsoft, Meta competing
- GB300 · xAI Memphis + Stargate phase 2
- B200 · CoWoS-L packaging capacity is the bottleneck
- Ascend 910C · China export-ban alternative · SMIC N7 constrained
- WSE-3 · G42 Condor Galaxy absorbing most wafers
- B300 · Ultra variant ramping Q4 2025
This is the per-chip preview of the AI Compute Demand Index landing on /compute. The scale is not capped at 100 · it can climb past 250 when the market is on fire. Updated weekly from hyperscaler order books and earnings disclosures.
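For intuition only, here's a minimal sketch of how the four headline components above could combine into one number. The weighting is an assumption for illustration · the real BenchGecko formula is not published:

```python
from dataclasses import dataclass

@dataclass
class DemandInputs:
    tightness: float        # weighted supply tightness, 0-100+
    lead_time_mult: float   # lead times vs. normal, e.g. 2.09x
    capex_accel: float      # capex growth multiplier, e.g. 1.85x
    holdout_premium: float  # frontier allocation scarcity, e.g. +60

def demand_index(x: DemandInputs) -> float:
    """Hypothetical composite: tightness stretched by lead-time and capex
    pressure, plus the frontier holdout premium. Not capped at 100."""
    pressure = 0.5 * x.lead_time_mult + 0.5 * x.capex_accel
    return x.tightness * pressure + x.holdout_premium

print(round(demand_index(DemandInputs(68, 2.09, 1.85, 60))))  # 194 with today's headline numbers
```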
Best for
Six curated shortlists computed from the live spec data
Most raw FP16 throughput
Top 5 by FP16/BF16 TFLOPs · ignore price and power · pure compute
Most efficient (TFLOPs/W)
Best FP16 TFLOPs per Watt · the chips that make datacenter power budgets work
Most HBM capacity
Biggest memory for fitting 400B+ parameter models in a single node
Cheapest cloud rental
Lowest $/h in public clouds · best for quick experiments
Current flagship chips
The chips training the biggest frontier models right now
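Each shortlist is a simple sort over the spec table. A minimal sketch of the idea · field names and rows below are hypothetical, not the live schema:

```python
# Hypothetical records · field names are assumptions, not the live schema.
chips = [
    {"name": "chip_a", "fp16_tflops": 4500, "tdp_w": 1000, "hbm_gb": 288, "usd_per_hr": 10.50},
    {"name": "chip_b", "fp16_tflops": 1300, "tdp_w": 750,  "hbm_gb": 192, "usd_per_hr": 3.90},
    {"name": "chip_c", "fp16_tflops": 990,  "tdp_w": 700,  "hbm_gb": 80,  "usd_per_hr": 2.20},
]

def shortlist(rank_key, n=5, ascending=False):
    """Top-n chip names under a ranking function."""
    ordered = sorted(chips, key=rank_key, reverse=not ascending)
    return [c["name"] for c in ordered[:n]]

print(shortlist(lambda c: c["fp16_tflops"]))                 # most raw FP16 throughput
print(shortlist(lambda c: c["fp16_tflops"] / c["tdp_w"]))    # most efficient (TFLOPs/W)
print(shortlist(lambda c: c["hbm_gb"]))                      # most HBM capacity
print(shortlist(lambda c: c["usd_per_hr"], ascending=True))  # cheapest cloud rental
```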
Full leaderboard
Sorted by Gecko score · composite of tier, recency, adoption, perf density · sketched below the table
| # | Maker | Score |
|---|---|---|
| 1 | NVIDIA | 94 |
| 2 | NVIDIA | 86 |
| 3 |  | 83 |
| 4 | AWS | 78 |
| 5 | NVIDIA | 75 |
| 6 | AMD | 75 |
| 7 | Huawei | 75 |
| 8 |  | 71 |
| 9 | NVIDIA | 70 |
| 10 | AMD | 68 |
| 11 | AMD | 65 |
| 12 | Cerebras | 60 |
| 13 | Intel | 57 |
| 14 | NVIDIA | 55 |
| 15 | NVIDIA | 55 |
| 16 | Meta | 52 |
| 17 |  | 49 |
| 18 | Microsoft | 42 |
| 19 | Groq | 34 |
| 20 | NVIDIA | 31 |
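The exact Gecko-score weighting isn't published. As a hedged illustration of how tier, recency, adoption, and perf density could combine on a 0-100 scale (all weights and field names below are assumptions):

```python
from datetime import date

TIER_POINTS = {"frontier": 40, "mainstream": 30, "specialized": 20, "legacy": 10}

def gecko_score(tier: str, released: date, adoption: float, tflops_per_w: float) -> float:
    """Hypothetical composite, max 100: tier (40) + recency (20) +
    adoption share (25) + perf density (15). Not the real formula."""
    age_years = (date.today() - released).days / 365.25
    recency = max(0.0, 20 - 5 * age_years)  # decays 5 pts per year, floor at 0
    density = min(15.0, 5 * tflops_per_w)   # saturates at 3 TFLOPs/W
    return TIER_POINTS[tier] + recency + 25 * adoption + density

# adoption = share of tracked hyperscaler deployments, in [0, 1]
print(round(gecko_score("frontier", date(2025, 3, 1), 0.9, 4.5)))
```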
By manufacturer
10 companies · 3 foundries · 2 countries
Frequently asked
Pulled from the live dataset · schema-ready for AEO
Which AI chip has the most FP16 TFLOPs right now?
NVIDIA GB300 (Grace Blackwell Ultra) leads dense FP16 throughput at ~5,000 TFLOPs per superchip, followed by GB200 at 4,500 TFLOPs. The Cerebras WSE-3 reaches 125,000 TFLOPs per wafer but is a fundamentally different architecture · a single wafer contains what would otherwise be ~50 GPUs.
Which AI chip is most efficient per Watt?
Google TPU v7 (Ironwood) and NVIDIA B200 lead FP16 TFLOPs per Watt among shipping chips. Trillium (TPU v6e) is close behind. Inference-first designs (Ironwood, Groq LPU) tend to dominate perf/Watt because they skip training-specific circuitry.
Who manufactures AI chips besides NVIDIA?
10 companies ship production AI accelerators tracked on BenchGecko · NVIDIA, AMD, Google, AWS, Intel, Microsoft, Meta, Huawei, Cerebras, and Groq. NVIDIA dominates third-party sales, but hyperscalers are shipping more custom silicon (Trainium2, TPU v7, Maia 100, MTIA) to reduce dependence.
Which fabs make AI chips?
90% of tracked chips are fabbed at TSMC. GlobalFoundries handles Groq's 14nm LPU. SMIC fabricates Huawei's Ascend 910C on its N7 node, China's primary answer to the U.S. export ban. TSMC's advanced nodes (N5/N4/N3) are the bottleneck for current-gen NVIDIA, AMD, AWS, Google, and Microsoft silicon.
How do you decide which chip is "frontier" tier?
Frontier means current-generation flagship, shipping in 2025-2026, widely ordered by hyperscalers. Mainstream means current-gen non-flagship or last-gen flagship still deployed (H100, H200, MI300X). Specialized means wafer-scale, LPU, MTIA and other custom designs. Legacy means EOL or phasing out (A100, V100). Tiers are reviewed quarterly.
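Those rules are mechanical enough to encode. A minimal sketch, with assumed field names, that mirrors the quarterly review logic described above:

```python
def assign_tier(chip: dict) -> str:
    """Encodes the stated tier rules · field names are assumptions."""
    if chip["form_factor"] in {"wafer-scale", "lpu"} or chip["custom_silicon"]:
        return "specialized"
    if chip["eol"]:
        return "legacy"
    if chip["current_gen"] and chip["flagship"]:
        return "frontier"
    return "mainstream"  # current-gen non-flagship or last-gen flagship
```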
Where does BenchGecko get chip data from?
Manufacturer datasheets for specs, SEC 10-K filings and earnings transcripts for buyer disclosures, cloud provider API pricing for $/hour, public press releases for release dates. Every chip detail page includes source links. No paywalled data · no vendor-sponsored rankings · everything is reproducible.
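"Schema-ready for AEO" means each Q&A above can be emitted as schema.org FAQPage structured data. A minimal sketch of that serialization (the Python scaffolding is an assumption; the JSON-LD shape follows schema.org):

```python
import json

# Two of the Q&As above, trimmed · the full page would include all of them.
faqs = [
    ("Which AI chip has the most FP16 TFLOPs right now?",
     "NVIDIA GB300 leads dense FP16 throughput at ~5,000 TFLOPs per superchip."),
    ("Which fabs make AI chips?",
     "90% of tracked chips are fabbed at TSMC; GlobalFoundries and SMIC handle the rest."),
]

faq_schema = {
    "@context": "https://schema.org",
    "@type": "FAQPage",
    "mainEntity": [
        {"@type": "Question", "name": q,
         "acceptedAnswer": {"@type": "Answer", "text": a}}
        for q, a in faqs
    ],
}

print(json.dumps(faq_schema, indent=2))
```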
See also
Keep exploring the compute graph