BenchGecko Beta

Llama 3.2 90B

Open source

By Meta · Released 2024-01-01

Average score: 36.1

Input price: N/A

Output price: N/A

Context window: N/A

Type: text

Tested on 6 benchmarks, with an average score of 36.1%. Top scores: MMLU (73.7%), GeoBench (52.0%), and MATH level 5 (39.4%).

Benchmark scores

Benchmark	Category	Score (%)
MMLU	knowledge	73.7
GeoBench	knowledge	52.0
MATH level 5	math	39.4
Balrog	knowledge	27.3
GPQA diamond	knowledge	21.4
OTIS Mock AIME 2024-2025	math	2.5
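
The 36.1% average and the three top scores quoted in the summary can be reproduced from the six table rows. The sketch below assumes the average is a plain unweighted mean rounded half up to one decimal place; the page does not state its formula, and the names `SCORES`, `average_score`, and `top_scores` are illustrative only.

```python
from decimal import Decimal, ROUND_HALF_UP

# Scores (percent) copied from the benchmark table.
# Decimal avoids binary floating-point surprises when rounding 36.05.
SCORES = {
    "MMLU": Decimal("73.7"),
    "GeoBench": Decimal("52.0"),
    "MATH level 5": Decimal("39.4"),
    "Balrog": Decimal("27.3"),
    "GPQA diamond": Decimal("21.4"),
    "OTIS Mock AIME 2024-2025": Decimal("2.5"),
}


def average_score(scores):
    """Unweighted mean of all scores (assumed formula, not confirmed by the page)."""
    return sum(scores.values(), Decimal("0")) / len(scores)


def top_scores(scores, n=3):
    """Return the n highest-scoring benchmark names, best first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:n]]


mean = average_score(SCORES)
# Round half up to one decimal place, matching the displayed figure.
displayed = mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)

print(f"mean = {mean}, displayed = {displayed}")
print("top 3:", top_scores(SCORES))
```

The exact mean is 36.05, so the displayed 36.1 depends on the rounding mode; under round-half-even it would show as 36.0.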

Similar models

Google DeepMind

Google DeepMind

GPT-4o (2024-08-06)

Meta Llama 3.2 timeline

Llama 3.2 11B Vision Instruct · Sep 2024

$0.24/M input · 131K context

Llama 3.2 1B Instruct · Sep 2024

$0.03/M input (-0.22) · 60K context (-71K) · 7 benchmarks

Llama 3.2 3B Instruct · Sep 2024

$0.05/M input (+0.02) · 80K context (+20K) · 7 benchmarks

Llama 3.2 3B Instruct (free) · Sep 2024

$0.00/M input (-0.05) · 131K context (+51K) · 6 benchmarks

Llama 3.2 90B · Jan 2024

N/A input price · N/A context · 6 benchmarks