Beta
Benchmark · Knowledge

Professional Reasoning · Finance

Updated 2026-04-07
Models tested
5
Top score
53.3
Claude Opus 4.6 (Fast)
Median
51.1
min 47.7
Top-5 spread
σ 1.9
settled

Best score over time · one chart, every benchmark

PROFESSIONAL REASONING · FINANCE5 MODELS · FRONTIER RUNNING MAX0255075100SCORE ↑Apr 25Jul 25Oct 25Jan 26Apr 26RELEASE DATE →benchgecko.ai/benchmark/seal-pro-reasoning-finance · frontier
Frontier on Professional Reasoning · Finance rose from 47.7 to 53.3 in 12 months · +5.6 points · latest leader Claude Opus 4.6 (Fast) from Anthropic.
Pink dots = frontier records · 4 totalClick to open model page

Where models cluster

SCORE DISTRIBUTION0–1010–2020–3030–40240–50350–6060–7070–8080–9090–100MEDIAN · 51.1SCORE BUCKET → (0 TO 100)MODELSbenchgecko.ai

Pearson r · original research

Correlation analysis

Benchmarks that track with Professional Reasoning · Finance

Pearson correlation across models scored on both benchmarks. Closer to 1 = strongly predictive.

5 models tested · sorted by score

#ModelScore
1Anthropic logoClaude Opus 4.6 (Fast)53.3
2OpenAI logoGPT-551.3
3OpenAI logoGPT-5 Pro51.1
4OpenAI logoo3 Pro49.1
5OpenAI logoo347.7

Pulled from the Professional Reasoning · Finance dataset · updated daily

What does Professional Reasoning · Finance measure?

Professional Reasoning · Finance is a knowledge benchmark in the BenchGecko catalog. 5 AI models have been tested on it. Scores range from 47.7 to 53.3 out of 100.

Which model leads on Professional Reasoning · Finance?

Claude Opus 4.6 (Fast) from Anthropic leads Professional Reasoning · Finance with a score of 53.3. The median score across 5 tested models is 51.1.

Is Professional Reasoning · Finance saturated?

No · the top score is 53.3 out of 100 (53%). There is still meaningful room for improvement on Professional Reasoning · Finance.

Does Professional Reasoning · Finance predict performance on other benchmarks?

Yes · Professional Reasoning · Finance scores correlate 0.79 with Professional Reasoning · Legal across 5 shared models. Models that do well on Professional Reasoning · Finance tend to do well on Professional Reasoning · Legal.

How often is Professional Reasoning · Finance data refreshed?

BenchGecko pulls updates daily. New model scores on Professional Reasoning · Finance appear as soon as they are published by Epoch AI or the model provider.

Same category · related evaluations