How much does INTELLECT-1 cost?

INTELLECT-1 pricing information is not yet available.

What benchmarks has INTELLECT-1 been tested on?

INTELLECT-1 has been evaluated on 12 benchmarks. Top scores: HellaSwag: 61.9, ARC AI2: 39.4, GSM8K: 38.6.

Is INTELLECT-1 open source?

No, INTELLECT-1 is a proprietary model by Unknown.

How does INTELLECT-1 compare to Llama 3.2 1B Instruct?

INTELLECT-1 has an average score of 19.8 while Llama 3.2 1B Instruct scores 19.9. Llama 3.2 1B Instruct slightly outperforms INTELLECT-1 overall. See full comparison →

Home/Models/INTELLECT-1

INTELLECT-1

Name: INTELLECT-1
Author: Unknown

by Unknown · Released Jan 2024

19.8

avg score

Rank #206

Compare

Better than 12% of all models

Context

N/A

Input $/1M

TBD

Output $/1M

TBD

Type

text

License

Proprietary

Benchmarks

12 tested

Data updated today

About

Tested on 12 benchmarks with 20.2% average. Top scores: HellaSwag (61.9%), ARC AI2 (39.4%), GSM8K (38.6%).

Capabilities

reasoning

8.6

#144 globally

math

19.3

#166 globally

knowledge

27.9

#181 globally

language

17.6

#148 globally

general

1.0

#73 globally

Benchmark Scores

Compare All

Tested on 12 benchmarks · Ranked across 5 categories

Score Distribution (all 233 models)

0255075100

▲ You are here

reasoningCompare reasoning →

BBH

BIG-Bench Hard. 23 challenging tasks from BIG-Bench where prior language models fell below average human performance.

13.1—

MUSR

HuggingFace MuSR (Multi-Step Reasoning). Tests multi-hop reasoning requiring chaining multiple facts together.

4.1—

mathCompare math →

GSM8K

Grade school math word problems. 8,500 problems testing multi-step arithmetic reasoning. A foundational math benchmark.

38.6—

MATH Level 5

HuggingFace evaluation of MATH Level 5 problems. Competition math requiring advanced reasoning and proof construction.

0.0—

knowledgeCompare knowledge →

HellaSwag

Sentence completion requiring commonsense reasoning about physical and social situations. Tests real-world understanding.

61.9—

ARC AI2

AI2 Reasoning Challenge. Grade-school science questions requiring multi-step reasoning. Easy and Challenge sets test different difficulty levels.

39.4—

MMLU

Massive Multitask Language Understanding. 57 subjects from STEM, humanities, and social sciences. The most widely-cited knowledge benchmark.

33.2—

Quick compare:

vs Llama 3.2 1B Instruct

vs QwQ 32B

vs Qwen3 4B Instruct 2507

Excellent (85+) Good (70-85) Average (50-70) Below (<50)

Similar Models

Llama 3.2 1B Instruct

Frequently Asked Questions

INTELLECT-1 is a proprietary text AI model by Unknown, released in January 2024. It has an average benchmark score of 19.8.

Benchmarks

HellaSwag ARC AI2 GSM8K MMLU Winogrande

Unknown · Provider Unknown · Economy All Models Compare Models Pricing Developers · API

INTELLECT-1

Frequently Asked Questions

Related Models

Benchmarks

Related Pages