AI Comparison

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and reasoning benchmarks, and improved robustness to prompt injection. The model is designed to operate efficiently across varied effort levels, enabling developers to trade off speed, depth, and token usage depending on task requirements. It comes with a new parameter to control token efficiency, which can be accessed using the OpenRouter Verbosity parameter with low, medium, or high. Opus 4.5 supports advanced tool use, extended context management, and coordinated multi-agent setups, making it well-suited for autonomous research, debugging, multi-step planning, and spreadsheet/browser manipulation. It delivers substantial gains in structured reasoning, execution reliability, and alignment compared to prior Opus generations, while reducing token overhead and improving performance on long-running tasks.

Author

Claude

Release Date

2025-11-24

Knowledge Cutoff

2025-08-01

License

Proprietary

I/O Format

Context Length

200K / 64K

API I/O (1M)

$5 / $25

How to Use

Claude Pro or above / API Access

Output Speed

51 tok/s

Arena Overall

1474

Intelligence Index

49.7

Coding Index

47.8

Math Index

91.3

LiveBench

53.6

ForecastBench

60.1

GPQA Diamond

86.6%

HLE

28.4%

MMLU-Pro

89.5%

AIME 2025

91.3%

MATH-500

—

LB Reasoning

47.8

LB Math

64.1

LB Data Analysis

44.2

LiveCodeBench

87.1%

LB Coding

78.2

LB Agentic

50.0

TAU2

89.5%

TerminalBench

47.0%

SciCode

49.5%

IFBench

58.0%

AA-LCR

0.7

Hallucination (HHEM)

10.9%

Factual Consistency (HHEM)

89.1%

LB Language

77.2

LB Instruction Following

28.9

View Model Details

1 / 3

좌우로 스와이프하여 전환

Author

Claude

Release Date

2025-11-24

Knowledge Cutoff

2025-08-01

License

Proprietary

I/O Format

Context Length

200K / 64K

API I/O (1M)

$5 / $25

How to Use

Claude Pro or above / API Access

Output Speed

51 tok/s

Arena Overall

1474

Intelligence Index

49.7

Coding Index

47.8

Math Index

91.3

LiveBench

53.6

ForecastBench

60.1

GPQA Diamond

86.6%

HLE

28.4%

MMLU-Pro

89.5%

AIME 2025

91.3%

MATH-500

—

LB Reasoning

47.8

LB Math

64.1

LB Data Analysis

44.2

LiveCodeBench

87.1%

LB Coding

78.2

LB Agentic

50.0

TAU2

89.5%

TerminalBench

47.0%

SciCode

49.5%

IFBench

58.0%

AA-LCR

0.7

Hallucination (HHEM)

10.9%

Factual Consistency (HHEM)

89.1%

LB Language

77.2

LB Instruction Following

28.9

View Model Details