Claude
Claude

Claude Sonnet 4.5

Model ID:claude-sonnet-4-5-20250929
2025-09-29Proprietary Model
Claude FreeClaude ProClaude Max (5x)Claude Max (20x)|API
OverallNo.10
PopularityNo.25

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with improvements across system design, code security, and specification adherence. The model is designed for extended autonomous operation, maintaining task continuity across sessions and providing fact-based progress tracking. Sonnet 4.5 also introduces stronger agentic capabilities, including improved tool orchestration, speculative parallel execution, and more efficient context and memory management. With enhanced context tracking and awareness of token usage across tool calls, it is particularly well-suited for multi-context and long-running workflows. Use cases span software engineering, cybersecurity, financial analysis, research agents, and other domains requiring sustained reasoning and tool use.

Knowledge Cutoff
2025-01-31

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

The types of content this AI can receive, and what it can produce in return.

Context Memory
1MIN64KOUT

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Words
$3IN$15OUT

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).

AI Performance Evaluation

Arena Overall Score
1452
±3
As of 2026-04-02
Overall Rank
No.25
58,729 Votes
Arena by Ability
Hard Prompts
1485±4No.17
Expert Knowledge
1497±11No.13
Instruction Following
1463±5No.11
Conversation Memory
1468±7No.21
Creative
1442±7No.20
Coding
1519±6No.10
Math
1455±10No.23
Arena by Occupation
Creative Writing
1445±6No.20
Social Sciences
1466±7No.32
Media
1438±6No.18
Business
1453±6No.24
Healthcare
1466±11No.39
Legal
1455±10No.34
Software
1497±5No.19
Mathematics
1468±12No.21
Reasoning Ability
AA Intelligence Index
43%↑4%
MMLU-Pro
88%↑5%
GPQA Diamond
83%↑1%
HLE
17%↑1%
Math
AA Math Index
88%↑14%
AIME 2025
88%↑14%
Coding Ability
AA Coding Index
39%↑2%
LiveCodeBench
71%↑6%
SciCode
45%↑3%
TerminalBench
36%↑2%
Instruction Following
IFBench
57%↑0%
환각률 (HHEM)
12%↑1%
사실 일관성 (HHEM)
88%↓1%
Long Context
AA-LCR
66%↑2%
Agentic AI Ability
TAU2
78%↑7%
Speed
Standard Mode
44tok/sec↓34
First Output 0.80s
Artificial Analysis
Reasoning Mode
49tok/sec↓24
First Output 7.95s
Artificial Analysis