Meta AI

Llama 4 Maverick

Model ID: meta-llama/llama-4-maverick
Released 2025-04-05 · Open Model
API
Overall: No. 32

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction. Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput.
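As a quick illustration of what "17 billion active parameters (400B total)" means in a mixture-of-experts model, the sketch below computes the share of parameters used in one forward pass. It relies only on the two figures quoted above; the function name `active_fraction` is made up for this example and says nothing about how the experts are actually routed.

```python
# Figures quoted in the description above.
ACTIVE_PARAMS = 17e9    # parameters used per forward pass
TOTAL_PARAMS = 400e9    # parameters across all 128 experts plus shared layers


def active_fraction(active: float, total: float) -> float:
    """Return the share of the model's parameters used in one forward pass."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Active per forward pass: {fraction:.2%}")
```

Under these figures, only a small fraction of the full parameter set is exercised for any single token, which is how an MoE model can keep per-token compute well below what its total size suggests.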

Knowledge Cutoff
2024-08-31

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format
Text, Image → Text (including code)

The types of content this AI can receive, and what it can produce in return.

Context Memory
1.0M IN / 16K OUT

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.
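A minimal sketch of how the limits above might be checked before sending a request. The constants are the rounded figures shown on this page (1.0M input, 16K output); the exact limits enforced by a given provider may differ, and `fits_context` is a hypothetical helper, not part of any API.

```python
# Rounded limits from this page; a provider's exact values may differ.
MAX_INPUT_TOKENS = 1_000_000   # "1.0M IN"
MAX_OUTPUT_TOKENS = 16_000     # "16K OUT"


def fits_context(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if both token counts are within the listed limits."""
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens <= MAX_INPUT_TOKENS
        and requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


print(fits_context(250_000, 4_000))    # within both limits
print(fits_context(1_200_000, 1_000))  # input exceeds the listed limit
```

Token counts depend on the tokenizer, so a real check would count tokens with the model's own tokenizer rather than estimating from characters or words.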

Cost/1M Tokens
$0.15 IN / $0.60 OUT

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).
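The per-token pricing above can be turned into a rough per-request estimate. The sketch below assumes the listed prices ($0.15 per 1M input tokens, $0.60 per 1M output tokens) apply linearly with no minimum charge or volume discount; `estimate_cost` is a hypothetical helper written for this example.

```python
from decimal import Decimal

# Prices listed on this page, in USD per 1 million tokens.
INPUT_PRICE_PER_M = Decimal("0.15")
OUTPUT_PRICE_PER_M = Decimal("0.60")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / ONE_MILLION


# Example: a 200,000-token document summarized into 5,000 tokens.
cost = estimate_cost(200_000, 5_000)
print(f"${cost:.4f}")  # prints $0.0330
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when many requests are summed.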

AI Performance Evaluation

Reasoning Ability
AA Intelligence Index: 18% (↓20%)
MMLU-Pro: 81% (↓2%)
GPQA Diamond: 67% (↓15%)
HLE: 4.8% (↓12%)

Math
AA Math Index: 19% (↓55%)
MATH-500: 89% (↓5%)
AIME 2024: 39% (↓21%)
AIME 2025: 19% (↓55%)

Coding Ability
AA Coding Index: 16% (↓21%)
LiveCodeBench: 40% (↓26%)
SciCode: 33% (↓9%)
TerminalBench: 6.8% (↓27%)

Instruction Following
IFBench: 43% (↓14%)
Hallucination Rate (HHEM): 8.2% (↓3%)
Factual Consistency (HHEM): 92% (↑3%)

Long Context
AA-LCR: 46% (↓18%)

Agentic AI Ability
TAU2: 18% (↓54%)

Speed
Standard Mode: 130 tok/sec (↑52)
First Output: 0.50s
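
The two speed figures above (130 tok/sec and 0.50s to first output) can be combined into a rough response-time estimate. This is a back-of-the-envelope sketch that assumes throughput stays constant for the whole response and that generation time simply adds to the first-output delay; real latency varies with provider, load, and prompt length. The name `estimate_response_seconds` is hypothetical.

```python
# Speed figures listed on this page.
TOKENS_PER_SECOND = 130.0      # standard-mode output speed
FIRST_OUTPUT_SECONDS = 0.50    # delay before the first output arrives


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate total seconds to receive a response of the given length."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return FIRST_OUTPUT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example: a 1,300-token answer.
print(f"{estimate_response_seconds(1_300):.1f} s")  # prints 10.5 s
```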
Artificial Analysis