Number of output tokens generated per second via API. Higher values mean faster responses. Median measured by Artificial Analysis.
Google
Gemini 2.5 Flash
ChatGPT
GPT-5.4 Nano
Gemini 3.1 Flash Lite
Amazon
Nova 2 Lite
Gemini 3 Flash
Grok
Grok 4.20
GPT-5.4 Mini
Grok 4.20 (Reasoning)
GPT-5.4
Mistral AI
Mistral Small 4
GPT-5 Nano
Meta AI
Llama 4 Scout
Gemini 3.1 Pro
Nano Banana
Llama 4 Maverick
Gemini 2.5 Pro
Meituan
Longcat Flash Chat
Grok 4.1 Fast (Reasoning)
Gemini 2.5 Flash Lite
Anthropic
Claude Haiku 4.5
Grok 4.1 Fast
GPT-4.1
GPT OSS 120B
Nano Banana 2
NVIDIA
Nemotron 3 Super
GPT-5
GPT-5 Mini
Arcee AI
Trinity Large Thinking
MiniMax
MiniMax M2.5
Xiaomi
MiMo-V2-Pro
GPT-4o Mini Transcribe
Claude Opus 4.5
Z.ai
GLM-5
Qwen
Qwen3.5 397B A17B
Qwen3.6 Plus
DeepSeek
DeepSeek V3.2
Claude Opus 4.6
Claude Sonnet 4.6
Claude Sonnet 4
Claude Sonnet 4.5
MiniMax M2.7
GLM-5.1
Claude Opus 4
Claude Opus 4.1
GPT-4o Transcribe
Moonshot AI
Kimi K2.5
Nano Banana Pro
Baidu
ERNIE 4.5 300B A47B
Gemma 4 31B
GPT-5.4 Pro