AI Model Pricing

Every model available on OpenClaw Launch. Switch models anytime, even after deployment.

GPT-5.5

OpenAI

OpenAI's latest frontier model. Excels at complex reasoning, coding, and multimodal analysis with 1M+ context.

Input: $5/M

Output: $30/M

GPT-5.4 Mini

OpenAI

Fast and efficient version of GPT-5.4 with vision support. Great for high-throughput workloads and computer use tasks.

Input: $0.75/M

Output: $4.50/M

Claude Opus 4.8

Anthropic

Most Capable

Anthropic's newest and most capable model. Best for nuanced analysis, agentic coding, and complex multi-step tasks.

Input: $5/M

Output: $25/M

Claude Opus 4.7

Anthropic

Frontier-tier reasoning and coding with deep agentic reliability. A step below Opus 4.8 at the same price point.

Input: $5/M

Output: $25/M

Claude Opus 4.6

Anthropic

Previous-generation Opus flagship. Still excellent for nuanced analysis, creative writing, and complex multi-step tasks.

Input: $5/M

Output: $25/M

Claude Sonnet 4.6

Anthropic

Recommended

The best balance of intelligence, speed, and cost. Ideal default for most use cases.

Input: $3/M

Output: $15/M

Gemini 3.1 Pro

Google

New

Google's frontier reasoning model with enhanced software engineering performance and improved agentic reliability.

Input: $2/M

Output: $12/M

Gemini 3.5 Flash

Google

New

Google's newest mid-tier multimodal model. A bigger step up in reasoning from Gemini 3 Flash, with vision and video input.

Input: $1.50/M

Output: $9/M

Gemini 3 Flash

Google

Google's fast and affordable multimodal model. Great for vision tasks, quick reasoning, and high-throughput workloads.

Input: $0.50/M

Output: $3/M

Gemini 3.1 Flash Lite

Google

Cheapest

Ultra-efficient and fast. Great for high-volume tasks, quick responses, and the most cost-efficient workflows.

Input: $0.25/M

Output: $1.50/M

Gemini 2.5 Flash

Input: $0.30/M

Output: $2.50/M

Gemini 2.5 Flash Lite

Input: $0.10/M

Output: $0.40/M

Grok 4.20

Input: $2/M

Output: $6/M

Grok 4.1 Fast

Input: $0.20/M

Output: $0.50/M

DeepSeek V4 Flash

DeepSeek

New

Ultra-low-cost flash model with 1M context. A strong choice for budget-conscious chat and agent workflows.

Input: $0.10/M

Output: $0.20/M

DeepSeek V4 Pro

DeepSeek

New

DeepSeek's frontier model. 1M context, strong coding and reasoning at a fraction of the cost of other premium models.

Input: $0.44/M

Output: $0.87/M

Hunyuan 3

Tencent

New

Tencent's Hunyuan 3 preview: a very low-cost large model with a 256K context. One of the lowest-cost capable models on OpenRouter.

Input: $0.06/M

Output: $0.21/M

Kimi K2.6

Moonshot AI

Next-gen multimodal model for long-horizon coding, UI/UX generation, and multi-agent orchestration.

Input: $0.60/M

Output: $2.80/M

GLM 5.1

Z.ai

Coding-focused model with 202K context. Strong at long-horizon agentic tasks and tool use.

Input: $0.95/M

Output: $3.15/M

MiniMax M3

Input: $0.30/M

Output: $1.20/M

MiniMax M2.7

MiniMax

Best Value

Self-evolving model with strong coding and reasoning. Affordable pricing with a 200K context window.

Input: $0.30/M

Output: $1.20/M

Step 3.7 Flash

StepFun

New

StepFun's newest flash model with vision and video input. Stronger reasoning than Step 3.5 Flash while staying low-cost.

Input: $0.20/M

Output: $1.15/M

Step 3.5 Flash

StepFun

Sparse MoE with 196B total params, 11B active per token. StepFun's most capable open-source model at an ultra-low price.

Input: $0.10/M

Output: $0.30/M

MiMo V2.5 Pro

Xiaomi

New

Xiaomi's newest flagship, tuned for agentic workflows. Strong coding and orchestration with a 1M context window.

Input: $0.44/M

Output: $0.87/M

MiMo V2.5

Xiaomi

Multimodal MiMo with vision and video input at an ultra-low price. Great for high-volume agent and chat workloads.

Input: $0.14/M

Output: $0.28/M

MiMo V2 Pro

Xiaomi

Xiaomi's flagship 1T-parameter model optimized for agentic workflows. Strong at coding, reasoning, and complex orchestration tasks.

Input: $1.00/M

Output: $3.00/M

Qwen 3.6 Plus

Alibaba Cloud

New

Hybrid linear-attention + sparse MoE flagship. Strong at agentic coding and front-end tasks (78.8 on SWE-bench Verified) with 1M context and native reasoning mode.

Input: $0.33/M

Output: $1.95/M

Gemma 4 26B A4B

Google

Open-weight multimodal Gemma model with vision and video input. MoE architecture for efficient inference.

Input: $0.12/M

Output: $0.40/M

Gemma 4 31B

Google

Larger Gemma variant with vision and video support. Great balance of quality and cost for multimodal tasks.

Input: $0.14/M

Output: $0.40/M

Free Models Router

OpenRouter

Free

Auto-routes each request to a working free model on OpenRouter. When one free model goes dark, the router picks another, which makes it the most reliable way to stay on the free tier.

Input: Free

Output: Free

Nemotron 3 Super

NVIDIA

Free

Hybrid Mamba-Transformer MoE model with 120B params, 12B active. 1M context window at zero cost. Great for complex agent tasks.

Input: Free

Output: Free

custom/Qwen3.6 Plus

Input: Free

Output: Free

custom/Qwen3.7 Plus

Input: Free

Output: Free

custom/MiniMax M2.7

Input: Free

Output: Free

custom/MiniMax M3

Input: Free

Output: Free

custom/Kimi K2.6

Input: Free

Output: Free

custom/DeepSeek V4 Pro

Input: Free

Output: Free

custom/DeepSeek V4 Flash

Input: Free

Output: Free

custom/GLM 5.2

Input: Free

Output: Free
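
To compare the listed prices for a concrete workload, the arithmetic can be sketched as follows. This is a minimal illustration, not an official calculator: it assumes "/M" means US dollars per one million tokens, it copies only four of the prices listed above, and the `estimate_cost` helper and the example token counts are hypothetical. Actual billing may differ.

```python
from decimal import Decimal

# (input, output) prices in USD per million tokens, copied from the listing above.
# Assumption: "/M" on this page means "per one million tokens".
PRICES_PER_MILLION = {
    "GPT-5.5": (Decimal("5"), Decimal("30")),
    "Claude Sonnet 4.6": (Decimal("3"), Decimal("15")),
    "Gemini 3.1 Flash Lite": (Decimal("0.25"), Decimal("1.50")),
    "DeepSeek V4 Flash": (Decimal("0.10"), Decimal("0.20")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload (hypothetical helper, not a platform API)."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # Illustrative workload: 2M input tokens and 500K output tokens.
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, 2_000_000, 500_000)
        print(f"{name}: ${cost:.2f}")
```

Because output tokens cost several times more than input tokens on most models here, a workload that generates long responses shifts the comparison more than one that mostly reads long prompts.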

Search Models

Perplexity-powered web search with real-time citations. Used automatically when your bot needs current information.

Sonar

Perplexity

Default

Fast web search with citations. Good for quick factual lookups.

Input: $1/M

Output: $1/M

Image Generation Models

Generate and edit images directly from chat. Bring your own provider key (OpenAI, Google, or ByteDance) to unlock.

GPT-5.4 Image 2

OpenAI

New

GPT-5.4 reasoning paired with GPT Image 2 generation. Best-in-class prompt adherence and text rendering.

Per image: $0.04

Model ID: openai/gpt-5.4-image-2

Gemini 3.1 Flash Image

Google

Fastest

Fast Nano-Banana-class image generation and editing. Strong at photoreal output and low-latency workflows.

Per image: $0.04

Model ID: google/gemini-3.1-flash-image-preview

Seedream 4.5

ByteDance Seed

Best Value

ByteDance Seed's latest text-to-image model. Sharp detail, native Chinese prompt support, and competitive per-image pricing.

Per image: $0.03

Model ID: bytedance-seed/seedream-4.5
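
Per-image pricing makes batch costs simple to estimate. The sketch below multiplies the listed per-image price by an image count and picks the cheapest listed model. The helper names are hypothetical, and since these models require your own provider key, the amount your provider actually bills may differ from the figures shown on this page.

```python
from decimal import Decimal

# Per-image prices in USD, keyed by the model IDs listed above.
PER_IMAGE_USD = {
    "openai/gpt-5.4-image-2": Decimal("0.04"),
    "google/gemini-3.1-flash-image-preview": Decimal("0.04"),
    "bytedance-seed/seedream-4.5": Decimal("0.03"),
}


def image_batch_cost(model_id: str, image_count: int) -> Decimal:
    """Estimate the USD cost of generating image_count images (hypothetical helper)."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    return PER_IMAGE_USD[model_id] * image_count


def cheapest_model() -> str:
    """Return the model ID with the lowest listed per-image price."""
    return min(PER_IMAGE_USD, key=PER_IMAGE_USD.get)


if __name__ == "__main__":
    # Illustrative batch size of 1,000 images.
    for model_id in PER_IMAGE_USD:
        print(f"{model_id}: ${image_batch_cost(model_id, 1000):.2f} per 1,000 images")
    print("Lowest listed price:", cheapest_model())
```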

Intelligence Ranking

Ranked by the Artificial Analysis Intelligence Index, with detailed scores from major independent benchmarks.

#	Model	AA Index	GDPval-AA	Arena Elo	GPQA	SWE-bench	AIME	BrowseComp	tau2-Bench
		Intelligence (0–100)	Real-world tasks	Human preference	Science reasoning	Coding & bugs	Math	Web research	Agent tool use
1	Claude Opus 4.8	61	—	—	—	—	—	—	—
2	GPT-5.5	60	1782	1485	92.8	—	—	82.7	—
3	Claude Opus 4.7	57	—	—	—	—	—	—	—
4	Gemini 3.1 Pro	57	1314	1500	94.3	80.6%	—	—	—
5	Claude Opus 4.6	—	1619	1504	91.3	80.8%	—	86.8	—
6	Gemini 3.5 Flash	55	—	—	—	—	—	—	—
7	Kimi K2.6	54	1486	—	—	—	—	—	—
8	MiMo V2.5 Pro	54	—	—	—	—	—	—	—
9	Claude Sonnet 4.6	52	1676	1446	89.9	79.6%	—	—	—
10	DeepSeek V4 Pro	52	1558	—	—	—	—	—	—
11	GLM 5.1	51	1535	—	—	—	—	—	—
12	MiniMax M2.7	50	1514	—	—	78.0%	—	—	—
13	Qwen 3.6 Plus	50	1298	—	—	78.8%	—	—	—
14	MiMo V2.5	49	—	—	—	—	—	—	—
15	GPT-5.4 Mini	49	1435	—	—	54.4%	—	—	—
16	MiMo V2 Pro	—	1418	—	—	—	—	—	—
17	DeepSeek V4 Flash	47	1414	—	—	—	—	—	—
18	Step 3.7 Flash	44	—	—	—	—	—	—	—
19	Hunyuan 3	42	—	—	—	—	—	—	—
20	Gemini 3 Flash	—	1119	1473	90.4	—	—	—	—
21	Gemma 4 31B	39	1117	1452	84.3	—	89.2	—	—
22	Step 3.5 Flash	38	1073	—	—	74.4%	97.3	—	88.2%
23	Gemma 4 26B A4B	—	1012	—	—	—	—	—	—
24	Nemotron 3 Super	—	1004	—	79.2	60.5%	90.2	—	—
25	Gemini 3.1 Flash Lite	—	927	1432	86.9	—	—	—	—
26	Free Models Router	—	—	—	—	—	—	—	—

Ready to deploy?

Pick any model and launch your AI chatbot in under 30 seconds.
