AI Model Pricing

Every model available on OpenClaw Launch. Switch models anytime, even after deployment.

GPT-5.5

OpenAI

OpenAI's latest frontier model. Excels at complex reasoning, coding, and multimodal analysis with 1M+ context.

Input: $5/M
Output: $30/M

GPT-5.4 Mini

OpenAI

Fast and efficient version of GPT-5.4 with vision support. Great for high-throughput workloads and computer use tasks.

Input: $0.75/M
Output: $4.50/M

Claude Opus 4.6

Anthropic
Most Capable

Anthropic's most capable model. Best for nuanced analysis, creative writing, and complex multi-step tasks.

Input: $5/M
Output: $25/M

Claude Sonnet 4.6

Anthropic
Recommended

The best balance of intelligence, speed, and cost. Ideal default for most use cases.

Input: $3/M
Output: $15/M

Gemini 3.1 Pro

Google
New

Google's frontier reasoning model with enhanced software engineering performance and improved agentic reliability.

Input: $2/M
Output: $12/M

Gemini 3 Flash

Google

Google's fast and affordable multimodal model. Great for vision tasks, quick reasoning, and high-throughput workloads.

Input: $0.50/M
Output: $3/M

Gemini 3.1 Flash Lite

Google
Cheapest

Ultra-efficient and fast. Great for high-volume tasks, quick responses, and the most cost-efficient workflows.

Input: $0.25/M
Output: $1.50/M

Grok 4.20

Input: $2/M
Output: $6/M

Grok 4.1 Fast

Input: $0.20/M
Output: $0.50/M

DeepSeek V4 Flash

DeepSeek
New

Ultra-low-cost flash model with 1M context. A strong choice for budget-conscious chat and agent workflows.

Input: $0.14/M
Output: $0.28/M

DeepSeek V4 Pro

DeepSeek
New

DeepSeek's frontier model. 1M context, strong coding and reasoning at a fraction of the cost of other premium models.

Input: $1.74/M
Output: $3.48/M

Kimi K2.6

Moonshot AI

Next-gen multimodal model for long-horizon coding, UI/UX generation, and multi-agent orchestration.

Input: $0.60/M
Output: $2.80/M

GLM 5.1

Z.ai

Coding-focused model with 202K context. Strong at long-horizon agentic tasks and tool use.

Input: $0.95/M
Output: $3.15/M

MiniMax M2.7

MiniMax
Best Value

Self-evolving model with strong coding and reasoning. Affordable pricing with a 200K context window.

Input: $0.30/M
Output: $1.20/M

Step 3.5 Flash

StepFun

Sparse MoE with 196B total params, 11B active per token. StepFun's most capable open-source model at an ultra-low price.

Input: $0.10/M
Output: $0.30/M

MiMo V2 Pro

Xiaomi

Xiaomi's flagship 1T-parameter model optimized for agentic workflows. Strong at coding, reasoning, and complex orchestration tasks.

Input: $1.00/M
Output: $3.00/M

Qwen 3.6 Plus

Alibaba Cloud
New

Hybrid linear-attention + sparse MoE flagship. Strong at agentic coding and front-end tasks (78.8 on SWE-bench Verified) with 1M context and native reasoning mode.

Input: $0.33/M
Output: $1.95/M

Gemma 4 26B A4B

Google

Open-weight multimodal Gemma model with vision and video input. MoE architecture for efficient inference.

Input: $0.12/M
Output: $0.40/M

Gemma 4 31B

Google

Larger Gemma variant with vision and video support. Great balance of quality and cost for multimodal tasks.

Input: $0.14/M
Output: $0.40/M

Free Models Router

OpenRouter
Free

Auto-routes each request to a working free model on OpenRouter. When one free model goes dark, the router picks another, which makes it the most reliable way to stay on the free tier.

Input: Free
Output: Free
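
The fallback behavior described for the Free Models Router can be illustrated with a minimal sketch. This is not OpenRouter's actual routing logic, which this page does not describe; the function name, the exception class, and the placeholder model names are all hypothetical, and the API call is replaced by a stand-in function.

```python
from typing import Callable, Sequence


class AllModelsUnavailable(RuntimeError):
    """Raised when every candidate model fails (hypothetical helper)."""


def route_with_fallback(
    prompt: str,
    models: Sequence[str],
    call_model: Callable[[str, str], str],
) -> tuple[str, str]:
    """Try each model in order; return (model, reply) from the first that works."""
    errors = []
    for model in models:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # a real client would catch narrower errors
            errors.append(f"{model}: {exc}")
    raise AllModelsUnavailable("; ".join(errors))


if __name__ == "__main__":
    # Stand-in for a real API call; "free-model-a" simulates a model that went dark.
    def fake_call(model: str, prompt: str) -> str:
        if model == "free-model-a":
            raise ConnectionError("model offline")
        return f"[{model}] echo: {prompt}"

    print(route_with_fallback("hello", ["free-model-a", "free-model-b"], fake_call))
```

The caller only sees a failure when every candidate is down, which is the property the router description relies on.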

Nemotron 3 Super

NVIDIA
Free

Hybrid Mamba-Transformer MoE model with 120B params, 12B active. 1M context window at zero cost. Great for complex agent tasks.

Input: Free
Output: Free
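
To see what the prices listed above mean for a single request, here is a rough sketch of the arithmetic. It assumes "/M" means per one million tokens and that billing is linear, with no minimums or other fees; the function name and the sample token counts are illustrative, and only three of the listed models are included.

```python
# Prices in USD per 1M tokens (input, output), copied from the listing above.
PRICES_PER_MILLION = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Gemini 3.1 Flash Lite": (0.25, 1.50),
    "DeepSeek V4 Flash": (0.14, 0.28),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (assumes linear billing)."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2,000 input tokens and 500 output tokens per message.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 2_000, 500):.6f} per message")
```

Because output tokens cost several times more than input tokens on most models here, the length of replies tends to dominate the bill for chat workloads.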

Search Models

Perplexity-powered web search with real-time citations. Used automatically when your bot needs current information.

Sonar

Perplexity
Default

Fast web search with citations. Good for quick factual lookups.

Input: $1/M
Output: $1/M

Image Generation Models

Generate and edit images directly from chat. Bring your own provider key (OpenAI, Google, or ByteDance) to unlock.

GPT-5.4 Image 2

OpenAI
New

GPT-5.4 reasoning paired with GPT Image 2 generation. Best-in-class prompt adherence and text rendering.

Per image: $0.04
Model ID: openai/gpt-5.4-image-2

Gemini 3.1 Flash Image

Google
Fastest

Fast Nano-Banana-class image generation and editing. Strong at photoreal output and low-latency workflows.

Per image: $0.04
Model ID: google/gemini-3.1-flash-image-preview

Seedream 4.5

ByteDance Seed
Best Value

ByteDance Seed's latest text-to-image model. Sharp detail, native Chinese prompt support, and competitive per-image pricing.

Per image: $0.03
Model ID: bytedance-seed/seedream-4.5

Intelligence Ranking

All models ranked by capability across major independent benchmarks

#  Model  GDPval-AA  Arena Elo  GPQA  SWE-bench  AIME  BrowseComp  tau2-Bench
GDPval-AA: real-world tasks. Arena Elo: human preference. GPQA: science reasoning. SWE-bench: coding & bugs. AIME: math. BrowseComp: web research. tau2-Bench: agent tool use.
1  GPT-5.5  1782  1485  92.8  82.7
2  Claude Opus 4.6  1619  1504  91.3  80.8%  86.8
3  Claude Sonnet 4.6  1676  1446  89.9  79.6%
4  DeepSeek V4 Pro  1558
5  GLM 5.1  1535
6  MiniMax M2.7  1514  78.0%
7  Kimi K2.6  1486
8  GPT-5.4 Mini  1435  54.4%
9  MiMo V2 Pro  1418
10  DeepSeek V4 Flash  1414
11  Gemini 3.1 Pro  1314  1500  94.3  80.6%
12  Qwen 3.6 Plus  1298  78.8%
13  Gemini 3 Flash  1119  1473  90.4
14  Gemma 4 31B  1117  1452  84.3  89.2
15  Step 3.5 Flash  1073  74.4%  97.3  88.2%
16  Gemma 4 26B A4B  1012
17  Nemotron 3 Super  1004  79.2  60.5%  90.2
18  Gemini 3.1 Flash Lite  927  1432  86.9
19  Free Models Router
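
Because many models have no score on a given benchmark, re-ranking on a single column needs a rule for the gaps. The sketch below sorts a handful of models by SWE-bench, assuming that the first percent-suffixed value in a row is that model's SWE-bench score; the function name is illustrative, and only a subset of rows is included.

```python
from typing import Optional

# SWE-bench scores (percent) for a subset of the rows above.
# None marks a model whose row lists no SWE-bench score.
SWE_BENCH: dict[str, Optional[float]] = {
    "Claude Opus 4.6": 80.8,
    "Gemini 3.1 Pro": 80.6,
    "Claude Sonnet 4.6": 79.6,
    "Qwen 3.6 Plus": 78.8,
    "MiniMax M2.7": 78.0,
    "DeepSeek V4 Pro": None,
    "Kimi K2.6": None,
}


def rank_by_score(scores: dict[str, Optional[float]]) -> list[str]:
    """Sort by score, highest first; models without a score go last, by name."""
    scored = sorted(
        ((model, score) for model, score in scores.items() if score is not None),
        key=lambda pair: pair[1],
        reverse=True,
    )
    unscored = sorted(model for model, score in scores.items() if score is None)
    return [model for model, _ in scored] + unscored


if __name__ == "__main__":
    for position, model in enumerate(rank_by_score(SWE_BENCH), start=1):
        print(position, model)
```

Placing unscored models last avoids treating a missing result as a zero, which would misrepresent models that were simply not evaluated.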

Ready to deploy?

Pick any model and launch your AI chatbot in under 10 seconds.

Deploy Now