AI Model Pricing

Every model available on OpenClaw Launch. Switch models anytime, even after deployment.

GPT-5.4

OpenAI

OpenAI's latest frontier model. Excels at complex reasoning, coding, and multimodal analysis with 1M+ context.

Input$2.50/M
Output$15/M

GPT-5.4 Mini

OpenAI

Fast and efficient version of GPT-5.4 with vision support. Great for high-throughput workloads and computer use tasks.

Input$0.75/M
Output$4.50/M

Claude Opus 4.6

Anthropic
Most Capable

Anthropic's most capable model. Best for nuanced analysis, creative writing, and complex multi-step tasks.

Input$5/M
Output$25/M

Claude Sonnet 4.6

Anthropic
Recommended

The best balance of intelligence, speed, and cost. Ideal default for most use cases.

Input$3/M
Output$15/M

Gemini 3.1 Pro

Google
New

Google's frontier reasoning model with enhanced software engineering performance and improved agentic reliability.

Input$2/M
Output$12/M

Gemini 3 Flash

Google

Google's fast and affordable multimodal model. Great for vision tasks, quick reasoning, and high-throughput workloads.

Input$0.50/M
Output$3/M

Gemini 3.1 Flash Lite

Google
Cheapest

Ultra-efficient and fast. Great for high-volume tasks, quick responses, and the most cost-efficient workflows.

Input$0.25/M
Output$1.50/M

DeepSeek V3.2

DeepSeek

Impressive reasoning at a fraction of the cost. A strong choice for budget-conscious users.

Input$0.25/M
Output$0.40/M

Kimi K2.5

Moonshot AI

Strong multilingual support and long-context capabilities at a competitive price.

Input$0.45/M
Output$2.20/M

MiniMax M2.7

MiniMax
Best Value

Self-evolving model with strong coding and reasoning. Affordable pricing with 200K context window.

Input$0.30/M
Output$1.20/M

Step 3.5 Flash

Input$0.10/M
Output$0.30/M

MiMo V2 Pro

Xiaomi

Xiaomi's flagship 1T-parameter model optimized for agentic workflows. Strong at coding, reasoning, and complex orchestration tasks.

Input$1.00/M
Output$3.00/M

Qwen 3.5 35B

Alibaba Cloud

Native vision-language model with hybrid linear attention and sparse MoE architecture. Efficient multimodal reasoning at low cost.

Input$0.16/M
Output$1.30/M

Gemma 4 26B A4B

Input$0.12/M
Output$0.40/M

Gemma 4 31B

Input$0.14/M
Output$0.40/M

Trinity Large Preview

Arcee AI
Free

Free-tier model — no API costs at all. Perfect for getting started or low-stakes experiments.

InputFree
OutputFree

Nemotron 3 Super

NVIDIA
Free

Hybrid Mamba-Transformer MoE model — 120B params, 12B active. 1M context window with zero cost. Great for complex agent tasks.

InputFree
OutputFree

Search Models

Perplexity-powered web search with real-time citations. Used automatically when your bot needs current information.

Sonar

Perplexity
Default

Fast web search with citations. Good for quick factual lookups.

Input$1/M
Output$1/M

Intelligence Ranking

All models ranked by capability across major independent benchmarks

#ModelArena EloGPQASWE-benchAIMEBrowseComptau2-Bench
Human preferenceScience reasoningCoding & bugsMathWeb researchAgent tool use
1Claude Opus 4.6Most Capable150491.380.8%86.8
2Gemini 3.1 ProNew150094.380.6%
3GPT-5.4148592.882.7
4Gemini 3 Flash147390.4
5Claude Sonnet 4.6Recommended144689.979.6%
6Gemma 4 31BBest Free145284.389.2
7Kimi K2.5145187.676.8%~93%
8GPT-5.4 Mini54.4%
9MiniMax M2.7Best Value78.0%
10Step 3.5 FlashFree74.4%97.388.2%
11Gemini 3.1 Flash Lite143286.9
12DeepSeek V3.2142282.473.0%89.3
13Qwen 3.5 35B84.269.2%78.6**
14MiMo V2 Pro
15Nemotron 3 SuperFree79.260.5%90.2
16Trinity Large PreviewFree63.324.0

Ready to deploy?

Pick any model and launch your AI chatbot in under 10 seconds.

Deploy Now