Featured models from the catalog
A sample of commonly selected models. Full availability depends on tier.
Flagship models
Flagship models for coding, GPT, Chinese, long-context, and RAG workloads. The remaining models are grouped by capability below.
OpenAI
GPT-5.5 Pro
Launch flagship
- Latest GPT flagship
- Deep reasoning
In / Out
$31.20 / $187.20
per 1M tokens
Value
~40
chats / $10
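The Value figure is not defined on this page. The listed numbers are consistent with one chat being roughly 2,000 input tokens plus 1,000 output tokens, with the result rounded down. That chat size is an inference from the figures, not something the catalog states. A minimal Python sketch under that assumption (the function name and constants are hypothetical):

```python
import math
from typing import Optional

# Assumed chat size, inferred from the listed figures (not stated on the page).
ASSUMED_INPUT_TOKENS = 2_000
ASSUMED_OUTPUT_TOKENS = 1_000


def chats_per_budget(in_price: float, out_price: float,
                     budget: float = 10.0) -> Optional[int]:
    """Estimate how many chats a budget buys.

    in_price and out_price are dollars per 1M tokens, as shown on each card.
    Returns None when both prices are zero, since no value can be computed.
    """
    # Cost of one chat, expressed in millionths of a dollar.
    micro_cost = (ASSUMED_INPUT_TOKENS * in_price
                  + ASSUMED_OUTPUT_TOKENS * out_price)
    if micro_cost == 0:
        return None
    return math.floor(budget * 1_000_000 / micro_cost)


# GPT-5.5 Pro prices from the card above.
print(chats_per_budget(31.20, 187.20))
```

With the GPT-5.5 Pro prices, this gives 40, matching the card. Displayed prices are rounded to cents, so some cards may differ slightly from what this sketch computes.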
Alibaba (Qwen)
Qwen3 Max
Launch flagship
- Leading Chinese model
- Strong at SEA languages
In / Out
$0.81 / $4.06
per 1M tokens
Value
~1,759
chats / $10
Alibaba (Qwen)
Qwen3 Plus
- Balanced
- Long context
In / Out
$0.27 / $0.81
per 1M tokens
Value
~7,401
chats / $10
Alibaba (Qwen)
Qwen3 Turbo
- Cheapest tier
- Fast
In / Out
$0.05 / $0.20
per 1M tokens
Value
~33,333
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
Alibaba (Qwen)
Qwen2.5-VL Max
- Vision-language
In / Out
$1.00 / $4.00
per 1M tokens
Value
~1,666
chats / $10
Alibaba (Qwen)
Qwen2.5 Coder
- Code specialist
In / Out
$0.40 / $1.60
per 1M tokens
Value
~4,166
chats / $10
Alibaba (Qwen)
Qwen3 Coder
- Code specialist
In / Out
$0.22 / $1.78
per 1M tokens
Value
~4,512
chats / $10
Alibaba (Qwen)
Qwen3.6 Flash
- Fast & cheap
- Everyday tasks
In / Out
$0.21 / $0.83
per 1M tokens
Value
~8,012
chats / $10
Alibaba (Qwen)
Qwen3.6 35B-A3B
- MoE efficiency
- Strong reasoning
In / Out
$0.15 / $0.99
per 1M tokens
Value
~7,763
chats / $10
Alibaba (Qwen)
Qwen3.6 27B
- Balanced size
- Fast inference
In / Out
$0.32 / $3.17
per 1M tokens
Value
~2,628
chats / $10
Alibaba (Qwen)
Qwen3.6 Max Preview
- Top-tier reasoning
- Long context
In / Out
$1.03 / $6.18
per 1M tokens
Value
~1,213
chats / $10
Alibaba (Qwen)
Qwen3.5 Plus
- Solid all-rounder
- Agent capable
In / Out
$0.30 / $1.78
per 1M tokens
Value
~4,212
chats / $10
Alibaba (Qwen)
Qwen3.5 Flash
- Fast & affordable
- Lightweight
In / Out
$0.06 / $0.26
per 1M tokens
Value
~25,853
chats / $10
Alibaba (Qwen)
Qwen3.5 122B-A10B
- Large MoE
- Enterprise scale
In / Out
$0.26 / $2.06
per 1M tokens
Value
~3,881
chats / $10
Alibaba (Qwen)
Qwen3.5 35B-A3B
- MoE mid-size
- Good value
In / Out
$0.14 / $0.99
per 1M tokens
Value
~7,886
chats / $10
Alibaba (Qwen)
Qwen3.5 27B
- Dense mid-size
- Reliable
In / Out
$0.19 / $1.55
per 1M tokens
Value
~5,165
chats / $10
Alibaba (Qwen)
Qwen3.5 397B-A17B
- Massive MoE
- Top performance
In / Out
$0.39 / $2.32
per 1M tokens
Value
~3,234
chats / $10
Alibaba (Qwen)
Qwen3 235B-A22B
- Massive MoE
- Deep reasoning
In / Out
$0.45 / $1.80
per 1M tokens
Value
~3,700
chats / $10
Alibaba (Qwen)
Qwen3 30B-A3B
- MoE balanced
- Cost effective
In / Out
$0.09 / $0.45
per 1M tokens
Value
~16,020
chats / $10
Alibaba (Qwen)
Qwen3 14B
- Dense small
- Low latency
In / Out
$0.10 / $0.24
per 1M tokens
Value
~22,935
chats / $10
Alibaba (Qwen)
Qwen3 32B
- Dense capability
- Self-host friendly
In / Out
$0.08 / $0.28
per 1M tokens
Value
~22,967
chats / $10
Alibaba (Qwen)
Qwen3 8B
- Tiny & fast
- Edge deploy
In / Out
$0.05 / $0.40
per 1M tokens
Value
~20,202
chats / $10
Alibaba (Qwen)
Qwen3 4B
- Ultra-tiny
- Edge/on-device
In / Out
$0.02 / $0.08
per 1M tokens
Value
~80,128
chats / $10
Alibaba (Qwen)
Qwen3 1.7B
- Micro model
- Lowest cost
In / Out
$0.01 / $0.04
per 1M tokens
Value
~160,256
chats / $10
Alibaba (Qwen)
Qwen3 0.6B
- Nano model
- Bare-minimum
In / Out
$0.00 / $0.00
per 1M tokens
Value
—
chats / $10
Alibaba (Qwen)
Qwen3 VL Plus
- Vision-language
- Document parsing
In / Out
$1.25 / $4.99
per 1M tokens
Value
~1,335
chats / $10
Alibaba (Qwen)
Qwen3 VL Flash
- Fast vision
- Real-time
In / Out
$0.31 / $1.25
per 1M tokens
Value
~5,341
chats / $10
Alibaba (Qwen)
Qwen3 VL 235B-A22B
- Top vision MoE
- Deep understanding
In / Out
$0.20 / $0.87
per 1M tokens
Value
~7,886
chats / $10
Alibaba (Qwen)
Qwen3 VL 235B-A22B Thinking
- Vision + reasoning
- Step-by-step
In / Out
$0.26 / $2.58
per 1M tokens
Value
~3,229
chats / $10
Alibaba (Qwen)
Qwen VL Max
- Max vision quality
- Detailed analysis
In / Out
$1.04 / $4.16
per 1M tokens
Value
~1,602
chats / $10
Alibaba (Qwen)
Qwen VL Plus
- Vision plus
- Good value
In / Out
$0.52 / $2.08
per 1M tokens
Value
~3,205
chats / $10
Alibaba (Qwen)
Qwen VL OCR
- OCR specialist
- Text extraction
In / Out
$0.31 / $0.31
per 1M tokens
Value
~10,683
chats / $10
Alibaba (Qwen)
Qwen VL Max Latest
- Latest VL Max
- Auto-updated
In / Out
$1.04 / $4.16
per 1M tokens
Value
~1,602
chats / $10
Alibaba (Qwen)
Qwen VL Plus Latest
- Latest VL Plus
- Auto-updated
In / Out
$0.52 / $2.08
per 1M tokens
Value
~3,205
chats / $10
Alibaba (Qwen)
Qwen3 Coder Plus
- Code expert
- Multi-file edits
In / Out
$0.64 / $3.22
per 1M tokens
Value
~2,218
chats / $10
Alibaba (Qwen)
Qwen3 Coder Flash
- Fast coding
- Everyday dev
In / Out
$0.19 / $0.97
per 1M tokens
Value
~7,396
chats / $10
Alibaba (Qwen)
Qwen3 Coder 480B-A35B
- Massive coder MoE
- Deep code reasoning
In / Out
$1.04 / $4.16
per 1M tokens
Value
~1,602
chats / $10
Alibaba (Qwen)
Qwen3 Coder Next
- Next-gen coder
- Latest training
In / Out
$0.11 / $0.79
per 1M tokens
Value
~9,900
chats / $10
Alibaba (Qwen)
Qwen Coder Plus
- Solid coder
- Proven
In / Out
$0.31 / $1.25
per 1M tokens
Value
~5,341
chats / $10
Alibaba (Qwen)
Qwen Image 2.0
- Fast image gen
- Good quality
In / Out
$0.04 / $0.04
per 1M tokens
Value
~92,592
chats / $10
Alibaba (Qwen)
Qwen Image 2.0 Pro
- Pro image quality
- Photorealistic
In / Out
$0.08 / $0.08
per 1M tokens
Value
~42,735
chats / $10
Alibaba (Qwen)
WAN 2.7 Image
- Cinematic quality
- WAN series
In / Out
$0.04 / $0.04
per 1M tokens
Value
~79,365
chats / $10
Alibaba (Qwen)
WAN 2.7 Image Pro
- Pro cinematic
- Max quality
In / Out
$0.08 / $0.08
per 1M tokens
Value
~40,160
chats / $10
Alibaba (Qwen)
Qwen Image Max
- Max resolution
- Highest detail
In / Out
$0.06 / $0.06
per 1M tokens
Value
~53,763
chats / $10
Alibaba (Qwen)
Qwen Image Plus
- Balanced image
- Good value
In / Out
$0.03 / $0.03
per 1M tokens
Value
~107,526
chats / $10
Alibaba (Qwen)
Qwen Image Edit Plus
- Image editing
- Inpainting
In / Out
$0.04 / $0.04
per 1M tokens
Value
~79,365
chats / $10
Alibaba (Qwen)
Qwen Image Edit Max
- Max edit quality
- Professional
In / Out
$0.08 / $0.08
per 1M tokens
Value
~40,160
chats / $10
Alibaba (Qwen)
Qwen Image Edit
- Basic editing
- Affordable
In / Out
$0.02 / $0.02
per 1M tokens
Value
~158,730
chats / $10
Alibaba (Qwen)
Z Image Turbo
- Ultra-fast image
- Real-time gen
In / Out
$0.02 / $0.02
per 1M tokens
Value
~158,730
chats / $10
Alibaba (Qwen)
Qwen3 TTS Flash
- Fast TTS
- Natural voice
In / Out
$10.00 / $10.00
per 1M tokens
Value
~333
chats / $10
Alibaba (Qwen)
Qwen3 TTS Instruct Flash
- Controllable TTS
- Emotion & tone
In / Out
$15.00 / $15.00
per 1M tokens
Value
~222
chats / $10
Alibaba (Qwen)
Qwen3 TTS Flash Realtime
- Real-time TTS
- Streaming voice
In / Out
$12.00 / $12.00
per 1M tokens
Value
~277
chats / $10
Alibaba (Qwen)
Qwen3 TTS Instruct Realtime
- Real-time controlled TTS
- Live emotion
In / Out
$18.00 / $18.00
per 1M tokens
Value
~185
chats / $10
Alibaba (Qwen)
Qwen3 Speech-to-Speech
- Speech-to-speech
- Real-time
In / Out
$20.00 / $20.00
per 1M tokens
Value
~166
chats / $10
Alibaba (Qwen)
Qwen3 ASR Flash Realtime
- Real-time ASR
- Multilingual
In / Out
$5.00 / $5.00
per 1M tokens
Value
~666
chats / $10
Alibaba (Qwen)
Qwen3.5 Omni Plus
- All modalities
- Omni model
In / Out
$1.56 / $6.24
per 1M tokens
Value
~1,068
chats / $10
Alibaba (Qwen)
Qwen3.5 Omni Flash
- Fast omni
- Multimodal value
In / Out
$0.52 / $2.08
per 1M tokens
Value
~3,205
chats / $10
Alibaba (Qwen)
Qwen3 Omni Flash
- Omni flash
- All-in-one
In / Out
$0.31 / $1.25
per 1M tokens
Value
~5,341
chats / $10
Alibaba (Qwen)
Qwen3.5 Omni Plus Realtime
- Real-time omni
- Live multimodal
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
Alibaba (Qwen)
Qwen3.5 Omni Flash Realtime
- Fast real-time omni
- Streaming multimodal
In / Out
$0.83 / $3.33
per 1M tokens
Value
~2,003
chats / $10
Alibaba (Qwen)
Qwen MT Plus
- Translation pro
- High quality
In / Out
$0.31 / $0.31
per 1M tokens
Value
~10,683
chats / $10
Alibaba (Qwen)
Qwen MT Turbo
- Fast translation
- Good value
In / Out
$0.10 / $0.10
per 1M tokens
Value
~32,051
chats / $10
Alibaba (Qwen)
Qwen MT Flash
- Ultra-fast MT
- Cheapest translation
In / Out
$0.05 / $0.05
per 1M tokens
Value
~64,102
chats / $10
Alibaba (Qwen)
Qwen MT Lite
- Lite translation
- Budget MT
In / Out
$0.02 / $0.02
per 1M tokens
Value
~160,256
chats / $10
Alibaba (Qwen)
Qwen3 LiveTranslate Flash
- Real-time translation
- Live bilingual
In / Out
$0.16 / $0.16
per 1M tokens
Value
~21,367
chats / $10
Alibaba (Qwen)
Text Embedding V3
- Dense embeddings
- Search & RAG
In / Out
$0.02 / $0.00
per 1M tokens
Value
~240,384
chats / $10
Alibaba (Qwen)
Text Embedding V4
- Latest embeddings
- Best accuracy
In / Out
$0.03 / $0.00
per 1M tokens
Value
~160,256
chats / $10
Alibaba (Qwen)
QwQ Plus
- Deep reasoning
- Chain-of-thought
In / Out
$0.52 / $2.08
per 1M tokens
Value
~3,205
chats / $10
Alibaba (Qwen)
QvQ Max
- Visual reasoning
- Math + vision
In / Out
$1.04 / $4.16
per 1M tokens
Value
~1,602
chats / $10
Alibaba (Qwen)
Qwen3 Next 80B-A3B Thinking
- Next-gen thinking
- Deep analysis
In / Out
$0.10 / $0.77
per 1M tokens
Value
~10,349
chats / $10
Alibaba (Qwen)
Qwen3 Next 80B-A3B Instruct
- Next-gen instruct
- Versatile
In / Out
$0.09 / $1.09
per 1M tokens
Value
~7,885
chats / $10
Alibaba (Qwen)
Qwen Plus
- Legacy plus
- Stable
In / Out
$0.27 / $0.81
per 1M tokens
Value
~7,396
chats / $10
Alibaba (Qwen)
Qwen Max
- Legacy flagship
- Powerful
In / Out
$0.81 / $4.06
per 1M tokens
Value
~1,761
chats / $10
Alibaba (Qwen)
Qwen Turbo
- Legacy turbo
- Cheapest text
In / Out
$0.05 / $0.21
per 1M tokens
Value
~32,051
chats / $10
Alibaba (Qwen)
Qwen Flash
- Fast & simple
- Quick responses
In / Out
$0.10 / $0.42
per 1M tokens
Value
~16,025
chats / $10
Alibaba (Qwen)
Qwen Omni Turbo
- Budget omni
- All-modal value
In / Out
$0.21 / $0.83
per 1M tokens
Value
~8,012
chats / $10
Alibaba (Qwen)
Qwen Plus Character
- Character roleplay
- Persona engine
In / Out
$0.27 / $0.81
per 1M tokens
Value
~7,396
chats / $10
Alibaba (Qwen)
Qwen Flash Character
- Fast character
- Quick roleplay
In / Out
$0.10 / $0.42
per 1M tokens
Value
~16,025
chats / $10
Alibaba (Qwen)
Qwen3 Omni Captioner
- Image captioning
- Alt-text specialist
In / Out
$0.21 / $0.21
per 1M tokens
Value
~16,025
chats / $10
Alibaba (Qwen)
Qwen2 7B Instruct
- Open Qwen2
- Classic base
In / Out
$0.00 / $0.00
per 1M tokens
Value
—
chats / $10
Alibaba (Qwen)
CCAI Pro
- Customer service AI
- Contact center
In / Out
$0.21 / $0.83
per 1M tokens
Value
~8,012
chats / $10
Alibaba (Qwen)
Tongyi Tingwu SLP
- Meeting transcription
- Spoken language
In / Out
$5.00 / $5.00
per 1M tokens
Value
~666
chats / $10
Alibaba (Qwen)
Qwen3 LiveTranslate Realtime
- Live translation
- Real-time bilingual
In / Out
$0.21 / $0.21
per 1M tokens
Value
~16,025
chats / $10
Alibaba (Qwen)
Qwen Max Latest
- Latest Max alias
- Auto-updated
In / Out
$0.81 / $4.06
per 1M tokens
Value
~1,761
chats / $10
Alibaba (Qwen)
Qwen Plus Latest
- Latest Plus alias
- Auto-updated
In / Out
$0.27 / $0.81
per 1M tokens
Value
~7,396
chats / $10
Alibaba (Qwen)
Qwen Turbo Latest
- Latest Turbo alias
- Auto-updated
In / Out
$0.05 / $0.21
per 1M tokens
Value
~32,051
chats / $10
Chat & customer support
20 models
Google DeepMind (Gemini)
Gemini 3 Flash
- Fast multimodal
- 1M context
In / Out
$0.52 / $3.12
per 1M tokens
Value
~2,403
chats / $10
DeepSeek
DeepSeek V4 Flash
- 1M context
- Competitive pricing
In / Out
$0.15 / $0.29
per 1M tokens
Value
~17,152
chats / $10
Google DeepMind (Gemini)
Gemini 2.0 Flash
- Fast & capable
In / Out
$0.10 / $0.40
per 1M tokens
Value
~16,666
chats / $10
Google DeepMind (Gemini)
Gemini 2.5 Flash
- Thinking model
In / Out
$0.30 / $1.50
per 1M tokens
Value
~4,761
chats / $10
Google DeepMind (Gemini)
Gemini 3.1 Flash-Lite
- Lowest-cost Gemini
- 1M context
In / Out
$0.26 / $1.56
per 1M tokens
Value
~4,807
chats / $10
OpenAI
GPT-4.1 Mini
- Balanced GPT-4.1
In / Out
$0.42 / $1.66
per 1M tokens
Value
~4,006
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
DeepSeek
DeepSeek V3.2
- MoE architecture
- Fast inference
In / Out
$0.22 / $0.33
per 1M tokens
Value
~12,836
chats / $10
Alibaba (Qwen)
Qwen3 VL 235B-A22B
- Top vision MoE
- Deep understanding
In / Out
$0.20 / $0.87
per 1M tokens
Value
~7,886
chats / $10
Zhipu (GLM)
GLM-4V Plus
- Vision-language
In / Out
$0.50 / $1.00
per 1M tokens
Value
~5,000
chats / $10
DeepSeek
DeepSeek V3
- MoE flagship
In / Out
$0.28 / $1.14
per 1M tokens
Value
~5,868
chats / $10
ByteDance (Doubao)
Doubao 1.5 Vision Pro
- Vision + CN tasks
In / Out
$0.40 / $1.20
per 1M tokens
Value
~5,000
chats / $10
OpenAI
GPT-4.1 Nano
- Cheapest GPT-4.1
In / Out
$0.10 / $0.42
per 1M tokens
Value
~16,025
chats / $10
OpenAI
GPT-4o
- Multimodal
- Strong tool use
In / Out
$2.60 / $10.40
per 1M tokens
Value
~641
chats / $10
ByteDance (Doubao)
Doubao 1.5 Pro 32k
- Strong on Chinese tasks
- Competitive pricing
In / Out
$0.12 / $0.29
per 1M tokens
Value
~19,267
chats / $10
xAI (Grok)
Grok 3 Mini
- Fast Grok
In / Out
$0.31 / $1.25
per 1M tokens
Value
~5,341
chats / $10
OpenAI
GPT-4.1
- Responses API
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
xAI (Grok)
Grok 3
- Real-time knowledge
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
Alibaba (Qwen)
Qwen Turbo
- Legacy turbo
- Cheapest text
In / Out
$0.05 / $0.21
per 1M tokens
Value
~32,051
chats / $10
Alibaba (Qwen)
Qwen Turbo Latest
- Latest Turbo alias
- Auto-updated
In / Out
$0.05 / $0.21
per 1M tokens
Value
~32,051
chats / $10
Code assistant
13 models
DeepSeek
DeepSeek V4 Flash
- 1M context
- Competitive pricing
In / Out
$0.15 / $0.29
per 1M tokens
Value
~17,152
chats / $10
DeepSeek
DeepSeek V4 Pro
- 1M context
- Frontier reasoning
In / Out
$0.45 / $0.91
per 1M tokens
Value
~5,527
chats / $10
Google DeepMind (Gemini)
Gemini 2.5 Flash
- Thinking model
In / Out
$0.30 / $1.50
per 1M tokens
Value
~4,761
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
DeepSeek
DeepSeek Coder V2
- Code specialist
In / Out
$0.14 / $0.28
per 1M tokens
Value
~17,857
chats / $10
DeepSeek
DeepSeek V3.2
- MoE architecture
- Fast inference
In / Out
$0.22 / $0.33
per 1M tokens
Value
~12,836
chats / $10
DeepSeek
DeepSeek V3
- MoE flagship
In / Out
$0.28 / $1.14
per 1M tokens
Value
~5,868
chats / $10
OpenAI
GPT-4o
- Multimodal
- Strong tool use
In / Out
$2.60 / $10.40
per 1M tokens
Value
~641
chats / $10
ByteDance (Doubao)
Doubao 1.5 Pro 32k
- Strong on Chinese tasks
- Competitive pricing
In / Out
$0.12 / $0.29
per 1M tokens
Value
~19,267
chats / $10
xAI (Grok)
Grok 3 Mini
- Fast Grok
In / Out
$0.31 / $1.25
per 1M tokens
Value
~5,341
chats / $10
OpenAI
GPT-4.1
- Responses API
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
xAI (Grok)
Grok 3
- Real-time knowledge
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
OpenAI
o4-mini
- Reasoning model
In / Out
$1.14 / $4.58
per 1M tokens
Value
~1,456
chats / $10
Long documents
2 models
DeepSeek
DeepSeek V4 Pro
- 1M context
- Frontier reasoning
In / Out
$0.45 / $0.91
per 1M tokens
Value
~5,527
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
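For long documents the input price dominates the bill. As a rough sketch, assuming flat per-token pricing (this page does not say whether long-context requests are priced differently) and a hypothetical request of 1,000,000 input tokens and 1,000 output tokens, the cost of one full-context request can be estimated as follows. The function name and token counts are illustrative only.

```python
def request_cost(in_price: float, out_price: float,
                 input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request at per-1M-token prices."""
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Prices copied from the two cards in this group; token counts are hypothetical.
long_doc_models = {
    "DeepSeek V4 Pro": (0.45, 0.91),
    "Qwen3.6 Plus": (0.29, 1.72),
}

for name, (in_price, out_price) in long_doc_models.items():
    cost = request_cost(in_price, out_price, 1_000_000, 1_000)
    print(f"{name}: ${cost:.4f} per full-context request")
```

At these sizes the output tokens add well under a cent, so comparing the input prices alone gives nearly the same ranking.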
Vision & OCR
10 models
Google DeepMind (Gemini)
Gemini 3 Flash
- Fast multimodal
- 1M context
In / Out
$0.52 / $3.12
per 1M tokens
Value
~2,403
chats / $10
Google DeepMind (Gemini)
Gemini 2.0 Flash
- Fast & capable
In / Out
$0.10 / $0.40
per 1M tokens
Value
~16,666
chats / $10
Google DeepMind (Gemini)
Gemini 2.5 Flash
- Thinking model
In / Out
$0.30 / $1.50
per 1M tokens
Value
~4,761
chats / $10
Google DeepMind (Gemini)
Gemini 3.1 Flash-Lite
- Lowest-cost Gemini
- 1M context
In / Out
$0.26 / $1.56
per 1M tokens
Value
~4,807
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
Alibaba (Qwen)
Qwen3 VL 235B-A22B
- Top vision MoE
- Deep understanding
In / Out
$0.20 / $0.87
per 1M tokens
Value
~7,886
chats / $10
Zhipu (GLM)
GLM-4V Plus
- Vision-language
In / Out
$0.50 / $1.00
per 1M tokens
Value
~5,000
chats / $10
ByteDance (Doubao)
Doubao 1.5 Vision Pro
- Vision + CN tasks
In / Out
$0.40 / $1.20
per 1M tokens
Value
~5,000
chats / $10
OpenAI
GPT-4o
- Multimodal
- Strong tool use
In / Out
$2.60 / $10.40
per 1M tokens
Value
~641
chats / $10
Alibaba (Qwen)
Qwen3 Omni Captioner
- Image captioning
- Alt-text specialist
In / Out
$0.21 / $0.21
per 1M tokens
Value
~16,025
chats / $10
Automation & agents
13 models
Google DeepMind (Gemini)
Gemini 3 Flash
- Fast multimodal
- 1M context
In / Out
$0.52 / $3.12
per 1M tokens
Value
~2,403
chats / $10
DeepSeek
DeepSeek V4 Flash
- 1M context
- Competitive pricing
In / Out
$0.15 / $0.29
per 1M tokens
Value
~17,152
chats / $10
Google DeepMind (Gemini)
Gemini 2.0 Flash
- Fast & capable
In / Out
$0.10 / $0.40
per 1M tokens
Value
~16,666
chats / $10
DeepSeek
DeepSeek V4 Pro
- 1M context
- Frontier reasoning
In / Out
$0.45 / $0.91
per 1M tokens
Value
~5,527
chats / $10
OpenAI
GPT-4.1 Mini
- Balanced GPT-4.1
In / Out
$0.42 / $1.66
per 1M tokens
Value
~4,006
chats / $10
Alibaba (Qwen)
Qwen3.6 Plus
- 1M context
- Visual understanding
In / Out
$0.29 / $1.72
per 1M tokens
Value
~4,364
chats / $10
DeepSeek
DeepSeek V3.2
- MoE architecture
- Fast inference
In / Out
$0.22 / $0.33
per 1M tokens
Value
~12,836
chats / $10
Alibaba (Qwen)
Qwen3 VL 235B-A22B
- Top vision MoE
- Deep understanding
In / Out
$0.20 / $0.87
per 1M tokens
Value
~7,886
chats / $10
OpenAI
GPT-4o
- Multimodal
- Strong tool use
In / Out
$2.60 / $10.40
per 1M tokens
Value
~641
chats / $10
ByteDance (Doubao)
Doubao 1.5 Pro 32k
- Strong on Chinese tasks
- Competitive pricing
In / Out
$0.12 / $0.29
per 1M tokens
Value
~19,267
chats / $10
OpenAI
GPT-4.1
- Responses API
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
xAI (Grok)
Grok 3
- Real-time knowledge
In / Out
$2.08 / $8.32
per 1M tokens
Value
~801
chats / $10
OpenAI
o4-mini
- Reasoning model
In / Out
$1.14 / $4.58
per 1M tokens
Value
~1,456
chats / $10
Video understanding
2 models
Zhipu (GLM)
GLM-4V Plus
- Vision-language
In / Out
$0.50 / $1.00
per 1M tokens
Value
~5,000
chats / $10
ByteDance (Doubao)
Doubao 1.5 Vision Pro
- Vision + CN tasks
In / Out
$0.40 / $1.20
per 1M tokens
Value
~5,000
chats / $10