AI Models

Browse and compare pricing for 19+ AI models from leading providers

Luma AI

Luma uni-1

Luma’s first unified understanding and generation model, a decoder-only autoregressive transformer that interleaves text and images for multimodal reasoning, visual editing, and image generation.

Context: N/A
Alibaba Cloud (Qwen Team)

Qwen Image 2.0

Next-generation foundational image generation model unifying text-to-image generation and image editing, with professional typography rendering, native 2K resolution support, and support for prompt instructions of up to 1K tokens.

Context: 1K
OpenAI

GPT-5.4

Most capable and efficient frontier model for professional work, with native computer-use capabilities, tool search, and extreme reasoning.

Input: $2.50/1M tokens
Output: $15.00/1M tokens
Context: 1.1M
DeepSeek

DeepSeek-V3.2

An efficient Mixture-of-Experts language model with 671B total parameters. It uses DeepSeek Sparse Attention (DSA) to improve reasoning, agentic performance, and long-context efficiency, with performance comparable to frontier models such as GPT-5.

Input: $0.280/1M tokens
Output: $0.420/1M tokens
Context: 128K
Google DeepMind

Nano Banana 2

State-of-the-art image generation and editing model (Gemini 3.1 Flash Image) combining Pro-level quality, advanced world knowledge, real-time web search grounding, subject consistency, and precise text rendering with Flash-level speed.

Image: $0.250/image
Video: $0.0000/sec
Context: N/A
xAI

Grok 4.20

Grok 4.20 Beta: xAI's frontier multimodal model with a native four-agent collaboration system (Grok, Harper, Benjamin, Lucas) for real-time debate, fact-checking, and reduced hallucinations.

Context: 256K
Moonshot AI

Kimi 2.5

A 1-trillion parameter Mixture-of-Experts (MoE) multimodal model featuring native vision capabilities, complex reasoning, and Agent Swarm support for parallel sub-agent coordination.

Input: $0.600/1M tokens
Output: $3.00/1M tokens
Context: 262K
Google

Gemini 3.1 Pro

Gemini 3.1 Pro is the next iteration in the Gemini 3 series of models, a suite of highly capable, natively multimodal reasoning models.

Input: $2.00/1M tokens
Output: $12.00/1M tokens
Context: 1.0M
Zhipu AI (Z.ai)

GLM-4.6V-Flash

Lightweight 9B-parameter open-source multimodal vision-language model optimized for local deployment, low-latency inference, and edge or consumer hardware. Part of the GLM-4.6V series, with native multimodal function calling, strong visual understanding, and long-context capabilities.

Input: $0.0000/1M tokens
Output: $0.0000/1M tokens
Context: 128K
Zhipu AI (Z.ai)

GLM-4.6V-FlashX

Paid, enhanced version of the GLM-4.6V-Flash multimodal model with higher capacity and stability. Supports native multimodal tool calling, vision-language tasks, and long-context processing.

Input: $0.0000/1M tokens
Output: $0.400/1M tokens
Context: 128K
Zhipu AI (Z.ai)

GLM-4.6V

Open-source multimodal vision-language model with native function calling, state-of-the-art visual understanding and reasoning at its scale, long-context multimodal processing, and support for interleaved image-text generation and agentic workflows.

Input: $0.300/1M tokens
Output: $0.900/1M tokens
Context: 128K
Zhipu AI (Z.ai)

GLM-5-Code

Specialized coding variant of the GLM-5 flagship model, optimized for advanced programming, complex code generation, agentic development workflows, and software engineering tasks.

Input: $1.20/1M tokens
Output: $5.00/1M tokens
Context: 200K
Zhipu AI (Z.ai)

GLM-5

Flagship Mixture-of-Experts foundation model designed for agentic engineering, complex systems reasoning, long-horizon agent tasks, and advanced coding, with low hallucination rates.

Input: $1.00/1M tokens
Output: $3.20/1M tokens
Context: 200K
Mistral AI

Mistral OCR 3

A proprietary Optical Character Recognition (OCR) model specialized in complex tables, forms, handwritten content, and multi-page PDFs, outputting high-fidelity Markdown or HTML structures.

Context: N/A
Mistral AI

Mistral Medium 3

Frontier-class multimodal model optimized for enterprise-grade performance, high-speed reasoning, and long-context document understanding.

Input: $0.400/1M tokens
Output: $2.00/1M tokens
Context: 128K
Mistral AI

Mistral Large 3

Open-weight, general-purpose, flagship multimodal and multilingual model.

Input: $0.500/1M tokens
Output: $1.50/1M tokens
Context: 256K
Anthropic

Claude Opus 4.6

Released in Feb 2026, Opus 4.6 is Anthropic's frontier model featuring 'Adaptive Thinking' and 'Agent Teams' capabilities. It offers a 1M token context window (in beta) and is optimized for complex reasoning, deep research, and autonomous coding tasks.

Input: $5.00/1M tokens
Output: $25.00/1M tokens
Context: 200K
Anthropic

Claude Sonnet 4.5

Anthropic's intelligent flagship model optimized for complex agents, software engineering, and computer use. It features 'extended thinking' for deep reasoning and achieves state-of-the-art performance on coding benchmarks like SWE-bench Verified.

Input: $3.00/1M tokens
Output: $15.00/1M tokens
Context: 200K
OpenAI

GPT-5.2

GPT-5.2 is a more reliable and capable AI model with stronger reasoning, longer context understanding, and improved real-world usability.

Input: $1.75/1M tokens
Output: $14.00/1M tokens
Context: 400K
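
Estimating request cost

The per-1M-token rates listed above translate into a per-request cost by multiplying each rate by the number of tokens used and dividing by one million. The following is a minimal sketch of that arithmetic, not an official calculator: the function name `request_cost` and the example token counts are hypothetical, and the three prices are copied from the GPT-5.4, DeepSeek-V3.2, and Claude Opus 4.6 entries. Real bills may differ (for example, through caching or tiered pricing, which this listing does not describe).

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (input, output), copied from the entries above.
PRICES_PER_MILLION = {
    "GPT-5.4": (Decimal("2.50"), Decimal("15.00")),
    "DeepSeek-V3.2": (Decimal("0.280"), Decimal("0.420")),
    "Claude Opus 4.6": (Decimal("5.00"), Decimal("25.00")),
}

MILLION = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed per-1M-token rates."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${request_cost(name, 10_000, 2_000):.4f}")
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise when many requests are summed.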