Zhipu AI (Z.ai)
GLM-4.6V Pricing & Cost Breakdown
Open-source multimodal vision-language model with native function calling, state-of-the-art visual understanding and reasoning at its scale, long-context multimodal processing, and support for interleaved image-text generation and agentic workflows
Pricing
Input Price
$0.300
per 1M tokens
Output Price
$0.900
per 1M tokens
Specifications
Context Length
128K tokens
Status
active
Supported Modalities
TextImageVideo
Features
StreamingFunction CallingStructured OutputNative Multimodal Tool UseLong Context Visual Reasoning
Cost Calculator
≈ 750,000 words
≈ 375,000 words
Input Cost$0.3000
Output Cost$0.4500
Total Cost$0.7500
