Zhipu AI (Z.ai)

GLM-4.6V Pricing & Cost Breakdown

Open-source multimodal vision-language model with native function calling, state-of-the-art visual understanding and reasoning at its scale, long-context multimodal processing, and support for interleaved image-text generation and agentic workflows

Pricing

Input Price
$0.300
per 1M tokens
Output Price
$0.900
per 1M tokens

Specifications

Context Length
128K tokens
Status
active
Supported Modalities
TextImageVideo
Features
StreamingFunction CallingStructured OutputNative Multimodal Tool UseLong Context Visual Reasoning

Cost Calculator

750,000 words
375,000 words
Input Cost$0.3000
Output Cost$0.4500
Total Cost$0.7500