Google's Gemini 3.5 Flash and Zhipu AI's GLM-5.1 compared on specs, pricing, and capabilities — as configured in AI Crucible ensembles.
| Specification | Gemini 3.5 Flash | GLM-5.1 |
|---|---|---|
| Provider | Zhipu AI | |
| API model ID | gemini-3.5-flash | zai-org/GLM-5.1 |
| Description | Google's latest Flash model, frontier reasoning and multimodal quality at high speed with 1M context. | Z.AI's most advanced GLM model, optimized for long-horizon agentic tasks with interleaved reasoning. |
| Context window | 1M tokens | 203K tokens |
| Max output tokens | 65,536 | 65,536 |
| Input cost ($/1M tokens) | $1.80 | $1.68 |
| Output cost ($/1M tokens) | $10.80 | $5.28 |
| Cache read cost ($/1M tokens) | $0.18 | $0.31 |
| Latency | low | medium |
| Reasoning model | Yes | Yes |
| Vision (image input) | Yes | No |
| Tool use | Yes | Yes |
See all model comparisons, full model specifications, benchmark results, or model comparison articles.