releaseMarch 15, 2026

Z.ai releases GLM-5-Turbo with 202K context for OpenClaw-style agent workflows

Z.ai released GLM-5-Turbo as a faster GLM-5 variant for OpenClaw-style tool use, with 202K context, OpenRouter access, and higher off-peak limits. Try it as a cheaper speed tier for agent workflows, but benchmark completion quality on your own tasks before wider use.

GLM Coding Agents Rate Limits

3 min read

Z.ai releases GLM-5-Turbo with 202K context for OpenClaw-style agent workflows

TL;DR

Z.ai's launch thread introduces GLM-5-Turbo as a faster GLM-5 variant tuned for "agent-driven environments such as OpenClaw," with an official developer guide describing stronger tool invocation, command following, and long-chain execution.
Access is split by plan: according to the rollout schedule, Pro users get GLM-5-Turbo in March, while Lite users get base GLM-5 in March and GLM-5-Turbo in April; Z.ai's early-access post also opened waitlists for both tiers.
The model is already exposed through routing and client surfaces: the OpenRouter listing shows 202,752-token context and pricing at $0.96 per million input tokens and $3.20 per million output tokens, while