Z.ai pitches GLM-5.2 for long-running software engineering tasks
Z.ai has released GLM-5.2, an MIT-licensed open-source AI model designed for long-running software engineering tasks, as the Chinese company seeks to challenge proprietary coding models on cost and performance. The company said GLM-5.2 ranked just behind Anthropic’s Claude Opus 4.8 on FrontierSWE, a long-horizon coding benchmark, trailing it by 1%. Z.ai said the model also edged out OpenAI’s GPT-5.5 by 1%. Z.ai said GLM-5.2 supports a one-million-token context window with up to 131,072 output tokens, positioning it for agentic coding workflows that require reasoning across large codebases. The company is also making an efficiency argument. It said GLM-5.2 uses a technique called IndexShare, which reduces per-token compute by 2.9 times at a one-million-token context length. It also said changes to the model’s multi-token prediction layer increased the acceptance length for speculative decoding by up to 20%. The changes are aimed at a practical problem for developers: long-context coding