IBL News | New York
Beijing-based AI pioneer Zhipu AI released GLM-5.2 this week, a low-cost, powerful, open-source LLM featuring a massive 1-million-token context window that aims to attract users from US rivals dissatisfied with high prices and Washington’s abrupt order to suspend top American models overseas. Its stock rocketed in the Hong Kong market.
The launch underscored how Chinese players are seeking to capture users who are seeking alternatives to top models from Western leaders.
Zhipu’s announcement came shortly after San Francisco-based Anthropic abruptly suspended access to its flagship models Fable-5 and Mythos-5 to all foreign nationals, following a federal export-control directive issued on national-security grounds.
GLM-5.2 will be available to all users of Zhipu’s new GLM Coding Plan subscription, which is priced at just a tenth of Anthropic’s premium Claude Code and Claude Max tiers. The GLM-5.2 application programming interface (API) went live this week, while the model itself has been open-sourced under the permissive MIT license.
GLM-5.2 features native integrations with more than 20 popular agent tools, including developer mainstays Claude Code, Cline, and Cursor.
Introducing GLM-5.2: Frontier Intelligence, Open Weights
– Significant improvements in coding and agentic tasks
– Strong long-horizon capabilities with a 1M context window
– Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong… pic.twitter.com/SjGPSVhePJ— Z.ai (@Zai_org) June 16, 2026
To run the GLM-5.2, a 1.51TB AI model running on any laptop, Unsloth Studio just made it a lot less impossible, shrinking it by 84%. This way, it runs locally on a 256GB Mac
Unsloth’s fix the quantization, dropping the 2-bit version, drops to 238GB, retaining ~82% accuracy.
To run it locally, users actually need:
-
- A 256GB Mac (M3/M4 Ultra) for single-box setup
- Or a 24GB GPU plus 256GB RAM with memory offloading
- Tools: llama.cpp, LM Studio, or Unsloth Studio
These offline coding agents and long-context tasks previously required a cloud API. Now, no usage fees, no rate limits, no data leaving anyone’s machine.
