HomeInsightsAI Tools
AI Tools · 10 min read

A Chinese open model just matched the best AI at a fraction of the price. Should your business care?

In early July 2026, the Beijing company Zhipu AI released GLM-5.2, an open-weight model that runs at roughly 1.40 dollars per million input tokens and 4.40 dollars output, against about 5 and 25 for Claude Opus and 5 and 30 for GPT-5.5, while trailing the top models by only about one percentage point on long, messy coding tasks. For a small business the meaning is not that you should rush to switch, it is that the price of very capable AI just fell again, which makes automation cheaper to run whether or not you ever touch GLM-5.2 directly. Here is the honest read on where a cheap open model helps you and where it does not.

Every few months a new AI model arrives claiming to match the best in the world for a fraction of the cost, and most of the time the claim deserves a raised eyebrow. GLM-5.2, released by the Beijing-based company Zhipu AI, which also operates under the name Z.ai, is one of the more credible versions of that claim, and it arrived at a charged moment: just as the United States had briefly restricted access to some of Anthropic's newest models for foreign nationals, a Chinese lab shipped an open model that anyone in the world can download and run without regional limits. The timing turned a model release into a story about the global balance of AI power.

For a small business owner, most of that geopolitical drama is background. What you actually want to know is simpler and more useful. Is this model good enough to run real work, is it genuinely cheaper, and does any of that change what you should do this quarter? The short answers are yes it is very capable, yes it is markedly cheaper, and probably it does not change your immediate plans while quietly making the whole economics of AI automation better for you over time. This article explains why all three are true at once.

The five-second answer

GLM-5.2 is a very capable open-weight AI model that nearly matches the best frontier models on hard coding tasks while costing roughly a third as much per token, around 1.40 dollars input and 4.40 dollars output versus 5 and 25 to 30 for the top Western models. For most small businesses this is not a reason to switch tools tomorrow, because the model you already use is fine and switching has costs. It is a reason to be optimistic: cheap, capable models like this keep pushing down the cost of running AI automation, so the real opportunity is building the automation, confident that the engine underneath it is getting cheaper and more competitive every few months, not agonising over which specific model powers it.

What GLM-5.2 actually is

GLM-5.2 is a large language model in the same broad family as Claude, ChatGPT, and Gemini, meaning it reads and writes text, answers questions, writes and debugs code, and can drive multi-step automated tasks. Technically it is what is called a mixture-of-experts model with 744 billion total parameters and 40 billion active at any moment, and it carries a context window of one million tokens, which is roughly 750,000 words of working memory at once, quadruple its predecessor. Those specifications put it firmly in frontier territory rather than in the second tier of cheaper, weaker models.

The performance claims are what earned it attention. Zhipu says GLM-5.2 performs almost on par with Anthropic's Claude Opus 4.8 and OpenAI's GPT-5.5, and on the specific challenge of long, open-ended technical projects lasting hours to days, the kind of sustained agentic coding that stresses a model's ability to stay coherent, it reportedly trails Opus 4.8 by only about one percentage point while edging past GPT-5.5 and Anthropic's older Opus 4.7. Independent reporting placed it at or near the top of open-weight model rankings, which is a meaningful external check on a vendor's own numbers.

The context that makes this notable is that GLM-5.2 came from a Chinese lab and, in some reports, runs on Huawei silicon rather than the Western chips that dominate AI training, and it arrived right after the US had restricted foreign access to certain Anthropic models. That combination fuelled a broader debate about whether China has closed the gap with American AI. For your business that debate is mostly a spectator sport, but it does have one practical edge worth keeping: competition at the frontier from multiple countries and companies is what keeps prices falling and options open, and that competition works in your favour regardless of who leads in any given month.

What "open-weight" means for you

The single most important word in this story is open, and it is worth being precise about what it means, because it is both a genuine advantage and a source of confusion. An open-weight model is one whose underlying trained parameters are published, so that anyone can download the model and run it on their own hardware or through a provider of their choice, modify it, and use it without asking permission or being cut off by the original maker. This is fundamentally different from a closed model like Claude or GPT-5.5, which you can only access through the maker's own service and on the maker's terms.

The practical benefit of openness for a business is resilience and control. A closed model can be repriced, restricted, or made unavailable in your region by a single company or government decision, exactly the kind of disruption we wrote about in our piece on AI vendor availability risk. An open-weight model, once you have it, cannot be taken away in the same way, because it does not depend on one company's ongoing permission. For a business that wants to reduce its exposure to any single vendor, the existence of strong open models is a real strategic asset, even if you never run one yourself, because it keeps the closed providers honest on price.

The catch is that openness does not mean free or effortless. Running a 744-billion-parameter model well takes serious hardware or a paying relationship with a hosting provider, and most small businesses will access GLM-5.2, if at all, through a service like a model marketplace rather than by running it themselves. Openness is a property that benefits you mostly through the competition and optionality it creates, not through you personally downloading and operating a giant model, which is a task for specialists rather than a corner shop.

The price story that actually matters

Here is where GLM-5.2 becomes concretely relevant to a small business rather than just interesting. Accessed through providers, the model has been priced around 1.40 dollars per million input tokens and 4.40 dollars per million output tokens. Compare that to roughly 5 dollars input and 25 output for Claude Opus, and 5 and 30 for GPT-5.5, and you are looking at a model that costs somewhere around a third as much to run for output-heavy work while claiming near-parity on quality. For automation that runs at volume, that difference is not academic, it is the gap between a workflow that comfortably pays for itself and one that is marginal.

But notice the pattern rather than fixating on this one model. GLM-5.2 is the latest data point in a steady, relentless collapse in the price of capable AI, the same trend that let Anthropic price Claude Sonnet 5 far below its own flagship, which we covered in our Claude Sonnet 5 small-business guide. Every few months, a model arrives offering something close to last season's best quality at a fraction of last season's price. The specific winner rotates. The direction never does. Capable AI gets cheaper, consistently, and that is the fact a business should plan around rather than any single model's numbers.

What that means for you is that the economics of automating a task keep improving in the background whether you act or not. A workflow that was borderline too expensive to bother with a year ago may be comfortably profitable now purely because the underlying model prices fell, and models like GLM-5.2 push that trend further. The strategic move is to build your automation with the model treated as a cheap, swappable, ever-improving commodity, and to keep your attention on the work being automated, because that is where the durable value sits.

Should your business actually use it?

For most small businesses using AI through everyday chat tools, the honest answer is that GLM-5.2 is not something you need to touch directly. If your team types into ChatGPT or Claude and reads the replies, this model does not change your day, and there is no consumer chat product built around it that would tempt your staff away from the tools they know. The chat layer of your business is best left on whatever familiar tool your team is already fluent in, a point we make in detail in our comparison of Claude Sonnet 5 versus ChatGPT.

Where GLM-5.2 becomes a live option is in automation built on an API, where cost per call multiplied by volume is a real line item and no human cares which brand answers. If you run, or plan to run, high-volume workflows behind the scenes, a cheaper capable model is worth genuinely evaluating, and an open one with strong coding performance is a reasonable candidate to test against your incumbent. The right way to do that is not to switch on faith but to run the same real workload through both, compare cost and output quality honestly over a few days, and let the results decide.

The businesses most likely to benefit are those doing heavy, code-adjacent or long-running automated work, since that is where GLM-5.2's particular strengths show and where its price advantage compounds fastest. For lighter, lower-volume automation, the savings may be too small to justify moving off a model you already trust, and the sensible choice is to stay put and revisit when your volume grows. As always, the decision is an engineering one about your specific workload, not a verdict on which country or company is winning the AI race.

The honest cautions

A cheap, capable model is genuinely good news, but a responsible read includes the caveats. The first is that vendor benchmark claims, from any company in any country, deserve healthy scepticism until confirmed by your own testing on your own tasks. Near-parity on a coding benchmark does not guarantee near-parity on your specific customer emails or your particular document processing, and the only test that truly counts is the one you run on your real work. Treat impressive numbers as a reason to test, never as a reason to switch sight unseen.

The second caution concerns data and governance. Using any model, open or closed, Chinese or Western, means thinking about where your data goes and under what terms, especially if you handle sensitive customer information or operate under rules like the ones we covered in our EU AI Act guide. Some businesses will have legitimate reasons, contractual, regulatory, or reputational, to be deliberate about which providers they route data through, and that consideration is separate from raw model quality. Openness helps here, because an open model can be run through a provider or environment you choose, but it is a question to answer consciously rather than ignore.

The third is simply that novelty is not a strategy. A model being newer and cheaper this month does not obligate you to move, and businesses that chase every fresh release end up churning tools and getting worse at all of them. The disciplined posture is to let the market race, keep your automation portable so you can adopt a better engine when it clearly pays to, and change models deliberately on evidence rather than reflexively on hype. Cheap capable AI is a tailwind you benefit from by staying flexible, not by constantly switching.

The practical takeaway

Strip everything down and GLM-5.2 delivers one durable message to a small business: the cost of running capable AI is still falling fast, and the field of strong options keeps widening. You do not need to adopt this particular model, and most readers will not, but you should let its existence update your sense of what is affordable. Automations that felt too expensive to run at scale a year ago are increasingly cheap to operate, and that trend is only accelerating as open models from multiple labs pile pressure on prices.

So the move is not to chase GLM-5.2 specifically. It is to build the automation your business actually needs on a foundation that treats the model as a cheap, swappable commodity, then let the relentless competition between labs work in your favour by keeping your options open and your costs dropping. If you want help identifying which of your repetitive tasks are now cheap enough to automate profitably, and building them so you can ride the falling price of AI rather than betting on any one model, that is precisely what our €49 audit is designed to map out.

The bottom line

GLM-5.2 is a genuinely impressive open-weight model that nearly matches the best AI in the world at roughly a third of the price, and it arrived at a moment that made it a symbol of intensifying global AI competition. For your small business the symbolism matters less than the substance, and the substance is encouraging: capable AI keeps getting cheaper, open models keep the closed giants honest, and the economics of automating real work keep improving whether or not you ever run this specific model.

The winning strategy has not changed and this release confirms it. Do not chase individual models, do not agonise over which one is marginally ahead this month, and do not switch on hype. Build the automation your business needs, keep the model underneath it swappable, and let the falling price of AI reward your patience. GLM-5.2 is one more reason to be confident that the engine will keep getting cheaper and better, which frees you to focus on the only thing that was ever really the point, which is the work you are getting done.

Want to know which tasks are now cheap enough to automate profitably? Start with the €49 audit

Sources

Quick answers

Common questions.

Want this in your business?

The €49 audit shows you exactly which automations would pay back fastest in your specific operation.

€49 entryFull AI audit + strategy call included

Reserve your auditNo commitment. No contracts. Just clarity.