In the Token Battle, Huawei Cloud Chose the Third Path | Frontline
When Zhou Yuefeng stated on stage, "We don't care about the total number of Tokens," some in the audience might have dismissed it as mere rhetoric. After all, over the past year, the entire industry has been fiercely competing over token call volumes and pricing. When Volcengine ignited the price war with a rate of 0.0008 yuan per thousand tokens—followed by Alibaba, Baidu, and Tencent joining the fray—any cloud provider claiming "we don't care" would sound like a detached remark on the battlefi
Analysis
When Zhou Yuefeng declared "We don't care about the total number of Tokens" on stage, some in the audience might have seen it as empty talk. After all, the entire industry has been locked in a cutthroat battle over token call volumes and pricing over the past year. With Volcengine igniting the price war at 0.0008 yuan per thousand tokens—followed by Alibaba, Baidu, and Tencent entering the fray—any cloud provider claiming "we don't care" would sound like a detached remark on the battlefield. Yet Huawei Cloud seems genuinely intent on pulling this competition onto an entirely different track.
This is not merely a shift in rhetoric. While other vendors highlight "trillions of tokens" and "ultra-low unit prices" as core metrics, Huawei Cloud emphasizes "health" and "productivity enhancement"—essentially acknowledging an uncomfortable reality: amid the frenzied token production pipeline, much of it is fueled by "emotional value" bubbles. The example of casually asking an AI question on a phone precisely punctures the industry's facade of prosperity painted by call data. Tokens generated from casual chat and those produced in factory scheduling or hospital pathology diagnostics hold vastly different values. The former's boom may just be an AI reincarnation of mobile internet traffic anxiety, while even slight growth in the latter signifies real-world change.
This move is risky but smart. With computing power supply—especially high-end GPUs—continuously constrained, fighting head-on over the scale of token production pipelines would be like engaging in a war of attrition on the opponent's turf. Huawei Cloud's proposed "Agentic Infra" paradigm and its AICS Lingqu AI computing cluster, dubbed a "token factory," aim to "build the factory on our own foundation." Behind parameters like 100,000-card clusters and 10-millisecond latency lies a clear subtext: leveraging domestic computing power such as Ascend chips to establish a sustainable token production line unbound by supply bottlenecks. While competitors are still comparing whose factory consumes less energy, Huawei Cloud is saying: "First, I must ensure my factory can run reliably."
This represents a strategic "flanking maneuver." Price wars target market share in existing segments, but Huawei Cloud aims to define the rules for emerging markets. By collaborating with model developers like Zhipu, DeepSeek, and Kimi on "Hundred Models, Thousand Forms," and open-sourcing the OpenJiuwen agent platform, it is more akin to building an "Android Alliance for the AI era." Zhou Yuefeng’s repeated emphasis on being "the most open cloud" is not about selling its own models but becoming the "silicon-based fertile ground" for deploying all models. This path mirrors Huawei's traditional approach in telecommunications: not necessarily reigning as the flashiest terminal but becoming an indispensable player at the level of underlying infrastructure and protocols.
Of course, strategic elegance requires tangible results to back it up. When clients are still troubled by monthly cloud bills, the argument of "productivity value" demands solid case studies to be convincing. Huawei Cloud’s introduction of a medical pathology foundation model is a strong example—enabling county-level hospitals to achieve tier-three diagnostic capabilities addresses a clear societal pain point with undeniable value. The CloudRobo platform for embodied intelligence similarly tackles the common challenge of small and medium enterprises "affording computing power." These scenarios cannot be measured by "call volumes," but they serve as the most robust footnotes for Huawei Cloud’s "third path."
The sharpest point of this launch lies in its near-direct critique of a current AI competition trap: we may be using the old internet’s "traffic mindset" to operate an entirely new intelligent productivity system. Stacking token volumes resembles early internet competitions over website page views—seemingly vibrant but far from generating real economic value. Huawei Cloud’s pivot is a correction to the industry’s collective unconscious, even though this "deviation" is currently the hotspot chased by capital and traffic.
Avoiding price wars means Huawei Cloud will face pressure in acquiring new public cloud customers in the short term. However, Huawei Cloud appears willing to bet on its identity as a "long-termist." It wagers that when the industry’s frenzy subsides, the value of a cloud provider will ultimately be measured not by how many cheap tokens it produces, but by how many efficiency screws its token factory has tightened for the real economy, and how many critical pieces it has contributed to the domestic technology stack. The war over tokens may have only just entered deeper waters.
Disclaimer: The above content is generated by AI and is for reference only.