Walmart’s AI workflows meet the realities of the balance sheet

Walmart is rationing AI tokens. That single fact reveals more about the current state of enterprise AI adoption than a thousand keynote speeches. After encouraging 2.1 million employees to "experiment" with its internal assistant, Code Puppy, the retail giant has now slapped a monthly token budget on each worker. The party is over. The unlimited free lunch of corporate AI, it turns out, comes with a bill. And that bill is getting shockingly large.

Hot

Quality

Impact

Analysis 深度分析

Let’s be blunt: this was inevitable. The entire corporate playbook for the last 18 months has been to shove generative AI into every workflow imaginable, measuring success by raw adoption metrics. The goal was velocity, not value. It created a perverse incentive system. We saw the rise of “token maxxing”—a term that sounds like a crypto-bro punchline but is now a literal line item on expense reports. Employees, tasked with proving they are “AI-literate,” understandably ran every query, no matter how trivial, through the most powerful model available. When Sequoia Capital’s partner told the Wall Street Journal “we all should be tokenmaxxing” back in April, he was codifying a strategy that many companies were already practicing. Leaderboards celebrated the top prompters. It was all fun and games until the CFO saw the invoice from Anthropic or OpenAI.

The core problem isn’t the technology; it’s the accounting. The industry spent a year evangelizing a shift from fixed-cost software subscriptions to a utility-based, pay-per-inference model. This made perfect sense for vendors scaling up GPU clusters, but it handed companies a catastrophic budgeting nightmare. How do you plan for a tool whose cost scales directly with the ambition of your employees? You can’t. So you do the only thing you can: you ration. Walmart’s new guidance—“use AI where it creates value”—is a tacit admission that the previous, more permissive guidance was financially reckless. They built a culture of experimentation, and the bill for that culture arrived.

The real tell is in the instruction to “choose the right AI tool for any given task.” This is a desperate plea to stop using a frontier “thinking model” to format a spreadsheet. It’s a direct response to the computational bloat of the latest models, which use chains-of-thought reasoning that burn through tokens by the thousands for each turn of conversation. The hidden cost of AI sophistication is now painfully transparent. A “simple” query about quarterly sales becomes an expensive, multi-step internal monologue for the model.

Furthermore, the dream of autonomous, multi-agent workflows—a future where swarms of AIs collaborate on complex projects—is revealing itself to be a potential budgetary black hole. An agent tasked with “create a Q3 marketing plan” doesn’t just give you a document. It spawns sub-agents: one to research trends, another to analyze competitors, a third to draft copy, and a fourth to review the drafts. Each spawns its own chain of thought, each interaction billable. The cost isn’t just in the final output; it’s in the entire iterative, often failing, process of refinement. Walmart’s token caps are a blunt instrument to stop this digital sprawl before it bankrupts a division.

This episode exposes a profound hypocrisy in the AI boom. Companies pushed a narrative of AI as a foundational, empowering utility—like electricity. But you don’t meter the electricity in your office by the watt, telling employees to use it “judiciously” and only for “valuable” activities. That would be absurd. The move to pay-per-token admits AI isn’t a utility at all; it’s a luxury good, a resource to be carefully managed and allocated.

The productivity gains from AI are real, but they are now being weighed against a hard, quantifiable cost. The metric is no longer “how many employees are using AI?” but “what is the net value of that use after paying for the tokens?” This will force a brutal triage. Trivial, automatable tasks will be pushed back to cheaper, older software. AI will be reserved for high-complexity, high-return problems, and its use will be governed by ROI analyses, not enthusiasm. The era of “AI for everything” is dead. The era of “AI for what we can afford” has begun. And for all the talk of artificial intelligence, the hardest problem for most corporations remains basic, human accounting.

沃尔玛正在对AI代币实行配额限制。这一事实本身，比上千场主题演讲更能揭示企业级AI应用的现状。这家零售巨头在鼓励210万名员工“尝试”内部助手“代码小狗”之后，如今却为每位员工设置了每月代币使用限额。盛宴已经结束。事实证明，企业AI的“无限免费午餐”早已标好了价格，而且这笔账单正变得惊人地高昂。

坦白说：这实属必然。过去18个月，整个企业的操作手册都是将生成式AI强行推入每一个可能的流程，并以粗暴的采用指标来衡量成功。追求的是速度，而非价值。这催生了一种扭曲的激励机制。我们目睹了“代币最大化”的兴起——这个术语听起来像加密货币圈的段子，如今却成了费用报告上实实在在的科目。员工为了证明自己具备“AI素养”，很自然地会将所有查询，无论多么琐碎，都丢给最强大的模型去处理。四月份，红杉资本的合伙人在《华尔街日报》上说“我们都应该代币最大化”时，他其实是在为许多公司已在实践的策略正名。排行榜在表彰最会用提示词的人。这一切本来挺有意思，直到首席财务官看到来自Anthropic或OpenAI的账单。

核心问题不在于技术，而在于会计核算。整个行业花了一年时间鼓吹从固定成本的软件订阅转向基于公用事业、按推理付费的模式。这对供应商扩展GPU集群而言完全合理，却给企业带来了灾难性的预算噩梦。当一个工具的成本与你的员工的雄心壮志直接挂钩时，你该如何规划？你无法规划。所以你只能做唯一能做的事：实行配额限制。沃尔玛的新指引——“在能创造价值的地方使用AI”——等于默认承认，之前更宽松的指引在财务上是鲁莽的。他们构建了一种实验文化，而这张文化的账单如今到了付款的时候。

真正的信号在于……

Disclaimer: The above content is generated by AI and is for reference only.

大模型 Agent 编程

Read Original →

Analysis 深度分析

Related Articles 相关文章