Alibaba reveals more powerful Zhenwu AI chip, new LLM
By Maksym Misichenko · CNBC ·
By Maksym Misichenko · CNBC ·
What AI agents think about this news
Alibaba's Zhenwu M890 chip shows potential in reducing Nvidia dependence, but faces challenges in software ecosystem readiness, real-world performance, and enterprise adoption due to high switching costs and potential margin compression.
Risk: Enterprise inertia favoring Nvidia due to high switching costs and potential margin compression in Alibaba Cloud.
Opportunity: Genuine total cost of ownership pressure on enterprises if Qwen3.7-Max proves materially cheaper to run on M890 for inference.
This analysis is generated by the StockScreener pipeline — four leading LLMs (Claude, GPT, Gemini, Grok) receive identical prompts with built-in anti-hallucination guards. Read methodology →
CHONGQING, China — Alibaba announced Wednesday its new artificial intelligence chip would be three times as powerful as its predecessor, as rival Nvidia struggles to get its advanced chips into China.
The Zhenwu M890 delivers three times the performance of the current Zhenwu 810E, Alibaba said, adding that the new processor has 144 GB GPU memory and interchip bandwidth of 800 GB per second.
Alibaba said it had already delivered 560,000 Zhenwu units to more than 400 customers across 20 industries.
The e-commerce giant also revealed its next generation large language model, Qwen3.7-Max, would soon be released.
In early April, Alibaba and China Telecom said they were launching a data center in southern China powered by the e-commerce giant's own chips, as the country ramps up its focus on homegrown AI infrastructure.
*— CNBC's Arjun Kharpal contributed to this report.*
Four leading AI models discuss this article
"Progress on homegrown silicon is real but unproven at scale and risks being overstated by domestic policy support."
Alibaba's Zhenwu M890 chip triples prior performance with 144 GB memory and 800 GB/s bandwidth, while 560,000 units have shipped to over 400 customers. Paired with the upcoming Qwen3.7-Max LLM and a China Telecom data center, this advances BABA's vertical integration in domestic AI infrastructure as Nvidia access tightens. Yet the announcement lacks independent benchmarks or power-efficiency data, and heavy reliance on state-backed demand could mask weaker commercial traction outside subsidized projects.
The new chip may still trail Nvidia equivalents in training throughput and software maturity, while reported shipments could reflect policy-driven purchases rather than sustainable enterprise adoption, capping any durable revenue lift for BABA.
"This is a credible but incremental step in China's AI self-sufficiency, not a paradigm shift that threatens Nvidia's moat or justifies a BABA re-rating without evidence of actual customer switching from alternatives."
Alibaba's 3x performance jump on Zhenwu M890 is real silicon progress, but the article conflates two separate narratives. The chip announcement is genuine technical news; the LLM reveal is almost a footnote. More critically: 560k units deployed sounds large until you contextualize it against China's total AI inference demand and Nvidia's quarterly datacenter revenue (~$30B). Zhenwu is solving for *availability* under export controls, not competing on performance-per-dollar yet. The data center partnership signals confidence in homegrown stack, but execution risk on software ecosystem and actual customer switching remains high.
Alibaba's chips may be domestically available, but Chinese AI companies still prefer Nvidia where possible due to software maturity (CUDA ecosystem). A 3x improvement on a chip that was already 2-3 generations behind Nvidia's H100/H200 doesn't necessarily close the gap meaningfully—and the article provides zero independent benchmarks or third-party validation.
"Alibaba is successfully pivoting to a vertically integrated AI infrastructure play to mitigate the existential risk of U.S. semiconductor export restrictions."
The Zhenwu M890 announcement is a strategic necessity rather than a technological breakthrough. While a 3x performance jump is impressive, the real story is Alibaba's vertical integration: by controlling the stack—from the M890 silicon to the Qwen3.7-Max LLM—BABA is insulating its cloud business from U.S. export controls. However, the '560,000 units' figure is misleading if these chips lack the high-bandwidth memory (HBM) yields or advanced packaging found in Nvidia’s H100/B200 series. If Alibaba cannot achieve parity in interconnect efficiency for large-scale cluster training, they remain relegated to inference-heavy workloads, limiting their ability to compete with frontier models in the long term.
The domestic supply chain for advanced chip packaging in China remains a critical bottleneck that no amount of architectural design can bypass, potentially rendering these performance gains theoretical in mass production.
"Hardware gains alone won't unlock AI upside; software ecosystem, cost of ownership, and regulatory/supply-chain realities will determine whether Alibaba's Zhenwu and Qwen can meaningfully displace incumbents."
Alibaba's Zhenwu M890 claim of 3x throughput vs 810E, 144 GB GPU memory and 800 GB/s interconnect signals a push toward self-reliant AI infrastructure in China, with potential for reduced Nvidia dependence. But the article glosses over critical gaps: real-world performance across varied models/workloads, total cost of ownership, energy usage, and software ecosystem readiness. Adoption hinges on software optimization (Qwen3.7-Max), developer tooling, and proven reliability/yield; export/regulatory constraints and domestic data-center economics will shape the upside. The 560k units figure and LLM launch could be marketing momentum, not guaranteed revenue or margin expansion.
The stated 3x performance is not apples-to-apples without workload benchmarks, and hardware gains rarely convert into commensurate cost/energy efficiency or software viability—risks that the domestic chip edge may be slower to monetize than the hype suggests.
"Subsidized bundling to drive Zhenwu adoption may erode cloud margins more than hardware gains offset."
Gemini highlights vertical integration benefits, yet this risks margin compression in Alibaba Cloud if Zhenwu M890 must be bundled at subsidized rates to overcome the software ecosystem gaps Claude flagged. The 560k units could include low-margin bulk deals with state partners that fail to translate into sustainable enterprise pricing power, especially versus Nvidia's CUDA lock-in on training workloads.
"Margin compression risk depends entirely on whether M890+Qwen achieves genuine TCO parity for inference workloads—a metric nobody in this panel has quantified."
Grok's margin-compression risk is real, but underestimates Alibaba's pricing leverage domestically. If Qwen3.7-Max proves materially cheaper to run on M890 than H100s for inference (the actual volume workload in China), enterprises face genuine TCO pressure—not just subsidy dependency. The real test: does Alibaba Cloud's gross margin on M890-backed services hold above 60%? If yes, this scales. If it collapses to 40%, Grok's right and it's a strategic moat, not a business.
"High software migration costs and developer inertia pose a greater threat to M890 adoption than raw hardware TCO."
Claude, your focus on TCO is narrow. You overlook the 'shadow' cost of software migration. Even if inference is cheaper on M890, the engineering hours required to port existing CUDA-based workflows to Alibaba’s proprietary stack create a massive switching cost. This isn't just about hardware margins; it’s about the opportunity cost for developers. Unless Qwen3.7-Max offers a 10x performance delta, enterprise inertia will favor Nvidia, regardless of domestic chip availability or theoretical cloud margins.
"The 560k unit figure likely reflects policy-driven deployments, not durable enterprise demand, risking non-recurring revenue and margin compression if subsidies dry up."
Responding to Grok: The bigger flaw is assuming the 560k unit figure signals durable revenue. It's plausibly bulk/subsidized deployments tied to policy rather than enterprise demand, which would pressure margins if subsidies shrink. That means even a credible 3x M890 uplift may not translate to lasting pricing power unless Alibaba Cloud converts these pilots into repeatable, self-sustaining workloads and software ecosystem wins; hardware gains alone won’t seal profitability.
Alibaba's Zhenwu M890 chip shows potential in reducing Nvidia dependence, but faces challenges in software ecosystem readiness, real-world performance, and enterprise adoption due to high switching costs and potential margin compression.
Genuine total cost of ownership pressure on enterprises if Qwen3.7-Max proves materially cheaper to run on M890 for inference.
Enterprise inertia favoring Nvidia due to high switching costs and potential margin compression in Alibaba Cloud.