Site icon VMVirtualMachine.com

NVIDIA Has Managed to Reduce Token Costs by a Whopping 10x With Its Newest Blackwell Platform, Credited to Team Green’s “Extreme Codesign” Approach

NVIDIA Has Managed to Reduce Token Costs by a Whopping 10x With Its Newest Blackwell Platform, Credited to Team Green’s “Extreme Codesign” Approach

By Muhammad Zuhair
Publication Date: 2026-02-12 16:38:00

NVIDIA’s Blackwell platform has brought new levels of token optimization to AI inference workloads, as the company reveals a massive milestone in the realm of tokenomics.

NVIDIA’s GB200 NVL72 Achieves 10x Better Tokenomics Than Hopper, Credited “Expert-Level” Parallelism

While NVIDIA has been racing to build new infrastructure in the AI world, one of the company’s biggest focuses has been improving the efficiency of the hardware it deploys. And, with the Blackwell-trained frontier AI models dropping in the industry, we have seen how NVIDIA has progressed with token output and costs, and now, in a new blog post, the company has revealed that they have been working with businesses to scale up Blackwell performance, showing a significant ten-fold improvement over the Hopper generation.

That’s why leading inference providers including Baseten, DeepInfra, Fireworks AI and Together AI are using the NVIDIA Blackwell platform, which helps them reduce cost per token by up to 10x compared with the NVIDIA Hopper platform. These providers host advanced open source models, which have now reached frontier-level intelligence.

By combining open source frontier intelligence, the extreme hardware-software codesign of NVIDIA Blackwell and their own optimized inference stacks, these providers are enabling dramatic token cost reductions for businesses across every industry.

– NVIDIA

While discussing tokenomics on Blackwell, NVIDIA has labeled…

Exit mobile version