Nvidia Follows Google's Playbook With $20 Billion Groq Bet

By Janakiram MSV, Senior Contributor
Publication Date: 2026-03-29 12:16:00

Groq LPU

Nvidia

Nvidia licensed Groq’s inference technology for $20 billion last December and unveiled the resulting Groq 3 language processing unit at GTC 2026 in San Jose. The move represents the clearest admission yet from the world’s dominant AI chipmaker that graphics processing units, however powerful, are not the right tool for every AI workload. For enterprise technology leaders planning infrastructure investments, the implications are significant. The era of one-chip-fits-all AI computing is ending.

The deal, Nvidia’s largest ever, brought Groq founder Jonathan Ross and senior leaders to Nvidia along with a perpetual license to Groq’s patent portfolio and software stack. Groq continues to operate independently under new CEO Simon Edwards, but the technology now sits at the heart of Nvidia’s inference roadmap.

AI computing broadly splits into two phases. Training builds a model by processing massive datasets over weeks or months. Inference runs the trained model to generate responses, images and decisions in real time. Nvidia’s GPUs have dominated training because of their raw parallel processing power. But inference, particularly the autoregressive token generation that powers chatbots and AI agents, presents a fundamentally different computational profile.

The bottleneck during inference is not floating-point arithmetic. It is memory bandwidth, the speed at which a processor can move model weights and intermediate data through the chip….

Related Posts