ScaleFlux, FarmGPU and Lightbits Labs preview the long-context AI inference solution on NVIDIA GTC

ScaleFlux, FarmGPU and Lightbits Labs preview the long-context AI inference solution on NVIDIA GTC

By Business Wire
Publication Date: 2026-03-11 13:22:00

NVIDIA Terms and Conditions – San Jose | 16-19. March 2026 | Booth 7006

SAN JOSE, Calif., March 11, 2026–(BUSINESS WIRE)–ScaleFlux, FarmGPU and Lightbits Labs today announced the public debut of a collaborative architecture designed to solve one of the most persistent challenges in AI inference: the memory and I/O limitations imposed by long-context workloads.

At NVIDIA GTC San Jose in March, the companies will unveil an implementation that brings together ScaleFlux’s high-performance NVMe, FarmGPU’s managed inference environment, and Lightbits’ LightInferra™ software to solve how to persist, reuse, and stream KV cache data across inference sessions more efficiently, reduce GPU glitches through repeated context recalculations, and open the door to more predictable, scalable Open up performance and infrastructure efficiency.

“We are transforming inference storage from a reactive cache to an intelligent, streamed data layer,” said Arthur Rassmuson, director of AI…