By The Cloudflare Blog
Publication Date: 2026-06-15 13:00:00
Today we are excited to announce that key members of the Ensemble AI team are joining Cloudflare to accelerate our work in AI infrastructure and make it easier for developers to efficiently run powerful AI models at scale.
Founded in San Francisco in 2023, Ensemble AI has spent the last few years focused on one of the most important challenges in AI: making large models faster, smaller, and more cost-effective to run without sacrificing quality. The team has developed new approaches to model compression and efficient inference that aim to reduce the storage, computation, and deployment overhead of large language models and multimodal architectures.
As AI becomes a central part of how developers build applications, the economics of inference are more important than ever. Models are getting bigger, and workloads are becoming more dynamic. Customers increasingly expect AI to be available everywhere: globally distributed, fast, reliable, and affordable. We bring the Ensemble AI team with us…