Perplexity unveils hybrid agentic inference: What it is and why it matters

By @wion
Publication Date: 2026-06-03 16:52:00

AI company Perplexity has announced a new feature called “hybrid agentic inference” for its Personal Computer platform, aiming to solve one of the biggest challenges facing artificial intelligence today: deciding what should run on your device and what should run in the cloud.
The company says the new system will automatically split AI tasks between local models running on a user’s computer and larger, more powerful models operating in remote data centres. The goal is to improve privacy, reduce costs and deliver better performance without requiring users to make technical decisions themselves. The announcement comes as technology companies increasingly look for ways to combine the speed and privacy of on-device AI with the power of cloud-based systems.

What is hybrid agentic inference?

Hybrid agentic inference is a system that allows AI workloads to be shared between a user’s device and cloud-based AI models. Instead of sending every request to a remote server, Perplexity’s system first evaluates the task and decides where different parts should be processed. For example, if a user asks an AI assistant to analyse financial records, personal documents or health information, the sensitive data can remain on the device. More complex reasoning tasks can then be sent to larger cloud-based AI models when additional computing power is needed. According to Perplexity, the entire process happens automatically without requiring users to choose between…
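The routing idea described above can be illustrated with a minimal sketch. This is not Perplexity's implementation, which the company has not detailed here; the `Subtask` type, the `SENSITIVE_MARKERS` keyword list and the `route` function are all hypothetical names chosen for illustration, and a real system would presumably use a far more capable classifier than keyword matching.

```python
from dataclasses import dataclass

# Hypothetical keyword list, assumed for illustration only.
SENSITIVE_MARKERS = ("financial", "health", "personal")


@dataclass
class Subtask:
    description: str
    needs_heavy_reasoning: bool = False


def route(subtask: Subtask) -> str:
    """Return "local" or "cloud" for one subtask (illustrative rule only)."""
    text = subtask.description.lower()
    # Anything that mentions sensitive data stays on the device.
    if any(marker in text for marker in SENSITIVE_MARKERS):
        return "local"
    # Non-sensitive work that needs more compute goes to the cloud.
    if subtask.needs_heavy_reasoning:
        return "cloud"
    # Default: keep light work on the device.
    return "local"


def plan(subtasks: list[Subtask]) -> list[tuple[str, str]]:
    """Pair each subtask description with its chosen destination."""
    return [(s.description, route(s)) for s in subtasks]
```

In this sketch the privacy check runs first, so a sensitive subtask is kept local even when it is flagged as needing heavy reasoning. That ordering is a design assumption, not something the announcement specifies.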