Nvidia’s closest rival once again obliterates cloud giants in AI performance; Cerebras Inference is 75x faster than AWS, 32x faster than Google on Llama 3.1 405B


  • Cerebras hits 969 tokens/second on Llama 3.1 405B, 75x faster than AWS
  • Claims industry-low 240ms latency, twice as fast as Google Vertex
  • Cerebras Inference runs on the CS-3 with the WSE-3 AI processor

Cerebras Systems says it has set a new…

Article Source
https://www.techradar.com/pro/nvidias-closest-rival-once-again-obliterates-cloud-giants-in-ai-performance
