Leni Tops Four Major AI Benchmarks, Outperforming Systems from OpenAI, Anthropic, Google, and Perplexity

By Leni
Publication Date: 2026-05-12 14:31:00

NEW YORK, May 12, 2026 /PRNewswire/ -- Leni, an AI-powered analytics platform for commercial real estate, today announced top-tier results on four independent AI benchmarks. Leni placed first on the DRACO Benchmark for deep research, finished in the top two on SpreadsheetBench Verified, outperformed every public model on BullshitBench, and ranked ahead of Genspark, Manus, and OpenAI Deep Research on GAIA.

“Most teams obsess over models, but the key engineering needed for effective AI adoption, which delivers highly accurate and reliable results for teams, relies on architecture or harness,” said Leni CEO and Co-Founder Arunabh Dastidar. “That’s why the most popular coding tool today is 98 percent harness and 2 percent models. We called it years ago and have produced purpose-built infrastructure that can reliably be used for serious work where accuracy and security are crucial. It shifts…