Which AI chatbot, if any, is good at simple math?

vm_adminDecember 30, 2025

By Servet Yanatma
Publication Date: 2025-12-30 07:01:00

Artificial intelligence (AI) is becoming an integral part of daily life, including everyday calculations. But how well do these systems actually handle basic math? And how much should users trust them?

A recent study advises caution. The Omni Research on Calculation in AI (ORCA) shows that if you ask an AI chatbot to perform everyday math tasks, the chance of an incorrect answer is around 40 percent. Accuracy varies significantly between AI companies and different types of math tasks.

So which AI tools are more accurate and how do they work for different types of calculations, such as statistics, finance or physics?

Results are based on performance on 500 prompts drawn from real-world, calculable problems. Each AI model was tested with the same set of 500 questions. The five AI models were tested in October 2025.

The selected models are:

ChatGPT-5 (OpenAI)
Gemini 2.5 Flash (Google)
Claude 4.5 Sonnet (anthropic)
DeepSeek V3.2 (DeepSeek…

Related Posts