AI LLMs Astonishingly Bad At Doing Proofs And Disturbingly Using Blarney In Their Answers

AI LLMs Astonishingly Bad At Doing Proofs And Disturbingly Using Blarney In Their Answers

In today’s column, I examine an insightful AI research study that sought to ascertain whether state-of-the-art generative AI and large language models (LLMs) are any good at devising mathematical proofs.

You see, there have been lots of flashy press that proclaim LLMs can do noticeably well at solving complex math problems, but those experiments and tests tend to…

Article Source
https://www.forbes.com/sites/lanceeliot/2025/04/08/ai-llms-astonishingly-bad-at-doing-proofs-and-disturbingly-using-blarney-in-their-answers/