Debates over AI benchmarking have reached Pokémon | TechCrunch

Debates over AI benchmarking have reached Pokémon | TechCrunch

Not even Pokémon is safe from AI benchmarking controversy.

Last week, a post on X went viral, claiming that Google’s latest Gemini model surpassed Anthropic’s flagship Claude model in the original Pokémon video game trilogy. Reportedly, Gemini had reached Lavender Town in a developer’s Twitch stream; Claude was stuck at Mount Moon as of late February.

Article Source
https://techcrunch.com/2025/04/14/debates-over-ai-benchmarking-have-reached-pokemon/