Not even Pokémon is safe from AI benchmarking controversy.
Last week, a post on X went viral, claiming that Google’s latest Gemini model surpassed Anthropic’s flagship Claude model in the original Pokémon video game trilogy. Reportedly, Gemini had reached Lavender Town in a developer’s Twitch stream; Claude was stuck at Mount Moon as of late February.
Gemini is literally ahead of Claude atm in pokemon after reaching Lavender Town
119 live views only btw, incredibly underrated…
Article Source
https://techcrunch.com/2025/04/14/debates-over-ai-benchmarking-have-reached-pokemon/

