An estimated 90% of the training data for current generative AI systems stems from English. However, English is an international lingua franca with about 1.5 billion speakers worldwide, and countless varieties.
So whose English is today’s technology based on? The answer is primarily the English of mainstream America.
This is no accident. Mainstream American English is entrenched in the digital infrastructure of the internet, in Silicon Valley’s corporate priorities, and in the…
Article Source
https://theconversation.com/ai-systems-are-built-on-english-but-not-the-kind-most-of-the-world-speaks-249710