The Unbelievable Scale of AI’s Pirated-Books Problem

The Unbelievable Scale of AI’s Pirated-Books Problem

Editor’s note: This analysis is part of The Atlantic’s investigation into the Library Genesis data set. You can access the search tool directly here. Find The Atlantic’s search tool for movie and television writing used to train AI here.


When employees at Meta started developing their flagship AI model, Llama 3, they faced a simple ethical question. The program would need to be trained on a huge amount of high-quality writing to be competitive with products such as ChatGPT, and…

Article Source
https://www.theatlantic.com/technology/archive/2025/03/libgen-meta-openai/682093/?utm_source\u003dfeed

More From Author

Intel Stock Rallies on Leadership Change—Time to Buy or Wait?

Intel Stock Rallies on Leadership Change—Time to Buy or Wait?

Nvidia CEO Jensen Huang on the company buying a stake in Intel: No one invited … – The Times of India

Nvidia CEO Jensen Huang on the company buying a stake in Intel: No one invited … – The Times of India

Listen to the Podcast Overview

Watch the Keynote