Posted by

AI scraping for LLM training data has significantly strained Wikipedia's infrastructure

From January 2024 to April 2025, the site's bandwidth increased by 50% as automated bots downloaded terabytes of data for the large language models powering AI tools. The Wikimedia Foundation found that bots accounted for 65% of the highest demand requests (e.g., videos) despite representing just 35% of page views.

Similar Posts

Showing 1440 posts similar to AI scraping for LLM training data has significantly strained Wikipedia's infrastructure

You've reached the end.