Technology

Reddit blocks Wayback Machine over concerns about data exploitation by AI.

Quoc Duan • August 13, 2025 16:44

Reddit will restrict Wayback Machine's access to most content to prevent AI from exploiting data, only allowing it to save popular homepage and headlines.

Quick summary:

Reddit restricts the Wayback Machine, only allowing it to save popular homepages and headlines.

Reason: Concerns that AI companies might exploit data in violation of policies.

Previously, Reddit blocked APIs and required search engines to pay for data.

Reddit has confirmed that it discovered several AI companies collecting data from the Internet Archive's Wayback Machine, violating its platform policies. Therefore, the social network will restrict access, allowing Wayback Machine to only store the Reddit.com homepage and a list of popular headlines, instead of all posts, comments, or user profiles as before.

Reddit chặn Wayback Machine vì lo ngại dữ liệu bị AI khai thác

Spokesperson Tim Rathschmidt said Reddit required the Internet Archive to comply with its privacy policies and remove the removed content before restoring full access.

According to Reddit, the restrictions will be rolled out gradually starting today. The company contacted the Internet Archive in advance to inform them of this decision and had previously expressed concerns about content being collected from the Wayback Machine.

This isn't the first time Reddit has blocked data-scanning tools. In 2023, Reddit changed its API policy, causing many third-party applications to shut down after being unable to pay for data access – the reason being that these APIs were being used to train AI.

Last year, Reddit signed a contract to provide data to Google for search and AI training, and began blocking other major search engines if they didn't pay. The company also reached an agreement with OpenAI, but sued Anthropic in June 2024 for allegedly continuing to scan data despite having announced it had stopped.

Mark Graham, director of Wayback Machine, said the Internet Archive has a long-standing relationship with Reddit and is still in discussions about the matter.

Reddit blocks Wayback Machine over concerns about data exploitation by AI.

Reddit

Wayback Machine

See more about Technology

Read more

Reddit blocks Wayback Machine over concerns about data exploitation by AI.

Reddit

Wayback Machine

See more about Technology

Read more

Log in