资讯

The Internet Archive can now only crawl Reddit's homepage. Reddit's goal is to block AI firms from scraping Reddit user data. Publishers (and others) are suing AI companies for copyright infringement.
Reddit is now blocking the Internet Archive (IA) from indexing popular Reddit threads after allegedly catching sneaky AI firms—restricted from scraping Reddit—instead simply scraping data from ...
Internet giant Cloudflare says it detected Perplexity crawling and scraping websites, even after customers had added technical blocks telling Perplexity not to scrape their pages.
Cloudflare finds that Perplexity AI is 'repeatedly modifying' the company’s web-crawling bots to evade data-scraping measures on third-party websites.
AI startup Perplexity is accused of scraping content from websites that block such actions. Cloudflare reported deceptive methods used by Perplexity to bypass restrictions.