Controversy Surrounding OpenAI and Anthropic's Disregard for Rules Against Online Content Scraping
Money | June 21, 2024, 6:14 p.m.
Generative AI tools rely on vast amounts of web content for model training, with top startups OpenAI and Anthropic found to be bypassing established rules like robots.txt to scrape websites for data. Despite publicly stating they respect such guidelines, these companies are either ignoring or circumventing them, Business Insider reports. TollBit, a startup facilitating paid licensing deals between publishers and AI firms, brought attention to this issue in a letter to major publishers. The letter, reported by Reuters, did not name specific AI companies. This behavior raises concerns about data ethics and copyright infringement, highlighting the need for stricter enforcement of web scraping regulations. To access the full article and stay informed on market, tech, and business news, subscribe to Business Insider for exclusive content.