Two professional organisations representing 800 newspapers and magazines employing over half of journalists in France announced Monday that they are taking “coordinated action” against public datasets used to train generative artificial intelligence services, such as ChatGPT. The Apig, the general news medial alliance, and the Sepm, the magazine publisher’s union, aim to remove their members’ content from Common Crawl, C4 and Oscar.
Source: French press take on digital databases to defend journalist copyright against AI








The prospect of relying heavily on synthetic data hasn’t gone unnoticed by the creative industries. “I believe the main reason companies like OpenAI are having to rely more on synthetic data now is that they’ve run out of high-quality human created data to mine from the public facing internet,” says Reid Southern, a film concept artist and illustrator, adding, “It further distances them from any copyrighted materials they’ve trained on that could land them in hot water.”