OpenAI may soon be forced to explain why it deleted a pair of controversial datasets composed of pirated books, and the stakes could not be higher. At the heart of a class-action lawsuit from authors alleging that ChatGPT was illegally trained on their works, OpenAI’s decision to delete the datasets could end up being a deciding factor that gives the authors the win.
Source: OpenAI desperate to avoid explaining why it deleted pirated book datasets
OpenAI must produce millions of anonymized chat logs from ChatGPT users in its high-stakes copyright dispute with the New York Times and other news outlets, a federal judge in Manhattan ruled. U.S. Magistrate Judge Ona Wang in a decision made public on Wednesday that the 20 million logs were relevant to the outlets’ claims and that handing them over would not risk violating users’ privacy.
The newspaper alleged Perplexity AI had distributed and displayed journalists’ work without permission en masse. The Times said that Perplexity AI was also violating its trademarks under the Lanham Act, claiming the startup’s generative AI products create fabricated content, or “hallucinations”, and falsely attribute them to the newspaper by displaying them alongside its registered trademarks.
Can an ISP be held liable for piracy simply by “doing nothing”? Yesterday, the Supreme Court addressed this billion-dollar question. While record labels argued that Cox turned a blind eye to “habitual abusers,” the ISP warned that expanding liability without proof of active intent would turn internet providers into “Internet Police” and threaten essential access for hospitals, schools, or even entire towns.
The EU on Wednesday unveiled new proposals to simplify AI and privacy regulations, drawing fire from the tech sector for not going far enough and consumer groups for bowing to Big Tech. The EU Commission’s “Digital Omnibus”, which faces debate and votes from European countries, proposed to delay stricter rules on use of AI in “high-risk” areas until late 2027, ease rules around cookies and enable more use of data.

In a significant shift, policymakers in Brussels are moving to scale back and simplify landmark rules for artificial intelligence and data privacy. Driven by growing concern that overregulation is stifling economic growth, officials and business leaders across the 27-nation bloc are questioning whether Europe’s digital rulebook has gone too far and left companies lagging the United States and China.