OpenAI has been accused by many parties of training its AI on copyrighted content without permission. Now a new paper by an AI watchdog organization makes the serious accusation that the company increasingly relied on non-public books it didn’t license to train more sophisticated AI models. The paper, from the AI Disclosures Project, concludes that OpenAI likely trained its GPT-4o model on paywalled books from O’Reilly Media.
Source: Researchers suggest OpenAI trained AI models on paywalled O’Reilly books | TechCrunch