Meta Admits Use of ‘Pirated’ Book Dataset to Train AI 

With AI initiatives developing at a rapid pace, copyright holders are on high alert. In addition to legislation, several currently ongoing lawsuits will help to define what’s allowed and what isn’t. Responding to a lawsuit from several authors, Meta now admits that it used portions of the Books3 dataset to train its Llama models. This dataset includes many pirated books.

Source: Meta Admits Use of ‘Pirated’ Book Dataset to Train AI * TorrentFreak

Get the latest RightsTech news and analysis delivered directly in your inbox every week
We respect your privacy.