Meta Admits Use of ‘Pirated’ Book Dataset to Train AI

January 12, 2024By rightstech

With AI initiatives developing at a rapid pace, copyright holders are on high alert. In addition to legislation, several currently ongoing lawsuits will help to define what’s allowed and what isn’t. Responding to a lawsuit from several authors, Meta now admits that it used portions of the Books3 dataset to train its Llama models. This dataset includes many pirated books.

Source: Meta Admits Use of ‘Pirated’ Book Dataset to Train AI * TorrentFreak

Share this: