With AI initiatives developing at a rapid pace, copyright holders are on high alert. In addition to legislation, several currently ongoing lawsuits will help to define what’s allowed and what isn’t. Responding to a lawsuit from several authors, Meta now admits that it used portions of the Books3 dataset to train its Llama models. This dataset includes many pirated books.
Source: Meta Admits Use of ‘Pirated’ Book Dataset to Train AI * TorrentFreak