The foundation of any generative AI model is the underlying data. Developing generative AI typically requires exceptionally large datasets, especially in the pre-training step. The data used in this step forms the foundation of the model in the chosen domain, such as language or images. The volume and quality of data required to pre-train a generative AI model from scratch may impact the ability of new players to enter the market.