Long Short-Term Memory (LSTM) networks, a type of recurrent neural network, are designed to analyze sequential data, which makes them well suited to detecting temporal manipulations in audio. Transformer models, the technology behind advanced language AI like ChatGPT, are also being applied to audio analysis, where they process long sequences of audio data to identify complex patterns and relationships that other systems might miss.
Source: AI vs. audio pirates: catching sophisticated copyright evasion with AI
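To make the LSTM idea above concrete, here is a minimal sketch of a single LSTM layer stepping through a sequence of audio feature frames and producing one score for the whole clip. It is only an illustration, not the method used by any system the article describes: the function names (`init_params`, `lstm_step`, `score_sequence`), the three-value feature frames, and the random untrained weights are all assumptions, so the resulting score carries no meaning until such a model is trained on labeled audio.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    # Plain matrix-vector product on nested lists.
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def init_params(input_size, hidden_size, seed=0):
    # Hypothetical, untrained weights; a real detector would learn these.
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    # One weight matrix and bias per gate:
    # input (i), forget (f), candidate (g), output (o).
    gates = {
        name: (mat(hidden_size, input_size + hidden_size), [0.0] * hidden_size)
        for name in "ifgo"
    }
    readout = [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
    return gates, readout


def lstm_step(x, h, c, gates):
    # Concatenate the current frame with the previous hidden state.
    z = list(x) + list(h)

    def pre(name):
        weights, bias = gates[name]
        return [a + b for a, b in zip(matvec(weights, z), bias)]

    i = [sigmoid(v) for v in pre("i")]
    f = [sigmoid(v) for v in pre("f")]
    g = [math.tanh(v) for v in pre("g")]
    o = [sigmoid(v) for v in pre("o")]
    # The cell state carries information forward across time steps.
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new


def score_sequence(frames, gates, readout):
    # Run the LSTM over every frame in order, then squash the final
    # hidden state into a single value between 0 and 1.
    hidden_size = len(readout)
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    for frame in frames:
        h, c = lstm_step(frame, h, c, gates)
    return sigmoid(sum(w * hj for w, hj in zip(readout, h)))


# Usage: three made-up feature frames standing in for consecutive audio windows.
gates, readout = init_params(input_size=3, hidden_size=4)
frames = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.4], [0.5, 0.5, 0.5]]
score = score_sequence(frames, gates, readout)
```

Because the cell state is updated frame by frame, the final score depends on the order of the frames, which is the property that makes this family of models a natural fit for spotting changes in timing.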