Nvidia (NVDA) announced his new AI audio model, Fugattothis week, which can generate or transform “any mix of music, voices, and sounds described with prompts using any combination of text and audio files.”
Fugatto is short for Foundational Generative Audio Transformer Opus 1, Nvidia said.
With the new model, users can enter a text prompt and generate a sample of music, remove or add instruments to an already existing song, change the accents or emotions of a voice, and “produce never-before-heard sounds” .
“Fugatto is the first fundamental generative AI model that exhibits emergent properties – capabilities that arise from the interaction of its various trained capabilities – and the ability to combine free-form instructions,” Nvidia said.