MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions. It is a masked generative non-autoregressive Transformer trained over a 32kHz EnCodec tokenizer with 4 codebooks sampled at 50 Hz. Unlike prior work, MAGNeT doesn't require neither semantic token conditioning nor model cascading, and it generates all 4 codebooks using a single non-autoregressive Transformer.
https://huggingface.co/facebook/magnet-medium-30secs
https://github.com/facebookresearch/audiocraft
'AI > Music' 카테고리의 다른 글
Udio 음악을 생성해보자 (0) | 2024.04.17 |
---|---|
suno.ai 노래 만들어 보기 (0) | 2024.02.27 |
deepmind_DreamTrack_Music AI Tools (0) | 2023.11.20 |
Stable Audio (0) | 2023.09.17 |
AudioCraft: Generative AI for audio made simple and available to all (0) | 2023.08.10 |
댓글