eArticles

Home

eArticles

검색결과 돌아가기

검색화면

Export 프린트

Spectrogram Inpainting for Interactive Generation of Instrument Sounds

Resource Type: Working Paper
Authors: Bazin, Théis; Hadjeres, Gaëtan; Esling, Philippe; Malt, Mikhail
Source: Proceedings of the 1st Joint Conference on AI Music Creativity, 2020 (p. 10). Stockholm, Sweden: AIMC
Subject: Computer Science - Sound
Computer Science - Artificial Intelligence
Computer Science - Human-Computer Interaction
Electrical Engineering and Systems Science - Audio and Speech Processing
Language

Online Access

초록

Modern approaches to sound synthesis using deep neural networks are hard to control, especially when fine-grained conditioning information is not available, hindering their adoption by musicians. In this paper, we cast the generation of individual instrumental notes as an inpainting-based task, introducing novel and unique ways to iteratively shape sounds. To this end, we propose a two-step approach: first, we adapt the VQ-VAE-2 image generation architecture to spectrograms in order to convert real-valued spectrograms into compact discrete codemaps, we then implement token-masked Transformers for the inpainting-based generation of these codemaps. We apply the proposed architecture on the NSynth dataset on masked resampling tasks. Most crucially, we open-source an interactive web interface to transform sounds by inpainting, for artists and practitioners alike, opening up to new, creative uses.
Comment: 8 pages + references + appendices. 4 figures. Published as a conference paper at the The 2020 Joint Conference on AI Music Creativity, October 19-23, 2020, organized and hosted virtually by the Royal Institute of Technology (KTH), Stockholm, Sweden

공지

DAU Library

eArticles

요약정보

Spectrogram Inpainting for Interactive Generation of Instrument Sounds

Online Access

초록