End-to-end sound source separation conditioned on instrument labels

Citació

Slizovskaia O, Kim L, Haro G, Gomez E. End-to-end sound source separation conditioned on instrument labels. In: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2019 May 12-17; Brighton, United Kingdom. New Jersey: Institute of Electrical and Electronics Engineers; 2019. p. 306-10. DOI: 10.1109/ICASSP.2019.8683800

Enllaç permanent

Descripció

Resum
Can we perform an end-to-end music source separation with a variable number of sources using a deep learning model? This paper presents an extension of the Wave-UNet [1] model which allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. This approach can be further extended to other types of conditioning such as audio-visual source separation and score-informed source separation.
Descripció
Comunicació presentada a: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing celebrat de 12 al 17 de maig de 2019 a Brighton, Regne Unit.
DOI
http://dx.doi.org/10.1109/ICASSP.2019.8683800
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)

Fitxers