Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints
Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints
Citació
- Canadas-Quesada FJ, Vera-Candeas P, Ruiz-Reyes N, Carabias-Orti J, Cabanas-Molero P. Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints. EURASIP Journal on Audio, Speech, and Music Processing. 2014; 2014: 26. DOI 10.1186/s13636-014-0026-5
Enllaç permanent
Descripció
Resum
In this paper, unsupervised learning is used to separate percussive and harmonic sounds from monaural non-vocal polyphonic signals. Our algorithm is based on a modified non-negative matrix factorization (NMF) procedure that no labeled data is required to distinguish between percussive and harmonic bases because information from percussive and harmonic sounds is integrated into the decomposition process. NMF is performed in this process by assuming that harmonic sounds exhibit spectral sparseness (narrowband sounds) and temporal smoothness (steady sounds), whereas percussive sounds exhibit spectral smoothness (broadband sounds) and temporal sparseness (transient sounds). The evaluation is performed using several real-world excerpts from different musical genres. Comparing the developed approach to three current state-of-the art separation systems produces promising results.