A comparison of pitch chroma extraction algorithms
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Pérez Fernández, Miguel
- dc.contributor.author Kirchhoff, Holger
- dc.contributor.author Serra, Xavier
- dc.date.accessioned 2023-02-07T07:13:29Z
- dc.date.available 2023-02-07T07:13:29Z
- dc.date.issued 2022
- dc.description Comunicació presentada a: 19th Sound and Music Computing Conference, celebrat del 5 a 12 de juny de 2022 a Sant-Étienne, França.
- dc.description.abstract The pitch chroma is a popular way to represent pitch information in an octave independent way, with applications in automatic chord recognition, cover song identification, audio-to-score alignment, and others. Early chroma extraction algorithms employed expert knowledge to derive pitch chromas from short-time spectra. With the rise of deep learning, the emphasis moved from algorithm design to the structure of the network and the selection of appropriate training data. The approaches perform differently for various types of audio input. We conducted a set of experiments in order to explore the qualitative properties that each algorithm exhibits. These include how the number of concurrent pitches influences the chroma representation, and how noise or unpitched percussion can degrade the performance of the algorithms. We performed a quantitative analysis of various algorithms under these scenarios. The results show that chromas based on deep learning show huge potential, especially when it comes to noise reduction and ignoring non-tonal aspects of the music. However, we also found that some deep learning based chromas fail to accurately detect pitches at lower polyphony levels. We reflect on these results and discuss some paths to improvements for future chroma extraction algorithms.
- dc.format.mimetype application/pdf
- dc.identifier.citation Pérez M, Kirchhoff H, Serra X. A comparison of pitch chroma extraction algorithms. In: Michon R, Pottier L, Orlarey Y, editors. Proceedings of the 19th Sound and Music Computing Conference (SMC/JIM/IFC 2022); 2022 Jun 05-12; Saint-Étienne, France. [place unknown]: SMC Network; 2022. p. 222-9. DOI: 10.5281/zenodo.6573083
- dc.identifier.doi http://dx.doi.org/10.5281/zenodo.6573083
- dc.identifier.uri http://hdl.handle.net/10230/55653
- dc.language.iso eng
- dc.publisher SMC Network
- dc.relation.ispartof Michon R, Pottier L, Orlarey Y, editors. Proceedings of the 19th Sound and Music Computing Conference (SMC/JIM/IFC 2022); 2022 Jun 05-12; Saint-Étienne, France. [place unknown]: SMC Network; 2022. p. 222-9.
- dc.rights © 2022 Miguel Perez et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri https://creativecommons.org/licenses/by/3.0/
- dc.subject.other Algorismes
- dc.title A comparison of pitch chroma extraction algorithms
- dc.type info:eu-repo/semantics/conferenceObject
- dc.type.version info:eu-repo/semantics/publishedVersion