Welcome to the UPF Digital Repository

Automatic transcription of flamenco singing from polyphonic music recordings

dc.contributor.author Kroher, Nadine
dc.contributor.author Gómez Gutiérrez, Emilia, 1975-
dc.date.accessioned 2017-11-13T09:37:00Z
dc.date.available 2017-11-13T09:37:00Z
dc.date.issued 2016
dc.identifier.citation Kroher N, Gómez E. Automatic transcription of flamenco singing from polyphonic music recordings. IEEE/ACM Trans Audio, Speech, Language Process. 2016;24(5): 901-13. DOI: 10.1109/TASLP.2016.2531284
dc.identifier.issn 2329-9290
dc.identifier.uri http://hdl.handle.net/10230/33199
dc.description.abstract Automatic note-level transcription is considered one of the most challenging tasks in music information retrieval. The specific case of flamenco singing transcription poses a particular challenge due to its complex melodic progressions, intonation inaccuracies, the use of a high degree of ornamentation, and the presence of guitar accompaniment. In this study, we explore the limitations of existing state-of-the-art transcription systems for the case of flamenco singing and propose a specific solution for this genre: we first extract the predominant melody and apply a novel contour filtering process to eliminate segments of the pitch contour which originate from the guitar accompaniment. We formulate a set of onset detection functions based on volume and pitch characteristics to segment the resulting vocal pitch contour into discrete note events. A quantised pitch label is assigned to each note event by combining global pitch class probabilities with local pitch contour statistics. The proposed system outperforms state-of-the-art singing transcription systems with respect to voicing accuracy, onset detection, and overall performance when evaluated on flamenco singing datasets.
dc.description.sponsorship This work was supported in part by the Ph.D. Fellowship of the Department of Information and Communication Technologies, Universitat Pompeu Fabra and in part by the projects SIGMUS (TIN2012-36650) and COFLA II (P12-TIC-1362).
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2016;24(5): 901-13.
dc.rights © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The final published article can be found at http://ieeexplore.ieee.org/document/7410016/
dc.title Automatic transcription of flamenco singing from polyphonic music recordings
dc.type info:eu-repo/semantics/article
dc.identifier.doi http://dx.doi.org/10.1109/TASLP.2016.2531284
dc.subject.keyword Automatic music transcription
dc.subject.keyword Music information retrieval
dc.subject.keyword Singing voice
dc.subject.keyword Pitch contour
dc.subject.keyword Audio content description
dc.relation.projectID info:eu-repo/grantAgreement/ES/3PN/TIN2012-36650
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/acceptedVersion
