TAPE: An End-to-End Timbre-Aware Pitch Estimator

Tamer, Nazif C; Özer, Yigitcan; Müller, Meinard; Serra, Xavier

TAPE: An End-to-End Timbre-Aware Pitch Estimator

Mostra el registre complet Registre parcial de l'ítem

dc.contributor.author Tamer, Nazif C
dc.contributor.author Özer, Yigitcan
dc.contributor.author Müller, Meinard
dc.contributor.author Serra, Xavier
dc.date.accessioned 2023-04-25T13:55:00Z
dc.date.available 2023-04-25T13:55:00Z
dc.date.issued 2023-04-25
dc.description This work has been accepted at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023), at Rhodes, Greece. June 4-10, 2023.
dc.description.abstract Pitch estimation of a target musical source within a multi-source polyphonic signal is of great interest for music performance analysis. One possible approach for extracting the pitch of a target source is to first perform source separation and then estimate the pitch of the separated track. However, as we will show, this typically leads to poor results. As an alternative to this approach, we introduce a timbre-aware pitch estimator (TAPE), which estimates the pitch of a target source in an end-to-end manner without the need for an explicit source separation step. Opposed to existing approaches that assume the predominance of a lead voice, our approach builds upon other cues that only rely on the timbral characteristics. Our results on real violin-piano duets show that, without any pre-processing step, TAPE outperforms the sequential procedure of source separation and pitch estimation under many settings, even if the target source is not predominant.ca
dc.description.sponsorship This research is funded by the project Musical AI - PID2019-111403GB-I00/AEI/10.13039/501100011033 funded by the Spanish Ministerio de Ciencia, Innovacion y Universidades (MCIU) and the Agencia Estatal de Investigacion (AEI), and by the German Research Foundation (DFG MU 2686/10-2).
dc.format.mimetype application/pdf*
dc.identifier.uri http://hdl.handle.net/10230/56565
dc.language.iso engca
dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/PID2019-111403GB-I00
dc.rights.accessRights info:eu-repo/semantics/openAccessca
dc.subject.keyword Pitch Estimation
dc.subject.keyword Audio Source Separation,
dc.subject.keyword Music Performance Analysis
dc.subject.keyword Weakly Supervised Learning
dc.title TAPE: An End-to-End Timbre-Aware Pitch Estimatorca
dc.type info:eu-repo/semantics/preprintca
dc.type.version info:eu-repo/semantics/submittedVersionca

Col·leccions

Informes (Departament de Tecnologies de la Informació i les Comunicacions)