Dzhambazov, Georgi BogomilovMiron, MariusSerra, Xavier2018-04-092018-04-092016Dzhambazov G, Miron M, Serra X. Lyrics to audio alignment in polyphonic audio. Paper presented at: ISMIR 2016. 17th International Society for Music Information Retrieval Conference; 2016 Aug 7-11; New York City (NY).http://hdl.handle.net/10230/34324Comunicació presentada a la 17th International Society for Music Information Retrieval Conference (ISMIR 2016), celebrada els dies 7 a 11 d'agost de 2018 a Nova York, EUA.In this paper we describe the two algorithms we submitted for the MIREX 2017 task of Automatic Lyrics-to-Audio Alignment. The task has as a goal the automatic detection of word boundaries in multi-instrumental English pop music. We rely on a phonetic recognizer based on hidden Markov models (HMM): a widely-used method for tracking phonemes in speech processing problems. Tracking lyrics in music audio is harder than tracking text in speech because, unlike speech, the singing voice is mixed with multiple instruments. To address this obstacle we propose the application of two separate methods for segregating the singing voice from the multi-instrumental mix. One of them is based on the detection of vocal harmonic partials, whereas the other extracts the vocal content by means of source separation.application/pdfeng© Georgi Dzhambazov, Marius Miron, Xavier Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). Attribution: Georgi Dzhambazov, Marius Miron, Xavier Serra. “Lyrics to Audio Alignment in polyphonic audio”, 17th International Society for Music Information Retrieval Conference, 2016.So -- InformàticaLyrics to audio alignment in polyphonic audioinfo:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/openAccess