Lyrics to audio alignment for karaoke in pop music

Dzhambazov, Georgi BogomilovMiron, MariusSerra, Xavier2018-04-092018-04-092016Dzhambazov G, Miron M, Serra X. Lyrics to audio alignment for karaoke in pop music. Paper presented at: ISMIR 2016. 17th International Society for Music Information Retrieval Conference; 2016 Aug 7-11; New York City (NY).http://hdl.handle.net/10230/34323Comunicació preseentada a la 17th International Society for Music Information Retrieval Conference (ISMIR 2016), celebrada els dies 7 a 11 d'agost de 2016 a Nova York, EUA.In this paper we describe an algorithm for automatic lyricsto-audio alignment. It has as a goal the automatic detection of word boundaries in multi-instrumental English pop songs. We rely on a phonetic recognizer based on hidden Markov models: a widely-used method for tracking phonemes in speech processing problems. Tracking lyrics in music audio is harder than tracking text in speech because, unlike speech, the singing voice is mixed with multiple instruments. To address this obstacle we apply a convolution neural networks-based method for singing voice separation. We present a prototype of a practical application based on the alignment method - the highliting of lyrics in a karaoke-like fashion.application/pdfeng© Georgi Dzhambazov, Marius Miron, Xavier Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). Attribution: Georgi Dzhambazov, Marius Miron, Xavier Serra. "Lyrics to audio alignment for karaoke in pop music", 17th International Society for Music Information Retrieval Conference, 2016.So -- InformàticaLyrics to audio alignment for karaoke in pop musicinfo:eu-repo/semantics/conferenceObjectinfo:eu-repo/semantics/openAccess