Title:
|
Lyrics to audio alignment for karaoke in pop music
|
Author:
|
Dzhambazov, Georgi Bogomilov; Miron, Marius; Serra, Xavier
|
Abstract:
|
Comunicació preseentada a la 17th International Society for Music Information Retrieval Conference (ISMIR 2016), celebrada els dies 7 a 11 d'agost de 2016 a Nova York, EUA. |
Abstract:
|
In this paper we describe an algorithm for automatic lyricsto-audio
alignment. It has as a goal the automatic detection
of word boundaries in multi-instrumental English pop
songs. We rely on a phonetic recognizer based on hidden
Markov models: a widely-used method for tracking
phonemes in speech processing problems. Tracking lyrics
in music audio is harder than tracking text in speech because,
unlike speech, the singing voice is mixed with multiple
instruments. To address this obstacle we apply a convolution
neural networks-based method for singing voice
separation. We present a prototype of a practical application
based on the alignment method - the highliting of
lyrics in a karaoke-like fashion. |
Abstract:
|
This work is supportedby the Spanish Ministry of Economy and Competitiveness, through the ”María de Maeztu” Programme for Centres/Units of Excellence in R&D” (MDM-2015-0502). |
Subject(s):
|
-So -- Informàtica |
Rights:
|
© Georgi Dzhambazov, Marius Miron, Xavier Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). Attribution: Georgi Dzhambazov, Marius Miron, Xavier Serra. "Lyrics to audio alignment for karaoke in pop music", 17th International Society for Music Information Retrieval Conference, 2016.
http://creativecommons.org/licenses/by/4.0/ |
Document type:
|
Conference Object Article - Published version |
Published by:
|
International Society for Music Information Retrieval (ISMIR)
|
Share:
|
|