Title:
End-to-end learning for music audio tagging at scale
|
Author(s):
Pons Puig, Jordi; Nieto, Oriol; Prockup, Matthew; Schmidt, Erik M.; Ehmann, Andreas F.; Serra, Xavier
|
Note:
Paper presented at the Workshop on Machine Learning for Audio Signal Processing at NIPS 2017 (ML4Audio@NIPS17), held December 4–9, 2017, in Long Beach, California.
Abstract:
The lack of data tends to limit the outcomes of deep learning research, especially when dealing with end-to-end learning stacks that process raw data such as waveforms. In this study we make use of musical labels annotated for 1.2 million tracks. This large amount of data allows us to unrestrictedly explore different front-end paradigms: from assumption-free models, which use waveforms as input with very small convolutional filters, to models that rely on domain knowledge, namely log-mel spectrograms processed by a convolutional neural network designed to learn temporal and timbral features. Results suggest that while spectrogram-based models surpass their waveform-based counterparts, the difference in performance shrinks as more data are employed.
Acknowledgements:
This work is partially supported by the Maria de Maeztu Programme (MDM-2015-0502).
Rights:
© Sound & Music Computing
|
Document type:
Conference object / Article – Published version