To access the full text documents, please follow this link: http://hdl.handle.net/10230/33299

Freesound datasets: a platform for the creation of open audio datasets
Fonseca, Eduardo; Pons Puig, Jordi; Favory, Xavier; Font Corbera, Frederic; Bogdanov, Dmitry; Ferraro, Andrés; Oramas, Sergio; Porter, Alastair; Serra, Xavier
Paper presented at the 18th International Society for Music Information Retrieval Conference, held in Suzhou, China, from 23 to 27 October 2017.
Openly available datasets are a key factor in the advancement of data-driven research approaches, including many of the ones used in sound and music computing. In the last few years, quite a number of new audio datasets have been made available, but many of them still have major shortcomings that limit their research impact. Common shortcomings include a lack of transparency in their creation and the difficulty of making them completely open and sharable. They often lack clear mechanisms for amending errors, and many are not large enough for current machine learning needs. This paper introduces Freesound Datasets, an online platform for the collaborative creation of open audio datasets based on principles of transparency, openness, dynamic character, and sustainability. As a proof of concept, we present an early snapshot of a large-scale audio dataset built using this platform. It consists of audio samples from Freesound organised in a hierarchy based on the AudioSet Ontology. We believe that building and maintaining datasets following the outlined principles, and using open tools and collaborative approaches like the ones presented here, will have a significant impact on our research community.
This work has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688382 “AudioCommons”, and from the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502).
Freesound -- Databases
© Eduardo Fonseca, Jordi Pons, Xavier Favory, Frederic Font, Dmitry Bogdanov, Andres Ferraro, Sergio Oramas, Alastair Porter, Xavier Serra. Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). Attribution: Eduardo Fonseca, Jordi Pons, Xavier Favory, Frederic Font, Dmitry Bogdanov, Andres Ferraro, Sergio Oramas, Alastair Porter, Xavier Serra. “Freesound Datasets: A platform for the creation of open audio datasets”, 18th International Society for Music Information Retrieval Conference, Suzhou, China, 2017.
https://creativecommons.org/licenses/by/4.0/
Conference Object
Article - Published version
International Society for Music Information Retrieval (ISMIR)
         
