2022-10-03T09:57:56Z
2022-10-03T09:57:56Z
2022-09-08
2022-10-03T09:57:57Z
Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a set of annotators. Learning from crowd-labelled data involves dealing with its inherent uncertainty and inconsistencies. In the classical framework, each annotator provides a single label per example, which fails to capture the complete knowledge of annotators. We propose candidate labelling, that is, to allow annotators to provide a set of candidate labels for each example and thus express their doubts. We propose an appropriate model for the annotators, and present two novel learning methods that deal with the two basic steps (label aggregation and model learning) sequentially or jointly. Our empirical study shows the advantage of candidate labelling and the proposed methods with respect to the classical framework.
Published version
Article
English
Aprenentatge automàtic; Cultura participativa; Dades massives; Machine learning; Participatory culture; Big data
Institute of Electrical and Electronics Engineers (IEEE)
Reproducció del document publicat a: https://doi.org/10.1109/MIS.2022.3205053
IEEE Intelligent Systems, 2022
https://doi.org/10.1109/MIS.2022.3205053
cc by-nc-nd (c) Beñaran-Muñoz, Iker et al., 2022
http://creativecommons.org/licenses/by-nc-nd/3.0/es/