Machine learning from crowds using candidate set-based labelling

Publication date

2022-10-03T09:57:56Z

2022-10-03T09:57:56Z

2022-09-08

2022-10-03T09:57:57Z

Abstract

Crowdsourcing is a popular cheap alternative in machine learning for gathering information from a set of annotators. Learning from crowd-labelled data involves dealing with its inherent uncertainty and inconsistencies. In the classical framework, each annotator provides a single label per example, which fails to capture the complete knowledge of annotators. We propose candidate labelling, that is, to allow annotators to provide a set of candidate labels for each example and thus express their doubts. We propose an appropriate model for the annotators, and present two novel learning methods that deal with the two basic steps (label aggregation and model learning) sequentially or jointly. Our empirical study shows the advantage of candidate labelling and the proposed methods with respect to the classical framework.

Document Type

Published version


Article

Language

English

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Related items

Reproducció del document publicat a: https://doi.org/10.1109/MIS.2022.3205053

IEEE Intelligent Systems, 2022

https://doi.org/10.1109/MIS.2022.3205053

Recommended citation

This citation was generated automatically.

Rights

cc by-nc-nd (c) Beñaran-Muñoz, Iker et al., 2022

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

This item appears in the following Collection(s)