The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon; Poignant, Johann; Budnik, Mateusz; Bredin, Herve; Barras, Claude; Adda, Gilles; Hernando Pericás, Francisco Javier; Mariani, Joseph; Morros Rubió, Josep Ramon

The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Author

Hernando Pericás, Francisco Javier

Mariani, Joseph

Morros Rubió, Josep Ramon

Other authors

Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions

Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla

Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo

Publication date

2016

Abstract

In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which can be performed on 3M data, the structure of the server was kept intentionally simple in order to preserve its genericity, relying on standard Web technologies. Layers of annotations, defined as data associated to a media fragment from the corpus, are stored in a database and can be managed through standard interfaces with authentication. Interfaces tailored specifically to the needed task can then be developed in an agile way, relying on simple but reliable services for the management of the centralized annotations. We then present our implementation of an active learning scenario for person annotation in video, relying on the CAMOMILE server; during a dry run experiment, the manual annotation of 716 speech segments was thus propagated to 3504 labeled tracks. The code of the CAMOMILE framework is distributed in open source.

Peer Reviewed

Postprint (author's final draft)

Document Type

Conference lecture

Language

English

Subjects and keywords

Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic; Automatic speech recognition; Annotation tool; Collaborative annotation; Multimedia; Active learning; Person annotation.; Reconeixement automàtic de la parla

Publisher

European Language Resources Association

Related items

http://www.lrec-conf.org/proceedings/lrec2016/index.html

Recommended citation

This citation was generated automatically.

Export

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Rights

Open Access

Cretaive Commons License (by-nc-nd)

This item appears in the following Collection(s)

E-prints [73012]

The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Author

Other authors

Publication date

Share

Abstract

Document Type

Language

Subjects and keywords

Publisher

Related items

Recommended citation

Export

Rights

This item appears in the following Collection(s)