Title:
|
Towards large scale multimedia indexing: a case study on person discovery in broadcast news
|
Author:
|
Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
The rapid growth of multimedia databases and the human interest
in their peers make indices representing the location and identity
of people in audio-visual documents essential for searching
archives. Person discovery in the absence of prior identity knowledge
requires accurate association of audio-visual cues and detected
names. To this end, we present 3 different strategies to approach
this problem: clustering-based naming, verification-based naming,
and graph-based naming. Each of these strategies utilizes different
recent advances in unsupervised face / speech representation, verification,
and optimization. To have a better understanding of the
approaches, this paper also provides a quantitative and qualitative
comparative study of these approaches using the associated corpus
of the Person Discovery challenge at MediaEval 2016. From the
results of our experiments, we can observe the pros and cons of
each approach, thus paving the way for future promising research
directions. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació::Bases de dades -Àrees temàtiques de la UPC::So, imatge i multimèdia -Databases -Audio-visual materials -Cluster -Bases de dades -Audiovisuals -Sistemes productius locals |
Rights:
|
|
Document type:
|
Article - Published version Conference Object |
Published by:
|
Association for Computing Machinery (ACM)
|
Share:
|
|