Title:
|
A multilingual corpus for rich audio-visual scene description in a meeting-room environment
|
Author:
|
Butko, Taras; Nadeu Camprubí, Climent; Moreno Bilbao, M. Asunción
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
In this paper, we present a multilingual database specifically designed to develop technologies for rich audio-visual scene description in meeting-room environments. Part of the database consists of the existing CHIL audio-visual recordings, whose annotations have been extended. A relevant objective of the newly recorded sessions was to include situations in which the semantic content cannot be extracted from a single modality. The presented database, which includes five hours of rather spontaneously generated scientific presentations, was manually annotated using standard or previously reported annotation schemes, and will be publicly available for research purposes. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing -Multimedia databases -Audio-visual materials -- Equipment and supplies -Conference rooms -- Equipment and supplies |
Document type:
|
Article - Submitted version; Conference Object |
Published by:
|
ACM Press. Association for Computing Machinery
|