Title:
|
Clustering initialization based on spatial information for speaker diarization of meetings
|
Author:
|
Luque Serrano, Jordi; Segura, C.; Hernando Pericás, Francisco Javier
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
This paper proposes an initialization for an agglomerative
system applied to speaker diarization in the meeting environment.
The initialization is based on a previous clustering of the
temporal sequence generated by the estimation of the Time Delay
of Arrival (TDOA) among pair of sensors. That initial clustering
has the purpose of obtaining initial classes with speaker
information from a sole speaker. The aim is to ensure the purity
of the initial segments based on the position of the speakers in
a meeting along time. The TDOA initialization was tested with
the dataset used in the RT07s evaluation where an improvement
of the diariazation error rate is obtained with respect to the classical
uniform initialization. The most of the experiments show
that the purity of the beginning segments leads to a better clustering
on the posterior hierarchical strategy based on cepstral
features. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-Àrees temàtiques de la UPC::Enginyeria de la telecomunicació -Loudspeakers -Speaker diarization -Speaker segmentation -Speaker clustering -Cluster initialization -Altaveus |
Rights:
|
|
Document type:
|
Article - Published version Conference Object |
Share:
|
|