Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
1995
This paper studies different sets of subword speech units to be used for recognizing Spanish. In particular it compares context dependent phones, syllables and demisyllables. It shows how context dependent units can effectively reduce the error in a 15% with respect to context independent phones. The benefit of merging similar contexts when there are not enough training data is also validated. On the other hand the paper study the behavior of syllables based units: first, the study reveals that syllables give a similar performance than triphones whereas demisyllables give a similar performance than right (or left) context dependent phones. However, when different types of units are used, context dependent phones give the best results. Results achieved with these sets of units exceed 70% in acoustic-phonetic decoding of Spanish speech.
Peer Reviewed
Postprint (published version)
Conference report
English
Àrees temàtiques de la UPC::Enginyeria de la telecomunicació; Telecommunication; Telecomunicació
ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM
http://www.isca-speech.org/archive/eurospeech_1995/e95_1607.html
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
Open Access
E-prints [72986]