Title:
|
End-to-end speech translation with the transformer
|
Author:
|
Cross Vila, Laura; Escolano Peinado, Carlos; Rodríguez Fonollosa, José Adrián; Ruiz Costa-Jussà, Marta
|
Other authors:
|
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
Abstract:
|
Speech Translation has traditionally been addressed by concatenating two tasks: Speech Recognition and Machine Translation. The main drawback of this approach is that the errors of the two systems accumulate. Recently, neural approaches to Speech Recognition and Machine Translation have made it possible to tackle the task with an End-to-End Speech Translation architecture. In this paper, we propose using the Transformer architecture, which is based solely on attention mechanisms, to build the End-to-End Speech Translation system. As a contrastive architecture, we use the same Transformer to build the Speech Recognition and Machine Translation systems and perform Speech Translation by concatenating them. Results on a standard Spanish-to-English task show that the end-to-end architecture outperforms the concatenated systems by half a BLEU point. |
Abstract:
|
Peer Reviewed |
Subject(s):
|
-UPC subject areas::Teaching and learning -UPC subject areas::Telecommunications engineering -Machine translating -Speech perception -Automatic speech recognition -End-to-End speech translation -Transformer |
Document type:
|
Article - Published version Conference Object |
Published by:
|
Antonio Bonafonte, Jordi Luque and Francesc Alías Pujol
|