dc.contributor.author
Öktem, Alp
dc.contributor.author
Farrús, Mireia
dc.contributor.author
Bonafonte Cávez, Antonio
dc.date.issued
2019-10-04T09:39:14Z
dc.identifier
Öktem A, Farrús M, Bonafonte A. Prosodic phrase alignment for machine dubbing. In: Interspeech 2019; 2019 Sep 15-19; Graz, Austria. [Baixas]: ISCA; 2019. p. 4215-19. DOI: 10.21437/Interspeech.2019-1621
dc.identifier
http://hdl.handle.net/10230/42380
dc.identifier
http://dx.doi.org/10.21437/Interspeech.2019-1621
dc.description.abstract
Paper presented at: Interspeech 2019, held 15-19 September 2019 in Graz, Austria.
dc.description.abstract
Dubbing is a type of audiovisual translation where dialogues are
translated and enacted so that they give the impression that the
media is in the target language. It requires a careful alignment
of dubbed recordings with the lip movements of performers in
order to achieve visual coherence. In this paper, we deal with
the specific problem of prosodic phrase synchronization within
the framework of machine dubbing. Our methodology exploits
the attention mechanism output in neural machine translation
to find plausible phrasing for the translated dialogue lines and
then uses them to condition their synthesis. Our initial work in
this field achieves a speech rate ratio comparable to that of
professional dubbing translation, and improves the lip-syncing
of long dialogue lines.
dc.format
application/pdf
dc.publisher
International Speech Communication Association (ISCA)
dc.relation
Interspeech 2019; 2019 Sep 15-19; Graz, Austria. [Baixas]: ISCA; 2019.
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Audiovisual translation
dc.subject
Spoken machine translation
dc.title
Prosodic phrase alignment for machine dubbing
dc.type
info:eu-repo/semantics/conferenceObject
dc.type
info:eu-repo/semantics/publishedVersion