Prosodic phrase alignment for machine dubbing

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Öktem, Alp
  • dc.contributor.author Farrús, Mireia
  • dc.contributor.author Bonafonte Cávez, Antonio
  • dc.date.accessioned 2019-10-04T09:39:14Z
  • dc.date.available 2019-10-04T09:39:14Z
  • dc.date.issued 2019
  • dc.description Comunicació presentada a: Interspeech 2019, celebrat del 15 al 19 setembre de 2019 a Graz, Àustria.
  • dc.description.abstract Dubbing is a type of audiovisual translation where dialogues are translated and enacted so that they give the impression that the media is in the target language. It requires a careful alignment of dubbed recordings with the lip movements of performers in order to achieve visual coherence. In this paper, we deal with the specific problem of prosodic phrase synchronization within the framework of machine dubbing. Our methodology exploits the attention mechanism output in neural machine translation to find plausible phrasing for the translated dialogue lines and then uses them to condition their synthesis. Our initial work in this field records comparable speech rate ratio to professional dubbing translation, and improvement in terms of lip-syncing of long dialogue lines.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Öktem A, Farrús M, Bonafonte A. Prosodic phrase alignment for machine dubbing. In: Interspeech 2019; 2019 Sep 15-19; Graz, Austria. [Baixas]: ISCA; 2019. p. 4215-19. DOI: 10.21437/Interspeech.2019-1621
  • dc.identifier.doi http://dx.doi.org/10.21437/Interspeech.2019-1621
  • dc.identifier.issn 1990-9772
  • dc.identifier.uri http://hdl.handle.net/10230/42380
  • dc.language.iso eng
  • dc.publisher International Speech Communication Association (ISCA)
  • dc.relation.ispartof Interspeech 2019; 2019 Sep 15-19; Graz, Austria. [Baixas]: ISCA; 2019.
  • dc.rights © 2019 ISCA
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.subject.keyword Audiovisual translation
  • dc.subject.keyword Dubbing
  • dc.subject.keyword Spoken machine translation
  • dc.subject.keyword Prosody
  • dc.title Prosodic phrase alignment for machine dubbing
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion