Paragraph prosodic patterns to enhance text-to-speech naturalness

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Peiró Lilja, Àlexca
  • dc.contributor.author Farrús, Mireiaca
  • dc.date.accessioned 2018-06-14T16:53:04Z
  • dc.date.available 2018-06-14T16:53:04Z
  • dc.date.issued 2018
  • dc.description Comunicació presentada a: the 9th International Conference on Speech Prosody 2018, celebrat del 13 al 16 de juny a Poznań, Polònia.ca
  • dc.description.abstract Speech synthesis has reached a reasonable high quality in recent years. However, there is still room for improvement in terms of naturalness and expressiveness when dealing with large multisentential discourse, since most text-to-speech synthesizers do not fully take into account the prosodic differences that have been observed in discourse units such as paragraphs. This work presents an implementation of paragraph-based prosodic patterns into the open-source MARYTTS platform, enriching its prosody output by means of intra- and inter-paragraph prosodic features. The set of characteristics include pitch decay, pitch range and speech rate variation (as intra-paragraph features), as well as paragraph break pauses and speech rate variation (as inter-paragraph features), previously analyzed in a large set of TED Talks and read-speech sections of the Spoken Wikipedia Corpus. The perception tests, performed both in English and German parametric voices, suggest that paragraph-based features should be further studied and taken into account on future implementations to synthesize large discourse speech.en
  • dc.description.sponsorship The authors would like to thank all listeners that took part in the experiments. This work is part of the KRISTINA project, which has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under the Grant Agreement number H2020-RIA-645012. The second author is partially funded by the Spanish Ministry of Economy, Industry and Competitiveness through the Ram´on y Cajal program.en
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Peiró-Lilja A, Farrús M. Paragraph prosodic patterns to enhance text-to-speech naturalness. In: Klessa K, Bachan J, Wagner A, Karpiński M, Śledziński D. Proceedings of the 9th International Conference on Speech Prosody; 2018 June 13-16; Poznań, Poland. [Lous Tourils]: ISCA; 2018. p. 512-6. DOI: 10.21437/SpeechProsody.2018-124
  • dc.identifier.doi http://dx.doi.org/10.21437/SpeechProsody.2018-124
  • dc.identifier.issn 2333-2042
  • dc.identifier.uri http://hdl.handle.net/10230/34906
  • dc.language.iso eng
  • dc.publisher International Speech Communication Association (ISCA)ca
  • dc.relation.ispartof Klessa K, Bachan J, Wagner A, Karpiński M, Śledziński D. Proceedings of the 9th International Conference on Speech Prosody; 2018 June 13-16; Poznań, Poland. [Lous Tourils]: ISCA; 2018. p. 512-6.
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/645012
  • dc.rights © 2018 ISCA.
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.subject.keyword Parametric speech synthesisen
  • dc.subject.keyword Text-to-speechen
  • dc.subject.keyword Prosodyen
  • dc.subject.keyword Paragraph prosody patternsen
  • dc.subject.keyword MARYTTS
  • dc.title Paragraph prosodic patterns to enhance text-to-speech naturalnessca
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion