Thematicity-based prosody enrichment for text-to-speech applications

Domínguez Bajo, Mónica; Burga Díaz, Alicia; Farrús, Mireia; Wanner, Leo

Citation

Domínguez M, Burga A, Farrús M, Wanner L. Thematicity-based prosody enrichment for text-to-speech applications. In: Klessa K, Bachan J, Wagner A, Karpiński M, Śledziński D. Proceedings of the 9th International Conference on Speech Prosody; 2018 June 13-16; Poznań, Poland. [Lous Tourils]: ISCA; 2018. p. 612-6. DOI: 10.21437/SpeechProsody.2018-119

Permanent Link

http://hdl.handle.net/10230/34905

Description

Abstract
Theoretical studies on the information structure–prosody interface argue that the content packaged in terms of theme and rheme correlates with the intonation of the corresponding sentence with regard to rising and falling patterns (L*+H LH% and H* LL%, respectively). When such a correspondence is used to derive prosody in text-to-speech (TTS) applications, ToBI labels are often statically mapped to acoustic parameters. Such an approach is insufficient to solve the problem of monotonous synthetic voices for two reasons: it is repetitive with respect to prosody enrichment, and a binary flat theme–rheme representation cannot properly describe long, complex sentences. In this paper, we introduce a methodology for a more versatile thematicity-based prosody enrichment based on: (i) a hierarchical tripartite thematicity model as proposed in the Meaning–Text Theory, and (ii) a corpus-based approach for the automatic extraction of acoustic parameters (fundamental frequency, breaks and speech rate) that are mapped to a varied range of prosody control tags for the synthesized speech. Such prosody enrichment has been shown to yield better results in a perception test when implemented in a TTS system.
Description
Paper presented at the 9th International Conference on Speech Prosody 2018, held June 13-16 in Poznań, Poland.
DOI
http://dx.doi.org/10.21437/SpeechProsody.2018-119
Collections
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)
