Welcome to the UPF Digital Repository

Towards expressive prosody generation in TTS for reading aloud applications

Show simple item record

dc.contributor.author Domínguez Bajo, Mónica
dc.contributor.author Burga Díaz, Alicia
dc.contributor.author Farrús, Mireia
dc.contributor.author Wanner, Leo
dc.date.accessioned 2018-11-27T15:14:19Z
dc.date.available 2018-11-27T15:14:19Z
dc.date.issued 2018
dc.identifier.citation Domínguez M, Burga A, Farrús M, Wanner L. Towards expressive prosody generation in TTS for reading aloud applications. In: IberSpeech 2018; 2018 Nov 21-23; Barcelona, Spain. Baixas, France: ISCA; 2018. p. 40-4. DOI: 10.21437/IberSPEECH.2018-9
dc.identifier.uri http://hdl.handle.net/10230/35867
dc.description Comunicació presentada a: IberSpeech 2018, celebrat a Barcelona del 21 al 23 de novembre de 2018.
dc.description.abstract Conversational technologies that assist elderly people need to adapt to common disabilities in old age. Visual, hearing and even more so cognitive impairments pose serious difficulties for our seniors to handle a standard conversation with a human. Understanding a virtual agent may be ever harder. In this case, communicative strategies are key to adapt the virtual agent to the needs of elderly users. This paper addresses the role of the communicative structure for expressive speech prosody, which is known to be crucial for better speech comprehension. It reports on efforts to improve prosody within a text-to-speech system based on one aspect of the communicative structure, namely thematicity. The work has been implemented as an application in a social virtual agent, KRISTINA, which reads aloud news articles upon request for elderly users in German.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher International Speech Communication Association (ISCA)
dc.relation.ispartof IberSpeech 2018; 2018 Nov 21-23; Barcelona, Spain. Baixas, France: ISCA; 2018. p. 40-4.
dc.rights © 2018 ISCA
dc.title Towards expressive prosody generation in TTS for reading aloud applications
dc.type info:eu-repo/semantics/conferenceObject
dc.identifier.doi http://dx.doi.org/10.21437/IberSPEECH.2018-9
dc.subject.keyword Intelligent conversational agents
dc.subject.keyword Geriatric applications
dc.subject.keyword Communicative structure
dc.subject.keyword Thematicity
dc.subject.keyword Prosody
dc.subject.keyword Text-to-speech
dc.subject.keyword Human-machine interaction
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/645012
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

In collaboration with Compliant to Partaking