Using prosody to classify discourse relations

Kleinhans, Janine; Farrús, Mireia; Gravano, Agustín; Pérez, Juan Manuel; Lai, Catherine; Wanner, Leo

Using prosody to classify discourse relations

Citació

Kleinhans J, Farrús M, Gravano A, Pérez JM, Lai C, Wanner L. Using prosody to classify discourse relations. In: Proceedings of the 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017); 2017 Aug. 20-24; Stockholm, Sweden. Baixas: ISCA; 2017. p. 778-81. DOI: 10.21437/Interspeech.2017-710

Enllaç permanent

http://hdl.handle.net/10230/32717

Descripció

Resum
This work aims to explore the correlation between the discourse structure of a spoken monologue and its prosody by predicting discourse relations from different prosodic attributes. For this purpose, a corpus of semi-spontaneous monologues in English has been automatically annotated according to the Rhetorical Structure Theory, which models coherence in text via rhetorical relations. From corresponding audio files, prosodic features such as pitch, intensity, and speech rate have been extracted from different contexts of a relation. Supervised classification tasks using Support Vector Machines have been performed to find relationships between prosodic features and rhetorical relations. Preliminary results show that intensity combined with other features extracted from intra- and intersegmental environments is the feature with the highest predictability for a discourse relation. The prediction of rhetorical relations from prosodic features and their combinations is straightforwardly applicable to several tasks such as speech understanding or generation. Moreover, the knowledge of how rhetorical relations should be marked in terms of prosody will serve as a basis to improve speech synthesis applications and make voices sound more natural and expressive.
Descripció
Comunicació presentada a: The 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), celebrada a Estocolm, Suència, del 20 al 24 d'agost de 2017.
DOI
http://dx.doi.org/10.21437/Interspeech.2017-710
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)

Mostra el registre complet

Using prosody to classify discourse relations

Using prosody to classify discourse relations

Fitxers

Data

Autories

Resum

Descripció

DOI

Col·leccions