Automatic paragraph segmentation with lexical and prosodic features

Lai, Catherine; Farrús, Mireia; Moore, Johanna D.

Automatic paragraph segmentation with lexical and prosodic features

Citació

Lai C, Farrús M, Moore JD. Automatic paragraph segmentation with lexical and prosodic features. In: Interspeech 2016; 2016 Sep 08-12; San Francisco (CA). [place unknown]: ISCA; 2016. p. 1034-8. DOI: 10.21437/Interspeech.2016-992

Enllaç permanent

http://hdl.handle.net/10230/28063

Descripció

Resum
As long-form spoken documents become more ubiquitous in everyday life, so does the need for automatic discourse segmentation in spoken language processing tasks. Although previous work has focused on broad topic segmentation, detection of finer-grained discourse units, such as paragraphs, is highly desirable for presenting and analyzing spoken content. To better understand how different aspects of speech cue these subtle discourse transitions, we investigate automatic paragraph segmentation of TED talks. We build lexical and prosodic paragraph segmenters using Support Vector Machines, AdaBoost, and Long Short Term Memory (LSTM) recurrent neural networks. In general, we find that induced cue words and supra-sentential prosodic features outperform features based on topical coherence, syntactic form and complexity. However, our best performance is achieved by combining a wide range of individually weak lexical and prosodic features, with the sequence modelling LSTM generally outperforming the other classifiers by a large margin. Moreover, we find that models that allow lower level interactions between different feature types produce better results than treating lexical and prosodic contributions as separate, independent information sources.
Descripció
Comunicació presentada a la Interspeech 2016, celebrada per la International Speech Communication Association (ISCA) els dies 8 a 12 de septembre de 2016 a San Francisco (EUA).
DOI
http://dx.doi.org/10.21437/Interspeech.2016-992
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)

Mostra el registre complet

Automatic paragraph segmentation with lexical and prosodic features

Automatic paragraph segmentation with lexical and prosodic features

Fitxers

Data

Autories

Resum

Descripció

DOI

Col·leccions