Welcome to the UPF Digital Repository

Visualizing punctuation restoration in speech transcripts with prosograph

Show simple item record

dc.contributor.author Öktem, Alp
dc.contributor.author Farrús, Mireia
dc.contributor.author Bonafonte, Antonio
dc.date.accessioned 2018-11-20T15:47:33Z
dc.date.available 2018-11-20T15:47:33Z
dc.date.issued 2018
dc.identifier.citation Öktem A, Farrús M, Bonafonte A. Visualizing punctuation restoration in speech transcripts with prosograph. In: Interspeech 2018; 2018 Sep 2-6; Hyderabad, India. Baixas: ISCA; 2018. p. 1493-4.
dc.identifier.issn 1990-9772
dc.identifier.uri http://hdl.handle.net/10230/35801
dc.description Comunicació presentada a: Interspeech 2018, celebrat del 2 al 6 de setembre de 2018 a Hyderabad, Índia.
dc.description.abstract We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision and recall, we further extend our performance tests by attaching it in a speech recognition pipeline. The visual and interactive testing environment that we prepared helps us observe how our models generalizes in unseen data and also plan our next steps for improvement.
dc.description.sponsorship The first author has received Maria de Maeztu Reproducibility Award from Department of Information and Communication Technologies of Universitat Pompeu Fabra in 2018 through presentation of this work. The second author is funded by the Spanish Ministry of Economy, Industry and Competitiveness through the Ram´on y Cajal program.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher International Speech Communication Association (ISCA)
dc.relation.ispartof Interspeech 2018; 2018 Sep 2-6; Hyderabad, India. Baixas: ISCA; 2018. p. 1493-4.
dc.rights © 2018 ISCA
dc.title Visualizing punctuation restoration in speech transcripts with prosograph
dc.type info:eu-repo/semantics/conferenceObject
dc.subject.keyword Prosody
dc.subject.keyword Punctuation
dc.subject.keyword Automatic speech recognition
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion


This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

Compliant to Partaking