PunkProse [software]

Citació

Öktem A. PunkProse [software]. Repositori Digital de la UPF: Barcelona; 2018. Disponible a: http://hdl.handle.net/10230/33982

Enllaç permanent

Descripció

Dades relacionades
http://hdl.handle.net/10230/33936
http://hdl.handle.net/10230/33981
Resum
Punctuation marks support understandability and readability in written language. In spoken language, punctuation of the transcribed speech is influenced by two phenomena: (1) syntax and (2) prosody. We present a software architecture that makes it possible to train punctuation restoration models from any combination of lexical, morphosyntactic, prosodic and acoustic features. Architecture is language independent and feeds on word-segmented data. A dataset compiled from English TED talks is given in http://hdl.handle.net/10230/33981
Descripció
This software is stored and maintained in the following github repository: https://github.com/alpoktem/punkProse Instructions to use is explained there in detail.
DOI
https://doi.org/10.34810/data484
Col·leccions
Departament de Tecnologies de la Informació i les Comunicacions. Dades primàries

Fitxers