Prosodically annotated TED talks
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Öktem, Alpca
- dc.contributor.author Farrús, Mireiaca
- dc.contributor.author Lai, Catherine
- dc.date.accessioned 2018-02-23T09:19:16Z
- dc.date.available 2018-02-23T09:19:16Z
- dc.date.issued 2018-02-23
- dc.description "Audio files of the recordings are provided in the partitioned archives as WAV format. ""talk_proscripts"" archive contains Proscript format annotations of complete talks. ""punkProse_dataset"" archive contains sampled dataset partitioning used in prosodic punctuation modelling experiments (See http://github.com/alpoktem/punkProse). README.txt file contains information on the dataset and authors. Indexing of the files and their corresponding talks are listed in TED_talk_ids.txt. Proscript format files contain the sequence of uttered words in a recording, their approximate timings and corresponding acoustic measurements (pitch, intensity, speech rate). For more information on Proscript format see http://github.com/alpoktem/proscript."ca
- dc.description.abstract TED talks are a set of conference talks that have been held worldwide in more than 100 languages. They include a large variety of topics, from technology and design to science, culture and academia. This corpus consists of speech recordings and Proscript format annotations of 1046 talks by 877 English speakers, uttering a total amount of 155174 sentences.ca
- dc.identifier.citation Öktem A, Farrús M, Lai C. Prosodically annotated TED talks. Repositori Digital de la UPF: Barcelona; 2018. Disponible a: http://hdl.handle.net/10230/33981
- dc.identifier.doi https://doi.org/10.34810/data501
- dc.identifier.uri http://hdl.handle.net/10230/33981
- dc.language.iso engca
- dc.publisher Universitat Pompeu Fabraca
- dc.relation Publicació relacionada: Öktem A, Farrús M, Wanner L. Attentional parallel RNNs for generating punctuation in transcribed speech. In: Camelin N, Estève Y, Martín-Vide C. Statistical Language and Speech Processing. 5th International Conference SLSP 2017; 2017 Oct 23-25; Le Mans, France. Cham: Springer, 2017. p. 131-42. (LNCS; no. 10583 ). DOI: 10.1007/978-3-319-68456-7_11 http://hdl.handle.net/10230/33936
- dc.relation Sotware relacionat: http://hdl.handle.net/10230/33982
- dc.relation.isreferencedby http://hdl.handle.net/10230/33936
- dc.relation.isreferencedby http://hdl.handle.net/10230/33982
- dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/645012
- dc.rights Attribution 4.0 International (CC BY 4.0)ca
- dc.rights.accessRights info:eu-repo/semantics/openAccessca
- dc.rights.uri https://creativecommons.org/licenses/by/4.0/ca
- dc.subject.keyword Speech transcription
- dc.subject.keyword Recurrent neural networks
- dc.subject.keyword Prosody
- dc.subject.keyword Punctuation generation
- dc.subject.keyword Automatic speech recognition
- dc.subject.keyword Speech dataset
- dc.subject.keyword Conference talks
- dc.title Prosodically annotated TED talksca
- dc.type info:eu-repo/semantics/otherca
- dc.type Dataset