Öktem, A. Heroes Corpus. 2018. doi: 10.34810/data500
This citation was generated automatically.
Abstract
Heroes corpus contains mapped bilingual (English and Spanish) speech segments from the TV series Heroes. It contains 7000 single speaker speech segments extracted from the original and Spanish dubbed version of 21 episodes. Audio segments are accompanied with subtitle transcriptions and word-level prosodic/paralinguistic information.
Other authors
Description
Each episode directory contains word-level and segment-level information of the whole episode and also parallel samples extracted under segments_eng and segments_spa subdirectories. Each sample is stored as an WAV audio file, text file and a CSV file containing word timing information and word-level paralinguistic and prosodic features.
Publicació relacionada: Öktem A, Farrús M, Bonafonte A. Bilingual prosodic dataset compilation for spoken language translation. Paper presented at: IberSPEECH'18; 2018 Nov 21-23; Barcelona, Spain. http://hdl.handle.net/10230/35600