Browsing by Author "Öktem, Alp"

  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte, Antonio (International Speech Communication Association (ISCA), 2018)
    This paper builds on a previous methodology that exploits dubbed media material to build prosodically annotated bilingual corpora. The almost fully automated process serves for building data for training spoken ...
  • Öktem, Alp (2018-10-05)
    The Heroes corpus contains mapped bilingual (English and Spanish) speech segments from the TV series Heroes. It contains 7000 single-speaker speech segments extracted from the original and Spanish-dubbed versions of 21 episodes. ...
  • Öktem, Alp (Universitat Pompeu Fabra, 2019-02-25)
    In this dissertation, I study the inclusion of prosody into two applications that involve speech understanding: automatic speech transcription and spoken language translation. In the former case, I propose a method that ...
  • Öktem, Alp; Farrús, Mireia; Lai, Catherine (Universitat Pompeu Fabra, 2018-02-23)
    TED talks are a set of conference talks that have been held worldwide in more than 100 languages. They include a large variety of topics, from technology and design to science, culture and academia. This corpus consists ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2017)
    This paper presents an open-source tool that has been developed to visualize a speech corpus with its transcript and prosodic features aligned at word level. In particular, the tool is aimed at providing a simple and clear ...
  • Öktem, Alp (Universitat Pompeu Fabra, 2018-02)
    Punctuation marks support understandability and readability in written language. In spoken language, punctuation of the transcribed speech is influenced by two phenomena: (1) syntax and (2) prosody. We present a software ...
  • Burga Díaz, Alicia; Öktem, Alp; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    In this paper, we present a revision of the training set of the METU-Sabancı Turkish syntactic dependency treebank composed of 4997 sentences in accordance with the principles of the Meaning-Text Theory (MTT). MTT reflects ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte, Antonio (International Speech Communication Association (ISCA), 2018)
    We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision ...