Browsing by Author "Öktem, Alp"

Sort by: Order: Results:

  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
    This paper builds on a previous methodology that exploits dubbed media material to build prosodically annotated bilingual corpora. The almost fully-automatized process serves for building data for training spoken language ...
  • Külebi, Baybars; Öktem, Alp; Peiró Lilja, Àlex; Pascual, Santiago; Farrús, Mireia (International Speech Communication Association (ISCA), 2020)
    We present Catotron, a neural network-based open-source speech synthesis system in Catalan. Catotron consists of a sequence-to-sequence model trained with two small opensource datasets based on semi-spontaneous and read ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte Cávez, Antonio (Springer, 2021)
    Research on speech technologies necessitates spoken data, which is usually obtained through read recorded speech, and specifically adapted to the research needs. When the aim is to deal with the prosody involved in speech, ...
  • Öktem, Alp (2018-10-05)
    Heroes corpus contains mapped bilingual (English and Spanish) speech segments from the TV series Heroes. It contains 7000 single speaker speech segments extracted from the original and Spanish dubbed version of 21 episodes. ...
  • Öktem, Alp (Universitat Pompeu Fabra, 2019-03-13)
    In this dissertation, I study the inclusion of prosody into two applications that involve speech understanding:~automatic speech transcription and spoken language translation. In the former case, I propose a method that ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2019)
    Dubbing is a type of audiovisual translation where dialogues are translated and enacted so that they give the impression that the media is in the target language. It requires a careful alignment of dubbed recordings ...
  • Öktem, Alp; Farrús, Mireia; Lai, Catherine (Universitat Pompeu Fabra, 2018-02-23)
    TED talks are a set of conference talks that have been held worldwide in more than 100 languages. They include a large variety of topics, from technology and design to science, culture and academia. This corpus consists ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2017)
    This paper presents an open-source tool that has been developed to visualize a speech corpus with its transcript and prosodic features aligned at word level. In particular, the tool is aimed at providing a simple and clear ...
  • Öktem, Alp (Universitat Pompeu Fabra, 2018-02)
    Punctuation marks support understandability and readability in written language. In spoken language, punctuation of the transcribed speech is influenced by two phenomena: (1) syntax and (2) prosody. We present a software ...
  • Burga Díaz, Alicia; Öktem, Alp; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    In this paper, we present a revision of the training set of the METU-Sabancı Turkish syntactic dependency treebank composed of 4997 sentences in accordance with the principles of the Meaning-Text Theory (MTT). MTT reflects ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
    We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision ...

Search DSpace

Browse

My Account

In collaboration with Compliant to Partaking