Welcome to the UPF Digital Repository

Browsing Conference Papers (Department of Information and Communication Technologies) by author "Öktem, Alp"

  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte, Antonio (International Speech Communication Association (ISCA), 2018)
    This paper builds on a previous methodology that exploits dubbed media material to build prosodically annotated bilingual corpora. The almost fully-automatized process serves for building data for training spoken ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (2017)
    This paper presents an open-source tool that has been developed to visualize a speech corpus with its transcript and prosodic features aligned at word level. In particular, the tool is aimed at providing a simple and clear ...
  • Burga Díaz, Alicia; Öktem, Alp; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    In this paper, we present a revision of the training set of the METU-Sabancı Turkish syntactic dependency treebank composed of 4997 sentences in accordance with the principles of the Meaning-Text Theory (MTT). MTT reflects ...
  • Öktem, Alp; Farrús, Mireia; Bonafonte, Antonio (International Speech Communication Association (ISCA), 2018)
    We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision ...