Welcome to the UPF Digital Repository

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Author "Wanner, Leo"

  • Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This demo paper presents the multilingual deep sentence generator developed by the TALN group at Universitat Pompeu Fabra, implemented as a series of rule-based graph-transducers.
  • Gialampoukidis, Ilias; Vrochidis, Stefanos; Kompatsiaris, Ioannis; Wanner, Leo (Springer, 2016)
    Nowadays there is an important need by journalists and media monitoring companies to cluster news in large amounts of web articles, in order to ensure fast access to their topics or events of interest. Our aim in this work ...
  • Ballesteros, Miguel; Wanner, Leo (ACL (Association for Computational Linguistics), 2016)
    Even syntactically correct sentences are perceived as awkward if they do not contain correct punctuation. Still, the problem of automatic generation of punctuation marks has been largely neglected for a long time. We present ...
  • Soler Company, Juan; Wanner, Leo (LREC, 2016)
    In most of the research studies on Author Profiling, large quantities of correctly labeled data are used to train the models. However, this does not reflect the reality in forensic scenarios: in practical linguistic forensic ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2017)
    This paper presents a demonstration of a stochastic prosody tool for enrichment of synthesized speech using SSML prosody tags applied over hierarchical thematicity spans in the context of a CTS application. The motivation ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (COLING, 2016)
    Speech prosody is known to be central in advanced communication technologies. However, despite the advances of theoretical studies in speech prosody, so far, no large scale prosody annotated resources that would facilitate ...
  • Pérez-Mayos, Laura; Táboas García, Alba; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Multilingual Transformer-based language models, usually pretrained on more than 100 languages, have been shown to achieve outstanding results in a wide range of crosslingual transfer tasks. However, it remains unknown ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Fisas Elizalde, Beatriz; Espinosa-Anke, Luis; Codina Filbà, Joan; Wanner, Leo (ACL (Association for Computational Linguistics), 2020)
    Collocations in the sense of idiosyncratic lexical co-occurrences of two syntactically bound words traditionally pose a challenge to language learners and many Natural Language Processing (NLP) applications alike. Reliable ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2016)
    Intonation is traditionally considered to be the most important prosodic feature, whereupon an important research effort has been devoted to automatic segmentation and labeling of speech samples to grasp intonation cues. ...
  • Domínguez Bajo, Mónica; Burga Díaz, Alicia; Farrús, Mireia; Wanner, Leo (ELRA (European Language Resources Association), 2018)
    Theoretical studies on the Information Structure–prosody interface argue that the content packaged in terms of theme and rheme correlates with the intonation of the corresponding sentence. However, there are few empirical ...
  • Ballesteros, Miguel; Bohnet, Bernd; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2015)
    Abstract structures from which the generation naturally starts often do not contain any functional nodes, while surface-syntactic structures or a chain of tokens in a linearized tree contain all of them. Therefore, data-driven ...
  • Espinosa-Anke, Luis; Codina Filbà, Joan; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Lexical collocations are idiosyncratic combinations of two syntactically bound lexical items (e.g., “heavy rain”, “take a step” or “undergo surgery”). Understanding their degree of compositionality and idiosyncrasy, as ...
  • Rodríguez Fernández, Sara; Carlini, Roberto; Espinosa-Anke, Luis; Wanner, Leo (ELRA (European Language Resources Association), 2016)
    Collocations such as heavy rain or make [a] decision, are combinations of two elements where one (the base) is freely chosen, while the choice of the other (collocate) is restricted, depending on the base. Collocations ...
  • Espinosa-Anke, Luis; Camacho-Collados, Jose; Rodríguez Fernández, Sara; Saggion, Horacio; Wanner, Leo (COLING, 2016)
    WordNet is probably the best known lexical resource in Natural Language Processing. While it is widely regarded as a high quality repository of concepts and semantic relations, updating and extending it manually is costly. ...
  • Mille, Simon; Carlini, Roberto; Burga Díaz, Alicia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    We present the contribution of Universitat Pompeu Fabra’s NLP group to the Sem-Eval Task 9.2 (AMR-to-English Generation). The proposed generation pipeline comprises: (i) a series of rule-based graph-transducers for the ...
  • Pérez-Mayos, Laura; Ballesteros, Miguel; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Transformers-based pretrained language models achieve outstanding results in many well-known NLU benchmarks. However, while pretraining methods are very convenient, they are expensive in terms of time and resources. This ...
  • Wanner, Leo; André, Elisabeth; Blat, Josep; Dasiopoulou, Stamatia; Farrús, Mireia; Kamateri, Eleni; Lingenfelser, Florian; Llorach, Gerard; Martínez, Oriol; Meditskos, Georgios; Mille, Simon; Minker, Wolfgang; Pragst, Louisa; Schiller, Dominik; Stam, Andries; Stellingwerff, Ludo; Sukno, Federico Mateo; Vieru, Bianca; Vrochidis, Stefanos (Springer, 2017)
    We present an intelligent embodied conversation agent with linguistic, social and emotional competence. Unlike the vast majority of the state-of-the-art conversation agents, the proposed agent is constructed around an ...
  • Espinosa-Anke, Luis; Shvets, Alexander; Mohammadshahi, Alireza; Henderson, James; Wanner, Leo (ACL (Association for Computational Linguistics), 2022)
    Recognizing and categorizing lexical collocations in context is useful for language learning, dictionary compilation and downstream NLP. However, it is a challenging task due to the varying degrees of frozenness lexical ...
