Browsing by Author "Wanner, Leo"

Sort by: Order: Results:

  • Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This demo paper presents the multilingual deep sentence generator developed by the TALN group at Universitat Pompeu Fabra, implemented as a series of rule-based graphtransducers.
  • Gialampoukidis, Ilias; Vrochidis, Stefanos; Kompatsiaris, Ioannis; Wanner, Leo (Springer, 2016)
    Nowadays there is an important need by journalists and media monitoring companies to cluster news in large amounts of web articles, in order to ensure fast access to their topics or events of interest. Our aim in this work ...
  • Vrochidis, Stefanos; Moumtzidou, Anastasia; Gialampoukidis, Ilias; Liparas, Dimitris; Casamayor, Gerard; Wanner, Leo; Heise, Nicolaus; Wagner, Tilman; Bilous, Andriy; Jamin, Emmanuel; Simeonov, Boyan; Alexiev, Vladimir; Busch, Reinhard; Arapakis, Ioannis; Kompatsiaris, Ioannis (Frontiers, 2018)
    Analysts and journalists face the problem of having to deal with very large, heterogeneous, and multilingual data volumes that need to be analyzed, understood, and aggregated. Automated and simplified editorial and authoring ...
  • Ballesteros, Miguel; Wanner, Leo (ACL (Association for Computational Linguistics), 2016)
    Even syntactically correct sentences are perceived as awkward if they do not contain correct punctuation. Still, the problem of automatic generation of punctuation marks has been largely neglected for a long time. We/npresent ...
  • Nazar, Rogelio (Universitat Pompeu Fabra, 2010-10-01)
    The present research focuses on the study of the distribution of lexis in corpus and its aim is to inquire into the relations that exist between concepts through the occurrences of the terms that designate them. The initial ...
  • Soler Company, Juan; Wanner, Leo (LREC, 2016)
    In most of the research studies on Author Profiling, large quantities of correctly labeled data are used to train the models. However, this does not reflect the reality in forensic scenarios: in practical linguistic forensic ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2017)
    This paper presents a demonstration of a stochastic prosody tool for enrichment of synthesized speech using SSML prosody tags applied over hierarchical thematicity spans in the context of a CTS application. The motivation ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (COLING, 2016)
    Speech prosody is known to be central in advanced communication technologies. However, despite the advances of theoretical studies in speech prosody, so far, no large scale prosody annotated resources that would facilitate ...
  • Pérez-Mayos, Laura; Táboas García, Alba; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Multilingual Transformer-based language models, usually pretrained on more than 100 languages, have been shown to achieve outstanding results in a wide range of crosslingual transfer tasks. However, it remains unknown ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Fisas Elizalde, Beatriz; Espinosa-Anke, Luis; Codina Filbà, Joan; Wanner, Leo (ACL (Association for Computational Linguistics), 2020)
    Collocations in the sense of idiosyncratic lexical co-occurrences of two syntactically bound words traditionally pose a challenge to language learners and many Natural Language Processing (NLP) applications alike. Reliable ...
  • Rodríguez Fernández, Sara (Universitat Pompeu Fabra, 2018-03-19)
    Suele admitirse que las colocaciones en el sentido de coocurrencias idiosincráticas de palabras son un reto en el aprendizaje de lenguas. Los estudiantes producen frecuentemente combinaciones “agramaticales”' como *dar una ...
  • Domínguez Bajo, Mónica; Farrús, Mireia; Wanner, Leo (International Speech Communication Association (ISCA), 2016)
    Intonation is traditionally considered to be the most important prosodic feature, whereupon an important research effort has been devoted to automatic segmentation and labeling of speech samples to grasp intonation cues. ...
  • Domínguez Bajo, Mónica; Burga Díaz, Alicia; Farrús, Mireia; Wanner, Leo (ELRA (European Language Resources Association), 2018)
    Theoretical studies on the Information Structure–prosody interface argue that the content packaged in terms of theme and rheme correlates with the intonation of the corresponding sentence. However, there are few empirical ...
  • Ballesteros, Miguel; Bohnet, Bernd; Mille, Simon; Wanner, Leo (Cambridge University Press, 2016)
    ‘Deep-syntactic’ dependency structures that capture the argumentative, attributive and co-/nordinative relations between full words of a sentence have a great potential for a number/nof NLP-applications. The abstraction ...
  • Ballesteros, Miguel; Bohnet, Bernd; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2015)
    Abstract structures from which the generation naturally starts often do not contain any functional nodes, while surface-syntactic structures or a chain of tokens in a linearized tree contain all of them. Therefore, data-driven ...
  • Mille, Simon (Universitat Pompeu Fabra, 2014-07-25)
    The present Ph.D. thesis addresses the problem of deep data-driven Natural Language Generation (NLG), and in particular the role of proper corpus annotation schemata for stochastic sentence realization. The lack of multilevel ...
  • Wanner, Leo; André, Elisabeth; Blat, Josep; Dasiopoulou, Stamatia; Farrús, Mireia; Fraga, Thiago; Kamateri, Eleni; Lingenfelser, Florian; Llorach, Gerard; Martínez, Oriol; Meditskos, Georgios; Mille, Simon; Minker, Wolfgang; Pragst, Louisa; Schiller, Dominik; Stam, Andries; Stellingwerff, Ludo; Sukno, Federico Mateo; Vieru, Bianca; Vrochidis, Stefanos (Elsevier, 2017)
    We present work in progress on an intelligent embodied conversation agent that is supposed to act as a social companion with linguistic and emotional competence in the context of basic and health care. The core of the agent ...
  • Espinosa-Anke, Luis; Codina Filbà, Joan; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Lexical collocations are idiosyncratic combinations of two syntactically bound lexical items (e.g., “heavy rain”, “take a step” or “undergo surgery”). Understanding their degree of compositionality and idiosyncrasy, as ...