Welcome to the UPF Digital Repository

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Title


  • Font Corbera, Frederic; Serrà Julià, Joan; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2012)
    Collaborative tagging has emerged as an efficient way to semantically describe online resources shared by a community of users. However, tag descriptions present some drawbacks such as tag scarcity or concept ...
  • Mille, Simon; Carlini, Roberto; Burga Díaz, Alicia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    We present the contribution of Universitat Pompeu Fabra’s NLP group to the Sem-Eval Task 9.2 (AMR-to-English Generation). The proposed generation pipeline comprises: (i) a series of rule-based graph transducers for the ...
  • Furelos Blanco, Daniel; Jonsson, Anders, 1973-; Palacios Verdes, Héctor Luis; Jiménez, Sergio (Association for the Advancement of Artificial Intelligence (AAAI) - Congrés ICAPS17, 2018)
    In this paper we describe STP, a novel algorithm for temporal planning. Similar to several existing temporal planners, STP relies on a transformation from temporal planning to classical planning, and constructs a temporal ...
  • Akkermans, Vincent; Font Corbera, Frederic; Funollet, Jordi; de Jong, Bram; Roma Trepat, Gerard; Togias, Stelios; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2011)
    Freesound.org is an online collaborative sound database where people from different disciplines share recorded sound clips under Creative Commons licenses. It was started in 2005 and it is being further developed by the ...
  • Fonseca, Eduardo; Pons Puig, Jordi; Favory, Xavier; Font Corbera, Frederic; Bogdanov, Dmitry; Ferraro, Andrés; Oramas, Sergio; Porter, Alastair; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    Openly available datasets are a key factor in the advancement of data-driven research approaches, including many of the ones used in sound and music computing. In the last few years, quite a number of new audio datasets ...
  • Font Corbera, Frederic (2017)
    Freesound Explorer is a visual interface for exploring Freesound content in a two-dimensional space and creating music by linking content in that space. Freesound Explorer is implemented as a web application which takes ...
  • Roma Trepat, Gerard; Herrera Boyer, Perfecto, 1964-; Serra, Xavier (2009)
    The habit of sharing media online has created a platform with great potential for creative applications that are accessible to large numbers of users with very different backgrounds. As an example, a lively community has ...
  • Albó Pérez, Laia; Gelpí Arroyo, Cristina (2017)
    This paper presents a case study of transforming an existing MOOC into a SPOC for being used in a campus course using a blended learning approach with the aim of providing a reflection of the experience and reporting the ...
  • Ruiz, Adrià; Van de Weijer, Joost; Binefa i Valls, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    Limited annotated training data is a challenging problem in Action Unit recognition. In this paper, we investigate how the use of large databases labelled according to the 6 universal facial expressions can increase the ...
  • Boutros, Joseph Jean; Zémor, Gilles; Guillén i Fàbregas, Albert; Biglieri, Ezio (Institute of Electrical and Electronics Engineers (IEEE), 2008)
    We show how to build full-diversity product codes under both iterative encoding and decoding over non-ergodic channels, in presence of block erasure and block fading. The concept of a rootcheck or a root subcode is introduced ...
  • Ruiz, Adrià; Martinez, Oriol; Binefa i Valls, Xavier; Sukno, Federico Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    An essential issue when training and validating computer vision systems for affect analysis is how to obtain reliable ground-truth labels from a pool of subjective annotations. In this paper, we address this problem ...
  • Galdran, Adrian; Vazquez-Corral, Javier; Pardo, David; Bertalmío, Marcelo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    We propose a novel image-dehazing technique based on the minimization of two energy functionals and a fusion scheme to combine the output of both optimizations. The proposed fusion-based variational image-dehazing (FVID) ...
  • Farrús, Mireia; Anguita, Jan; Hernando, Javier; Cerdà, Ramon (Dirección Xeral de Creación e Difusión Cultural, 2007)
    The automatic speaker recognition systems most widely used in recent years are based on low-level acoustic features, such as spectral magnitudes, formant frequencies, etc. ...
  • Zamir, Syed Waqas; Vazquez-Corral, Javier; Bertalmío, Marcelo (Society of Photo Optical Instrumentation Engineers (SPIE), 2015)
    Wide gamut digital display technology, in order to show its full potential in terms of colors, is creating an opportunity to develop gamut extension algorithms (GEAs). To this end, in this work we present two contributions. ...
  • Fonseca, Eduardo; Plakal, Manoj; Font, Frederic; Ellis, Daniel P. W.; Favory, Xavier; Pons Puig, Jordi; Serra, Xavier (Tampere University of Technology, 2018)
    This paper describes Task 2 of the DCASE 2018 Challenge, titled “General-purpose audio tagging of Freesound content with AudioSet labels”. This task was hosted on the Kaggle platform as “Freesound General-Purpose Audio ...
  • Batard, Thomas; Bertalmío, Marcelo (Springer, 2013)
    We introduce a gradient operator that generalizes the Euclidean and Riemannian gradients. This operator acts on sections of vector bundles and is determined by three geometric data: a Riemannian metric on the base ...
  • Boutros, Joseph Jean; Zémor, Gilles; Guillén i Fàbregas, Albert; Biglieri, Ezio (Institute of Electrical and Electronics Engineers (IEEE), 2008)
    A new graph-based construction of generalized low density codes (GLD-Tanner) with binary BCH constituents is described. The proposed family of GLD codes is optimal on block erasure channels and quasi-optimal on block fading ...
  • Segovia-Aguas, Javier; Jiménez, Sergio; Jonsson, Anders, 1973- (IJCAI, 2017)
    This paper presents a novel approach for generating Context-Free Grammars (CFGs) from small sets of input strings (a single input string in some cases). Our approach is to compile this task into a classical planning ...
  • Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (Aalto University, 2017)
    Deep learning approaches have become increasingly popular in estimating time-frequency masks for audio source separation. However, training neural networks usually requires a considerable amount of data. Music data is ...
  • Maestre Gómez, Esteban; Pérez Carrillo, Alfonso Antonio, 1977-; Ramírez, Rafael, 1966- (International Computer Music Association, 2010)
    This paper presents a framework in which samples of bowing gesture parameters are retrieved and concatenated from a database of violin performances by attending to an annotated input score. Resulting bowing parameter signals ...
