Welcome to the UPF Digital Repository

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Title

  • Fonseca, Eduardo; Pons Puig, Jordi; Favory, Xavier; Font Corbera, Frederic; Bogdanov, Dmitry; Ferraro, Andrés; Oramas, Sergio; Porter, Alastair; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    Openly available datasets are a key factor in the advancement of data-driven research approaches, including many of the ones used in sound and music computing. In the last few years, quite a number of new audio datasets ...
  • Font Corbera, Frederic (2017)
    Freesound Explorer is a visual interface for exploring Freesound content in a two-dimensional space and creating music by linking content in that space. Freesound Explorer is implemented as a web application which takes ...
  • Roma Trepat, Gerard; Herrera Boyer, Perfecto; Serra, Xavier (2009)
    The habit of sharing media online has created a platform with great potential for creative applications that are accessible to large numbers of users with very different backgrounds. As an example, a lively community has ...
  • Albó Pérez, Laia; Gelpí Arroyo, Cristina (2017)
    This paper presents a case study of transforming an existing MOOC into a SPOC for use in a campus course with a blended learning approach, with the aim of providing a reflection on the experience and reporting the ...
  • Ruiz, Adrià; Van de Weijer, Joost; Binefa i Valls, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    Limited annotated training data is a challenging problem in Action Unit recognition. In this paper, we investigate how the use of large databases labelled according to the 6 universal facial expressions can increase the ...
  • Boutros, Joseph Jean; Zémor, Gilles; Guillén i Fàbregas, Albert; Biglieri, Ezio (Institute of Electrical and Electronics Engineers (IEEE), 2008)
    We show how to build full-diversity product codes under both iterative encoding and decoding over non-ergodic channels, in presence of block erasure and block fading. The concept of a rootcheck or a root subcode is introduced ...
  • Ruiz, Adrià; Martinez, Oriol; Binefa i Valls, Xavier; Sukno, Federico Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    An essential issue when training and validating computer vision systems for affect analysis is how to obtain reliable ground-truth labels from a pool of subjective annotations. In this paper, we address this problem ...
  • Galdran, Adrian; Vazquez-Corral, Javier; Pardo, David; Bertalmío, Marcelo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    We propose a novel image-dehazing technique based on the minimization of two energy functionals and a fusion scheme to combine the output of both optimizations. The proposed fusion-based variational image-dehazing (FVID) ...
  • Farrús, Mireia; Anguita, Jan; Hernando, Javier; Cerdà, Ramon (Dirección Xeral de Creación e Difusión Cultural, 2007)
    The automatic speaker recognition systems most widely used in recent years are based on low-level acoustic features, such as spectral magnitudes, formant frequencies, etc. ...
  • Zamir, Syed Waqas; Vazquez-Corral, Javier; Bertalmío, Marcelo (Society of Photo Optical Instrumentation Engineers (SPIE), 2015)
    Wide gamut digital display technology, in order to show its full potential in terms of colors, is creating an opportunity to develop gamut extension algorithms (GEAs). To this end, in this work we present two contributions. ...
  • Fonseca, Eduardo; Plakal, Manoj; Font, Frederic; Ellis, Daniel P. W.; Favory, Xavier; Pons, Jordi; Serra, Xavier (Tampere University of Technology, 2018)
    This paper describes Task 2 of the DCASE 2018 Challenge, titled “General-purpose audio tagging of Freesound content with AudioSet labels”. This task was hosted on the Kaggle platform as “Freesound General-Purpose Audio ...
  • Boutros, Joseph Jean; Zémor, Gilles; Guillén i Fàbregas, Albert; Biglieri, Ezio (Institute of Electrical and Electronics Engineers (IEEE), 2008)
    A new graph-based construction of generalized low density codes (GLD-Tanner) with binary BCH constituents is described. The proposed family of GLD codes is optimal on block erasure channels and quasi-optimal on block fading ...
  • Segovia-Aguas, Javier; Jiménez, Sergio; Jonsson, Anders, 1973- (2017)
    This paper presents a novel approach for generating Context-Free Grammars (CFGs) from small sets of input strings (a single input string in some cases). Our approach is to compile this task into a classical planning ...
  • Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (Aalto University, 2017)
    Deep learning approaches have become increasingly popular in estimating time-frequency masks for audio source separation. However, training neural networks usually requires a considerable amount of data. Music data is ...
  • Maestre Gómez, Esteban; Pérez Carrillo, Alfonso Antonio, 1977-; Ramírez, Rafael, 1966- (International Computer Music Association, 2010)
    This paper presents a framework in which samples of bowing gesture parameters are retrieved and concatenated from a database of violin performances by attending to an annotated input score. Resulting bowing parameter signals ...
  • Knees, Peter; Andersen, Kristina; Jordà Puig, Sergi; Hlatky, Michael; Geiger, Günter; Gaebele, Wulf; Kaurson, Roman (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    We present the GiantSteps project, an EU-funded project involving institutions from academia, practitioners, and industrial partners with the goal of developing new concepts for intelligent and collaborative interfaces ...
  • Ramírez Jávega, Miquel; Geffner, Héctor (Association for the Advancement of Artificial Intelligence (AAAI), 2011)
    Plan recognition is the problem of inferring the goals and plans of an agent from partial observations of her behavior. Recently, it has been shown that the problem can be formulated and solved using planners, reducing ...
  • Bandiera, Giuseppe; Romani Picas, Oriol; Tokuda, Hiroshi; Hariya, Wataru; Oishi, Koji; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2016)
    We introduce good-sounds.org, a community-driven framework based on freesound.org to explore the concept of goodness in instrumental sounds. Goodness is considered here as the commonly agreed basic sound quality of ...
  • Hernández Leo, Davinia (2016)
    Transversal competencies such as teamwork or English proficiency are an explicit training objective in several courses of the ICT Engineering degree programs at Universitat Pompeu Fabra ...
  • Derkach, Dmytro; Ruiz Ovejero, Adrià; Sukno, Federico Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    In this paper we present a system that is able to estimate head pose using only depth information from consumer RGB-D cameras such as Kinect 2. In contrast to most approaches addressing this problem, we do not rely on ...
