Welcome to the UPF Digital Repository

Browsing Conferences (Department of Information and Communication Technologies) by title

  • Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (International Society for Music Information Retrieval (ISMIR), 2017)
    Score information has been shown to improve music source separation when included into non-negative matrix factorization (NMF) frameworks. Recently, deep learning approaches have outperformed NMF methods in terms ...
  • Chandna, Pritish; Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (Springer, 2017)
    In this paper we introduce a low-latency monaural source separation framework using a Convolutional Neural Network (CNN). We use a CNN to estimate time-frequency soft masks which are applied for source separation. We ...
  • Rezaeirowshan, Babak; Ballester, Coloma; Haro Ortega, Gloria (SCITEPRESS – Science and Technology Publications, Lda., 2016)
    In this paper we propose a method to estimate a global depth order between the objects of a scene using information from a single image coming from an uncalibrated camera. The method we present stems from early vision cues ...
  • Karakurt, Altug; Sentürk, Sertan; Serra, Xavier (ACM Association for Computing Machinery, 2016)
    In the general sense, mode defines the melodic framework and tonic acts as the reference tuning pitch for the melody in the performances of many music cultures. The mode and tonic information of the audio recordings is ...
  • Oliver, Maria; Raad, Lara; Ballester, Coloma; Haro Ortega, Gloria (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    This work presents an automatic method for optical flow inpainting. Given a video, each frame domain is endowed with a Riemannian metric based on the video pixel values. The missing optical flow is recovered by solving the ...
  • Bellur, Ashwin; Ishwar, Vignesh; Murthy, Hema A. (Universitat Pompeu Fabra, 2012)
    A raga is a collective melodic expression consisting of motifs. A raga can be identified using motifs which are unique to it. Motifs can be thought of as signature prosodic phrases. Different ragas may be composed of the ...
  • Santos, Maria; Schaper, Marie-Monique; Parés, Narcís, 1966- (ACM Association for Computing Machinery, 2017)
    This paper presents a brief overview of the design and evaluation process of a Virtual Heritage (VH) experience for children in the context of Refugi 307, a bomb shelter built during the Spanish Civil War. The shelter ...
  • Ruiz Ovejero, Adrià; Rudovic, Ognjen; Binefa i Valls, Xavier; Pantic, Maja (Springer, 2017)
    In this paper, we address the Multi-Instance-Learning (MIL) problem when bag labels are naturally represented as ordinal variables (Multi-Instance-Ordinal Regression). Moreover, we consider the case where bags are temporal ...
  • Oramas, Sergio; Nieto Caballero, Oriol; Barbieri, Francesco; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    Music genres allow us to categorize musical items that share common characteristics. Although these categories are not mutually exclusive, most related research is traditionally focused on classifying tracks into a single ...
  • Accuosto, Pablo; Ronzano, Francesco; Ferrés, Daniel; Saggion, Horacio (2017)
    We present a system to mine and visualize collections of scientific documents by semantically browsing information extracted from single publications or aggregated throughout corpora of articles. The text mining tool ...
  • Barbieri, Francesco; Marujo, Luís; Karuturi, Pradeep; Brendel, William (CEUR Workshop Proceedings, 2018)
    Emojis are very common in social media and understanding their underlying semantics is of great interest from a Natural Language Processing point of view. In this work, we investigate emoji prediction in short ...
  • Papiotis, Panos; Marchini, Marco, 1984-; Maestre Gómez, Esteban (Association Européenne des Conservatoires, 2013)
    In a musical ensemble such as a string quartet, the performers can influence each other’s actions in several aspects of the performance simultaneously. Based on a set of recorded string quartet exercises, we carried out a ...
  • Perez Miguel, Naiara; Cuadros Oller, Montse (ACL (Association for Computational Linguistics), 2017)
    This paper describes a web-based application to design and answer exercises for language learning. It is available in Basque, Spanish, English, and French. Based on open-source Natural Language Processing (NLP) ...
  • Agerri, Rodrigo; Aldabe, Itziar; Laparra, Egoitz; Rigau Claramunt, German; Fokkens, Antske; Huijgen, Paul; Izquierdo Beviá, Rubén; van Erp, Marieke; Vossen, Piek; Minard, Anne-Lyse; Magnini, Bernardo (International Conference on Language Resources and Evaluation (LREC), 2016)
    We describe a novel modular system for cross-lingual event extraction for English, Spanish, Dutch and Italian texts. The system consists of a ready-to-use modular set of advanced multilingual Natural Language Processing ...
  • Barbieri, Francesco; Ballesteros, Miguel; Ronzano, Francesco; Saggion, Horacio (ACL (Association for Computational Linguistics), 2018)
    Emojis are small images that are commonly included in social media text messages. The combination of visual and textual content in the same message builds up a modern way of communication, that automatic systems are not ...
  • Farrús, Mireia; Luque, Jordi; Morros, R.; Anguita, Jan; Macho, D.; Marqués, Marta; Martínez, C.; Vilaplana, V.; Hernando, Javier (Universidad de Zaragoza, 2006)
    In this paper we present a person identification system based on a combination of acoustic features and 2D face images. We address the modality integration issue on the example of a smart room environment. In order to ...
  • Angelosante, Daniele; Biglieri, Ezio; Lops, Marco (European Association for Signal Processing (EURASIP), 2008)
    This paper presents several algorithms for joint estimation of the target number and state in a time-varying scenario. Building on the results presented in [1], which considers estimation of the target number only, we ...
  • Saggion, Horacio; Ronzano, Francesco; Accuosto, Pablo; Ferrés, Daniel (CEUR Workshop Proceedings, 2017)
    In the current online Open Science context, scientific datasets and tools for deep text analysis, visualization and exploitation play a major role. We present a system for deep analysis and annotation of scientific text ...
  • Vrochidis, Stefanos; Kompatsiaris, Ioannis; Casamayor, Gerard; Arapakis, Ioannis; Busch, Reinhard; Alexiev, Vladimir; Jamin, Emmanuel; Jugov, Michael; Heise, Nicolaus; Forrellat, Teresa; Liparas, Dimitris; Wanner, Leo (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    This paper presents an overview and the first results of the FP7 MULTISENSOR project, which deals with multidimensional content integration of multimedia content for intelligent sentiment enriched and context ...
  • Angelosante, Daniele; Biglieri, Ezio; Lops, Marco (European Association for Signal Processing (EURASIP), 2008)
    In multiuser detection, the set of users active at any time may be unknown to the receiver. In these conditions, optimum reception consists of detecting simultaneously the set of active users and their data, a problem that ...