Welcome to the UPF Digital Repository

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Title

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Title

Sort by: Order: Results:

  • Dzhambazov, Georgi Bogomilov; Holzapfel, Andre; Srinivasamurthy, Ajay; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    The goal of this study is the automatic detection of onsets of the singing voice in polyphonic audio recordings. Starting with a hypothesis that the knowledge of the current position in a metrical cycle (i.e. metrical ...
  • Amatriain, Xavier; Bonada, Jordi, 1973-; Serra, Xavier (International Conference on Digital Audio Effects, 1998)
    Since the MIDI 1.0 specification, well over 15 years ago, many have been the attempts to give a solution to all the limitations that soon became clear. None of these have had a happy ending, mainly due to commercial ...
  • Gulati, Sankalp; Serrà Julià, Joan; Ishwar, Vignesh; Serra, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2014)
    Discovery of repeating structures in music is fundamental to its analysis, understanding and interpretation. We present a data-driven approach for the discovery of shorttime melodic patterns in large collections of ...
  • Porter, Alastair; Bogdanov, Dmitry; Serra, Xavier (ACM Association for Computer Machinery, 2016)
    Semantic annotations of music collections in digital libraries are important for organization and navigation of the collection. These annotations and their associated metadata are useful in many Music Information Retrieval ...
  • Urbano, Julián (Music Information Retrieval Evaluation eXchange (MIREX), 2013)
    This short paper describes our three submissions to the 2013 edition of the MIREX Symbolic Melodic Similarity task. All three submissions rely on a geometric model that represents melodies as spline curves in the pitch-time ...
  • Anantapadmanabhan, Akshay; Bellur, Ashwin; Murthy, Hema A. (Institute of Electrical and Electronics Engineers (IEEE), 2013)
    In this paper we use a Non-negative Matrix Factorization/n(NMF) based approach to analyze the strokes of the mri-/ndangam, a South Indian hand drum, in terms of the normal/nmodes of the instrument. Using ...
  • Blaauw, Merlijn; Bonada, Jordi, 1973- (International Speech Communication Association (ISCA), 2016)
    Latent generative models can learn higher-level underlying factors from complex data in an unsupervised manner. Such models can be used in a wide range of speech processing applications, including synthesis, transformation ...
  • Dzhambazov, Georgi Bogomilov; Serra, Xavier (Music Technology Research Group, Department of Computer Science, Maynooth University, 2015)
    In this work we propose how to modify a standard scheme for text-to-speech alignment for the alignment of lyrics and singing voice. To this end we model the duration of phonemes specific for the case of singing. We rely ...
  • Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (International Society for Music Information Retrieval (ISMIR), 2017)
    Score information has been shown to improve music source separation when included into non-negative matrix factorization (NMF) frameworks. Recently, deep learning approaches have outperformed NMF methods in terms ...
  • Chandna, Pritish; Miron, Marius; Janer Mestres, Jordi; Gómez Gutiérrez, Emilia, 1975- (Springer, 2017)
    In this paper we introduce a low-latency monaural source separation framework using a Convolutional Neural Network (CNN). We use a CNN to estimate time-frequency soft masks which are applied for source separation. We ...
  • Rezaeirowshan, Babak; Ballester, Coloma; Haro Ortega, Gloria (SCITEPRESS – Science and Technology Publications, Lda., 2016)
    In this paper we propose a method to estimate a global depth order between the objects of a scene using information from a single image coming from an uncalibrated camera. The method we present stems from early vision cues ...
  • Karakurt, Altug; Sentürk, Sertan; Serra, Xavier (ACM Association for Computer Machinery, 2016)
    In the general sense, mode defines the melodic framework and tonic acts as the reference tuning pitch for the melody in the performances of many music cultures. The mode and tonic information of the audio recordings is ...
  • Oliver, Maria; Raad, Lara; Ballester, Coloma; Haro Ortega, Gloria (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    This work presents an automatic method for optical flow inpainting. Given a video, each frame domain is endowed with a Riemannian metric based on the video pixel values. The missing optical flow is recovered by solving the ...
  • Bellur, Ashwin; Ishwar, Vignesh; Murthy, Hema A. (Universitat Pompeu Fabra, 2012)
    A raga is a collective melodic expression consisting of motifs. A raga can be identified using motifs which are/nunique to it. Motifs can be thought of as signature prosodic phrases. Different ragas may be composed of the ...
  • Santos, Maria; Schaper, Marie-Monique; Parés, Narcís, 1966- (ACM Association for Computer Machinery, 2017)
    This paper presents a brief overview of the design and evaluation process of a Virtual Heritage (VH) experience for children in the context of Refugi 307, a bomb shelter built during the Spanish Civil War. The shelter ...
  • Ruiz Ovejero, Adrià; Rudovic, Ognjen; Binefa i Valls, Xavier; Pantic, Maja (Springer, 2017)
    In this paper, we address the Multi-Instance-Learning (MIL) problem when bag labels are naturally represented as ordinal variables (Multi-Instance-Ordinal Regression). Moreover, we consider the case where bags are temporal ...
  • Oramas, Sergio; Nieto Caballero, Oriol; Barbieri, Francesco; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    Music genres allow to categorize musical items that share common characteristics. Although these categories are not mutually exclusive, most related research is traditionally focused on classifying tracks into a single ...
  • Accuosto, Pablo; Ronzano, Francesco; Ferrés, Daniel; Saggion, Horacio (2017)
    We present a system to mine and visualize collections of scientific documents by semantically browsing information extracted from single publications or aggregated throughout corpora of articles. The text mining tool ...
  • Barbieri, Francesco; Marujo, Luís; Karuturi, Pradeep; Brendel, William (CEUR Workshop Proceedings, 2018)
    Emojis are very common in social media and understanding their underlying semantics is of great interest from a Natural Language Processing point of view. In this work, we investigate emoji prediction in short ...
  • Papiotis, Panos; Marchini, Marco, 1984-; Maestre Gómez, Esteban (Associoation Européenne des Conservatoires, 2013)
    In a musical ensemble such as a string quartet, the performers can influence each other’s actions in several aspects of the performance simultaneously. Based on a set of recorded string quartet exercises, we carried out a ...

Search DSpace


Advanced Search

Browse

My Account

Compliant to Partaking