Welcome to the UPF Digital Repository

Browsing Informes (Departament de Tecnologies de la Informació i les Comunicacions) by Author "Serra, Xavier"

  • Plaja-Roglans, Genís; Miron, Marius; Serra, Xavier (2022-09-22)
    Notable progress in music source separation has been achieved using multi-branch networks that operate on both temporal and spectral domains. However, such networks tend to be complex and heavy-weighted. In this work, we ...
  • Serra, Xavier (1988)
    This paper describes an environment developed at CCRMA for the analysis, transformation, and resynthesis of sounds. It has been written on a Lisp Machine workstation, using an Array Processor to speed up the signal processing ...
  • Cortès, Guillem; Ciurana, Alex; Molina, Emilio; Miron, Marius; Meyers, Owen; Six, Joren; Serra, Xavier (2022-09-21)
    Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for ...
  • Morsi, Alia; Serra, Xavier (2022-09-22)
    Although audio to score alignment is a classic Music Information Retrieval problem, it has not been defined uniquely with the scope of musical scenarios representing its core. The absence of a unified vision makes it ...
  • Kim, Hyon; Serra, Xavier (2023-08-31)
    In any piano performance, expressiveness is paramount for effectively conveying the intent of the performer, and one of the most significant aspects of expressiveness is the loudness at the individual key or note level. ...
  • Correya, Albin Andrew; Bogdanov, Dmitry; Alonso Jiménez, Pablo; Serra, Xavier (2023-01-10)
    We present Essentia API, a web API to access a collection of state-of-the-art music audio analysis and description algorithms based on Essentia, an open-source library and machine learning (ML) models for audio and music ...
  • Cano Vila, Pedro; Gómez Gutiérrez, Emilia, 1975-; Gouyon, Fabien; Herrera Boyer, Perfecto, 1964-; Koppenberger, Markus; Ong, Bee Suan; Serra, Xavier; Streich, Sebastian; Wack, Nicolas (2006)
    In this paper we report on the ISMIR 2004 Audio Description Contest. We first detail the contest organization, evaluation metrics, data and infrastructure. We then provide the details and ...
  • Bogdanov, Dmitry; Lizarraga Seijas, Xavier; Alonso-Jiménez, Pablo; Serra, Xavier (2022-09-27)
    We present MusAV, a new public benchmark dataset for comparative validation of arousal and valence (AV) regression models for audio-based music emotion recognition. To gather the ground truth, we rely on relative ...
  • Alonso-Jiménez, Pablo; Serra, Xavier; Bogdanov, Dmitry (2022-09-22)
    This paper revisits the idea of music representation learning supervised by editorial metadata, contributing to the state of the art in two ways. First, we exploit the public editorial metadata available on Discogs, an ...
  • Kim, Hyon; Miron, Marius; Serra, Xavier (2023-01-16)
    Piano is one of the most popular music instruments. During the piano performance, loudness is an important factor for expressiveness, alongside tempo, changes in dynamics play with expectation, convey various emotions, ...
  • Alonso-Jiménez, Pablo; Favory, Xavier; Foroughmand, Hadrien; Bourdalas, Grigoris; Serra, Xavier; Lidy, Thomas; Bogdanov, Dmitry (2023-04-25)
    In this work, we investigate an approach that relies on contrastive learning and music metadata as a weak source of supervision to train music representation models. Recent studies show that contrastive learning can be ...
  • Kim, Hyon; Miron, Marius; Serra, Xavier (2023-05-12)
    Piano is one of the most popular instruments among people that learn to play music. When playing the piano, the level of loudness is crucial for expressing emotions as well as manipulating tempo. These elements convey ...
  • Tamer, Nazif C; Özer, Yigitcan; Müller, Meinard; Serra, Xavier (2023-04-25)
    Pitch estimation of a target musical source within a multi-source polyphonic signal is of great interest for music performance analysis. One possible approach for extracting the pitch of a target source is to first perform ...
  • Tamer, Nazif C; Ramoneda, Pedro; Serra, Xavier (2022-09-22)
    Violin performance analysis requires accurate and robust f0 estimates to give feedback on the playing accuracy. Despite the recent advancements in data-driven f0 estimators, their application to performance analysis remains ...
