Benvinguts al Repositori Digital de la UPF

Visualitzant Departament de Tecnologies de la Informació i les Comunicacions per data de publicació

Visualitzant Departament de Tecnologies de la Informació i les Comunicacions per data de publicació

Ordena per: Ordre: Resultats:

  • Serra, Xavier (1988)
    This paper describes an environment developed at CCRMA for the analysis, transformation, and resynthesis of sounds. It has been written on a Lisp Machine workstation, using an Array Processor to speed up the signal processing ...
  • Cano Vila, Pedro; Gómez Gutiérrez, Emilia, 1975-; Gouyon, Fabien; Herrera Boyer, Perfecto, 1964-; Koppenberger, Markus; Ong, Bee Suan; Serra, Xavier; Streich, Sebastian; Wack, Nicolas (2006)
    In this paper we report on the ISMIR 2004 Audio Description Contest. We first detail the contest organization, evaluation metrics, data and infrastructure. We then provide the details and ...
  • Gómez Gutiérrez, Emilia, 1975-; Streich, Sebastian; Ong, Bee Suan; Paiva, Rui Pedro; Tappert, Sven; Batke, Jan-Mark; Poliner, Graham; Ellis, Daniel P. W.; Bello, Juan Pablo (Universitat Pompeu Fabra, 2006)
    This paper provides an overview of current state-of-the-art approaches for melody extraction from polyphonic audio recordings, and it proposes a methodology for the quantitative evaluation of melody extraction algorithms. ...
  • Oliver Riera, Miquel; Hernández Leo, Davinia; Daza, Vanesa; Martín i Badell, Carles; Albó, Laia (2014-01)
    Podemos afirmar que España se ha situado en muy poco tiempo, y de forma sorprendente, en el/ngrupo líder de países que más actividad están generando entorno a los cursos masivos en línea/nabiertos o MOOCs (del inglés Massive ...
  • Oliver Riera, Miquel; Hernández Leo, Davinia; Albó, Laia (2015-11)
    Este nuevo informe realizado por la Cátedra de Telefónica de la Universitat Pompeu Fabra/nrepresenta un segundo análisis del fenómeno MOOC (Massive Open Online Course) desde una/nperspectiva más centrada en la demanda de ...
  • Mille, Simon; Dasiopoulou, Stamatia (2017)
    This paper describes the FORGe generator at WebNLG. The input DBpedia triples are mapped onto sentences by applying a series of rule-based graph-transducers and aggregation grammars to template predicate-argument structures ...
  • Mille, Simon; Dasiopoulou, Stamatia (Universitat Pompeu Fabra, 2017)
    This paper describes the FORGe generator at E2E. The input triples are mapped onto sentences by applying a series of rule-based graph-transducers and aggregation grammars to template predicate-argument structures associated ...
  • Rankothge, Windhya; Le, Franck; Russo, Alessandra; Lobo, Jorge (2017)
    To conduct a more realistic evaluations on resource allocation algorithms for Virtualized Network Functions (VNFs), researches need data on: (1) potential Network Functions (NFs) chains (policies), (2) traffic flows ...
  • Marco, Álvaro M.; Bernat, Nadia P.; De Vivo, Francesco; Ortega, Juan; Isern, Alejandra; Oviedo, Óscar; Sánchez, Paula (2020)
    ​En el seno de los Premis Enginy COVID19 UPF, el grupo de estudiantes MASKIN y con no otro ánimo que la libre difusión de los resultados alcanzados, presenta su portfolio final que sirva de inspiración a terceros, estudiantes ...
  • Cortès, Guillem; Ciurana, Alex; Molina, Emilio; Miron, Marius; Meyers, Owen; Six, Joren; Serra, Xavier (2022-09-21)
    Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for ...
  • Plaja-Roglans, Genís; Miron, Marius; Serra, Xavier (2022-09-22)
    Notable progress in music source separation has been achieved using multi-branch networks that operate on both temporal and spectral domains. However, such networks tend to be complex and heavy-weighted. In this work, we ...
  • Alonso-Jiménez, Pablo; Serra, Xavier; Bogdanov, Dmitry (2022-09-22)
    This paper revisits the idea of music representation learning supervised by editorial metadata, contributing to the state of the art in two ways. First, we exploit the public editorial metadata available on Discogs, an ...
  • Tamer, Nazif C; Ramoneda, Pedro; Serra, Xavier (2022-09-22)
    Violin performance analysis requires accurate and robust f0 estimates to give feedback on the playing accuracy. Despite the recent advancements in data-driven f0 estimators, their application to performance analysis remains ...
  • Morsi, Alia; Serra, Xavier (2022-09-22)
    Although audio to score alignment is a classic Music Information Retrieval problem, it has not been defined uniquely with the scope of musical scenarios representing its core. The absence of a unified vision makes it ...
  • Nuttall, Thomas; Plaja-Roglans, Genís; Pearson, Lara; Sierra, Xavier (2022-09-22)
    Carnatic Music is a South Indian art and devotional musical practice in which melodic patterns (motifs and phrases), known as sañcāras, play a crucial structural and expressive role. We demonstrate how the combination of ...
  • Bogdanov, Dmitry; Lizarraga Seijas, Xavier; Alonso-Jiménez, Pablo; Serra, Xavier (2022-09-27)
    We present MusAV, a new public benchmark dataset for comparative validation of arousal and valence (AV) regression models for audio-based music emotion recognition. To gather the ground truth, we rely on relative ...
  • Correya, Albin Andrew; Bogdanov, Dmitry; Alonso Jiménez, Pablo; Serra, Xavier (2023-01-10)
    We present Essentia API, a web API to access a collection of state-of-the-art music audio analysis and description algorithms based on Essentia, an open-source library and machine learning (ML) models for audio and music ...
  • Kim, Hyon; Miron, Marius; Serra, Xavier (2023-01-16)
    Piano is one of the most popular music instruments. During the piano performance, loudness is an important factor for expressiveness, alongside tempo, changes in dynamics play with expectation, convey various emotions, ...
  • Alonso-Jiménez, Pablo; Favory, Xavier; Foroughmand, Hadrien; Bourdalas, Grigoris; Serra, Xavier; Lidy, Thomas; Bogdanov, Dmitry (2023-04-25)
    In this work, we investigate an approach that relies on contrastive learning and music metadata as a weak source of supervision to train music representation models. Recent studies show that contrastive learning can be ...
  • Tamer, Nazif C; Özer, Yigitcan; Müller, Meinard; Serra, Xavier (2023-04-25)
    Pitch estimation of a target musical source within a multi-source polyphonic signal is of great interest for music performance analysis. One possible approach for extracting the pitch of a target source is to first perform ...

Cerca


Cerca avançada

Visualitza

El meu compte

Amb col·laboració de Complim Participem