Inici
→
Recerca: working papers, preprints, informes, etc.
→
Departament de Tecnologies de la Informació i les Comunicacions
→
Visualitzant Departament de Tecnologies de la Informació i les Comunicacions per data de publicació

Visualitzant Departament de Tecnologies de la Informació i les Comunicacions per data de publicació

Ara mostrant els elements 1-20 de 35

Pàgina següent

An Environment for the analysis, transformation and resynthesis of music sounds

Serra, Xavier (1988)

This paper describes an environment developed at CCRMA for the analysis, transformation, and resynthesis of sounds. It has been written on a Lisp Machine workstation, using an Array Processor to speed up the signal processing ...
ISMIR 2004 audio description contest

Cano Vila, Pedro; Gómez Gutiérrez, Emilia, 1975-; Gouyon, Fabien; Herrera Boyer, Perfecto, 1964-; Koppenberger, Markus; Ong, Bee Suan; Serra, Xavier; Streich, Sebastian; Wack, Nicolas (2006)

In this paper we report on the ISMIR 2004 Audio Description Contest. We first detail the contest organization, evaluation metrics, data and infrastructure. We then provide the details and ...
A quantitative comparison of different approaches for melody extraction from polyphonic audio recordings

Gómez Gutiérrez, Emilia, 1975-; Streich, Sebastian; Ong, Bee Suan; Paiva, Rui Pedro; Tappert, Sven; Batke, Jan-Mark; Poliner, Graham; Ellis, Daniel P. W.; Bello, Juan Pablo (Universitat Pompeu Fabra, 2006)

This paper provides an overview of current state-of-the-art approaches for melody extraction from polyphonic audio recordings, and it proposes a methodology for the quantitative evaluation of melody extraction algorithms. ...
MOOCs en España. Panorama actual de los Cursos Masivos Abiertos en Línea en las universidades españolas

Oliver Riera, Miquel; Hernández Leo, Davinia; Daza, Vanesa; Martín i Badell, Carles; Albó, Laia (2014-01)

Podemos afirmar que España se ha situado en muy poco tiempo, y de forma sorprendente, en el/ngrupo líder de países que más actividad están generando entorno a los cursos masivos en línea/nabiertos o MOOCs (del inglés Massive ...
MOOCs en España. Análisis de la demanda. Panorama actual de los Cursos Masivos Abiertos en Línea en la plataforma Miríada X

Oliver Riera, Miquel; Hernández Leo, Davinia; Albó, Laia (2015-11)

Este nuevo informe realizado por la Cátedra de Telefónica de la Universitat Pompeu Fabra/nrepresenta un segundo análisis del fenómeno MOOC (Massive Open Online Course) desde una/nperspectiva más centrada en la demanda de ...
FORGe at E2E 2017

Mille, Simon; Dasiopoulou, Stamatia (2017)

This paper describes the FORGe generator at WebNLG. The input DBpedia triples are mapped onto sentences by applying a series of rule-based graph-transducers and aggregation grammars to template predicate-argument structures ...
FORGe at WebNLG 2017

Mille, Simon; Dasiopoulou, Stamatia (Universitat Pompeu Fabra, 2017)

This paper describes the FORGe generator at E2E. The input triples are mapped onto sentences by applying a series of rule-based graph-transducers and aggregation grammars to template predicate-argument structures associated ...
Data modelling for the evaluation of virtualized network functions resource allocation algorithms

Rankothge, Windhya; Le, Franck; Russo, Alessandra; Lobo, Jorge (2017)

To conduct a more realistic evaluations on resource allocation algorithms for Virtualized Network Functions (VNFs), researches need data on: (1) potential Network Functions (NFs) chains (policies), (2) traffic flows ...
Transparent Facemask, MASKIN: prototipo de mascarilla transparente reutilizable y comprometida con medioambiente y sociedad.

Marco, Álvaro M.; Bernat, Nadia P.; De Vivo, Francesco; Ortega, Juan; Isern, Alejandra; Oviedo, Óscar; Sánchez, Paula (2020)

En el seno de los Premis Enginy COVID19 UPF, el grupo de estudiantes MASKIN y con no otro ánimo que la libre difusión de los resultados alcanzados, presenta su portfolio final que sirva de inspiración a terceros, estudiantes ...
BAF: an audio fingerprinting dataset for broadcast monitoring

Cortès, Guillem; Ciurana, Alex; Molina, Emilio; Miron, Marius; Meyers, Owen; Six, Joren; Serra, Xavier (2022-09-21)

Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for ...
A diffusion-inspired training strategy for singing voice extraction in the waveform domain

Plaja-Roglans, Genís; Miron, Marius; Serra, Xavier (2022-09-22)

Notable progress in music source separation has been achieved using multi-branch networks that operate on both temporal and spectral domains. However, such networks tend to be complex and heavy-weighted. In this work, we ...
Music representation learning based on editorial metadata from discogs

Alonso-Jiménez, Pablo; Serra, Xavier; Bogdanov, Dmitry (2022-09-22)

This paper revisits the idea of music representation learning supervised by editorial metadata, contributing to the state of the art in two ways. First, we exploit the public editorial metadata available on Discogs, an ...
Violin etudes: a comprehensive dataset for f0 estimation and performance analysis

Tamer, Nazif C; Ramoneda, Pedro; Serra, Xavier (2022-09-22)

Violin performance analysis requires accurate and robust f0 estimates to give feedback on the playing accuracy. Despite the recent advancements in data-driven f0 estimators, their application to performance analysis remains ...
Bottlenecks and solutions for audio to score alignment research

Morsi, Alia; Serra, Xavier (2022-09-22)

Although audio to score alignment is a classic Music Information Retrieval problem, it has not been defined uniquely with the scope of musical scenarios representing its core. The absence of a unified vision makes it ...
In search of Sañcāras: tradition-informed repeated melodic pattern recognition in carnatic music

Nuttall, Thomas; Plaja-Roglans, Genís; Pearson, Lara; Sierra, Xavier (2022-09-22)

Carnatic Music is a South Indian art and devotional musical practice in which melodic patterns (motifs and phrases), known as sañcāras, play a crucial structural and expressive role. We demonstrate how the combination of ...
MUSAV: a dataset of relative arousal-valence annotations for validation of audio models

Bogdanov, Dmitry; Lizarraga Seijas, Xavier; Alonso-Jiménez, Pablo; Serra, Xavier (2022-09-27)

We present MusAV, a new public benchmark dataset for comparative validation of arousal and valence (AV) regression models for audio-based music emotion recognition. To gather the ground truth, we rely on relative ...
Essentia API: a web API for music audio analysis

Correya, Albin Andrew; Bogdanov, Dmitry; Alonso Jiménez, Pablo; Serra, Xavier (2023-01-10)

We present Essentia API, a web API to access a collection of state-of-the-art music audio analysis and description algorithms based on Essentia, an open-source library and machine learning (ML) models for audio and music ...
Note level midi velocity estimation for piano performance

Kim, Hyon; Miron, Marius; Serra, Xavier (2023-01-16)

Piano is one of the most popular music instruments. During the piano performance, loudness is an important factor for expressiveness, alongside tempo, changes in dynamics play with expectation, convey various emotions, ...
Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity

Alonso-Jiménez, Pablo; Favory, Xavier; Foroughmand, Hadrien; Bourdalas, Grigoris; Serra, Xavier; Lidy, Thomas; Bogdanov, Dmitry (2023-04-25)

In this work, we investigate an approach that relies on contrastive learning and music metadata as a weak source of supervision to train music representation models. Recent studies show that contrastive learning can be ...
TAPE: An End-to-End Timbre-Aware Pitch Estimator

Tamer, Nazif C; Özer, Yigitcan; Müller, Meinard; Serra, Xavier (2023-04-25)

Pitch estimation of a target musical source within a multi-source polyphonic signal is of great interest for music performance analysis. One possible approach for extracting the pitch of a target source is to first perform ...