Benvinguts al Repositori Digital de la UPF

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Ordena per: Ordre: Resultats:

  • Ferraro, Andrés; Jeon, Jea Ho; Kim, Biho; Serra, Xavier; Bogdanov, Dmitry (ICML, 2020)
    To evaluate if the recommendations are fair, we have to consider how all the stakeholders are affected. In this work, we focus on the artists in the music domain. We analyze the recommendations made with Collaborative ...
  • Yesiler, Furkan; Serrà, Joan; Miron, Marius; Gómez Gutiérrez, Emilia, 1975- (ACM Association for Computer Machinery, 2022)
    Version identification (VI) systems now offer accurate and scalable solutions for detecting different renditions of a musical composition, allowing the use of these systems in industrial applications and throughout the ...
  • Pérez-Mayos, Laura; Táboas García, Alba; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)
    Multilingual Transformer-based language models, usually pretrained on more than 100 languages, have been shown to achieve outstanding results in a wide range of crosslingual transfer tasks. However, it remains unknown ...
  • Serrà Julià, Joan; Koduri, Gopala Krishna; Miron, Marius; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2011)
    The issue of tuning in Indian classical music has been, historically,/na matter of theoretical debate. In this paper, we/nstudy its contemporary practice in sung performances of Carnatic/nand Hindustani music following an ...
  • Giménez Mínguez, Paula; Bijnens, Bart; Bernardino Perez, Gabriel; Lluch Alvarez, Èric; Soveral, Iris; Gómez, Olga; García Cañadilla, Patricia, 1985- (Springer, 2017)
    Aortic coarctation is one of the most difficult cardiac defects to diagnose before birth, and it accounts for 8% of congenital heart diseases. Antenatal diagnosis is crucial for early treatment of the neonate and to decrease ...
  • Tauste Campo, Adrià, 1982-; Biglieri, Ezio (National Institute of Information and Communications Technology (NICT), 2008)
    We examine a multiple-access communication system in which multiuser detection is performed without knowledge of the number of active interferers. Using a statistical-physics approach, we compute the single-user channel ...
  • Font-Segura, Josep; Vázquez, Gregori; Riba, Jaume (Institute of Electrical and Electronics Engineers (IEEE), 2012)
    The performance in signal detection is evaluated by the error (false-alarm and missed-detection) probabilities.However,calculating these probabilities is a difficult taskin practice. This paper studies the asymptotic ...
  • Font-Segura, Josep; Martínez, Alfonso, 1973-; Guillén i Fábregas, A. (Albert) (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    This paper provides an asymptotic expansion of the error probability, as the codeword length n goes to infinity, in quasi-static binary symmetric channels. After the leading term, namely the outage probability, the next ...
  • Font-Segura, Josep; Martínez, Alfonso, 1973-; Guillén i Fábregas, A. (Albert) (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Saddlepoint approximations to the error probability are derived for multiple-cost-constrained random coding ensembles where codewords satisfy a set of constraints. Constantcomposition inputs over a binary symmetric channel ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)
    Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
  • Font Corbera, Frederic; Serrà Julià, Joan; Serra, Xavier (Audio Engineering Society, 2014)
    Methods for automatic sound and music classification are of great value when trying to organise the large amounts of unstructured, user-contributed audio content uploaded to online sharing platforms. Currently, most of ...
  • Font Corbera, Frederic; Brookes, Tim; Fazekas, George; Guerber, Martin; La Burthe, Amaury; Plans, David; Plumbley, Mark D.; Shaashua, Meir; Wang, Wenwu; Serra, Xavier (Audio Engineering Society, 2016)
    Significant amounts of user-generated audio content, such as sound effects, musical samples and music pieces, are uploaded to online repositories and made available under open licenses. Moreover, a constantly increasing ...
  • Herrera Boyer, Perfecto, 1964-; Serra, Xavier; Peeters, Geoffroy (International Computer Music Conference, 1999)
    Sound content description is one of the aims of the MPEG-7 initiative. Although MPEG-7 focuses on indexing and retrieval of audio, there are other sound content-based processing applications waiting to be developed once ...
  • Atli, Hasan Sercan; Uyar, Burak; Sentürk, Sertan; Bozkurt, Baris; Serra, Xavier (2014)
    For Turkish makam music, there exist several analysis tools which generally use only the audio as the input to extract the features of the audio. This study aims at extending such approach by using additional features such ...
  • Gong, Rong; Pons Puig, Jordi; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    We approach the singing phrase audio to score matching problem by using phonetic and duration information – with a focus on studying the jingju a cappella singing case. We argue that, due to the existence of a basic melodic ...
  • Luque, Jordi; Morros, R.; Garde, I.; Anguita, Jan; Farrús, Mireia; Macho, D.; Marqués López, Fernando; Martínez, C.; Vilaplana, Verónica; Hernando, Javier (Springer, 2007)
    In this paper, we address the modality integration issue on the example of a smart room environment aiming at enabling person identification by combining speech and 2D face images. First we introduce the monomodal audio ...
  • Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2018)
    In this paper we present a new dataset of time-aligned jazz harmony transcriptions. This dataset is a useful resource for content-based analysis, especially for training and evaluating chord transcription algorithms. Most ...
  • Papiotis, Panagiotis, 1985-; Herrera Boyer, Perfecto, 1964-; Marchini, Marco, 1984-; Maestre Gómez, Esteban (University of Jyväskylä, 2013)
    In a musical ensemble musicians can influence each other’s performance in terms not only of timing but also in other aspects of the performance such as dynamics, intonation, and timbre. The goal of this work is to test ...
  • Dzhambazov, Georgi Bogomilov; Yang, Yile; Caro Repetto, Rafael; Serra, Xavier (International Workshop on Folk Music Analysis, 2016)
    In this study we propose how to modify a standard approach for text-to-speech alignment to apply in the case of alignment of lyrics and singing voice. We model phoneme durations by means of a duration-explicit hidden Markov ...
  • Costa-jussà, Marta R.; Farrús, Mireia; Mariño Acebal, José B.; Fonollosa, José A Rodriguez (European Language Resources Association (ELRA), 2010)
    Machine translation systems can be classified into rule-based and corpus-based approaches, in terms of their core technology. Since both paradigms have largely been used during the last years, one of the aims in the research ...

Cerca


Cerca avançada

Visualitza

El meu compte

Amb col·laboració de Complim Participem