Inici
→
Recerca: articles, congressos, llibres
→
Departament de Tecnologies de la Informació i les Comunicacions
→
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
→
Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Ara mostrant els elements 143-162 de 1060

Artist biases in collaborative filtering for music recommendation

Ferraro, Andrés; Jeon, Jea Ho; Kim, Biho; Serra, Xavier; Bogdanov, Dmitry (ICML, 2020)

To evaluate if the recommendations are fair, we have to consider how all the stakeholders are affected. In this work, we focus on the artists in the music domain. We analyze the recommendations made with Collaborative ...
Assessing algorithmic biases for musical version identification

Yesiler, Furkan; Serrà, Joan; Miron, Marius; Gómez Gutiérrez, Emilia, 1975- (ACM Association for Computer Machinery, 2022)

Version identification (VI) systems now offer accurate and scalable solutions for detecting different renditions of a musical composition, allowing the use of these systems in industrial applications and throughout the ...
Assessing the syntactic capabilities of transformer-based multilingual language models

Pérez-Mayos, Laura; Táboas García, Alba; Mille, Simon; Wanner, Leo (ACL (Association for Computational Linguistics), 2021)

Multilingual Transformer-based language models, usually pretrained on more than 100 languages, have been shown to achieve outstanding results in a wide range of crosslingual transfer tasks. However, it remains unknown ...
Assessing the tuning of sung indian classical music

Serrà Julià, Joan; Koduri, Gopala Krishna; Miron, Marius; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2011)

The issue of tuning in Indian classical music has been, historically,/na matter of theoretical debate. In this paper, we/nstudy its contemporary practice in sung performances of Carnatic/nand Hindustani music following an ...
Assessment of haemodynamic remodeling in fetal aortic coarctation using a lumped model of the circulation

Giménez Mínguez, Paula; Bijnens, Bart; Bernardino Perez, Gabriel; Lluch Alvarez, Èric; Soveral, Iris; Gómez, Olga; García Cañadilla, Patricia, 1985- (Springer, 2017)

Aortic coarctation is one of the most difficult cardiac defects to diagnose before birth, and it accounts for 8% of congenital heart diseases. Antenatal diagnosis is crucial for early treatment of the neonate and to decrease ...
Asymptotic capacity of static multiuser channels with an unknown number of users

Tauste Campo, Adrià, 1982-; Biglieri, Ezio (National Institute of Information and Communications Technology (NICT), 2008)

We examine a multiple-access communication system in which multiuser detection is performed without knowledge of the number of active interferers. Using a statistical-physics approach, we compute the single-user channel ...
Asymptotic error exponents in energy-detector and estimator-correlator signal detection

Font-Segura, Josep; Vázquez, Gregori; Riba, Jaume (Institute of Electrical and Electronics Engineers (IEEE), 2012)

The performance in signal detection is evaluated by the error (false-alarm and missed-detection) probabilities.However,calculating these probabilities is a difficult taskin practice. This paper studies the asymptotic ...
Asymptotics of the error probability in quasi-static binary symmetric channels

Font-Segura, Josep; Martínez, Alfonso, 1973-; Guillén i Fábregas, A. (Albert) (Institute of Electrical and Electronics Engineers (IEEE), 2017)

This paper provides an asymptotic expansion of the error probability, as the codeword length n goes to infinity, in quasi-static binary symmetric channels. After the leading term, namely the outage probability, the next ...
Asymptotics of the random coding error probability for constant-composition codes

Font-Segura, Josep; Martínez, Alfonso, 1973-; Guillén i Fábregas, A. (Albert) (Institute of Electrical and Electronics Engineers (IEEE), 2019)

Saddlepoint approximations to the error probability are derived for multiple-cost-constrained random coding ensembles where codewords satisfy a set of constraints. Constantcomposition inputs over a binary symmetric channel ...
Attentional parallel RNNs for generating punctuation in transcribed speech

Öktem, Alp; Farrús, Mireia; Wanner, Leo (Springer, 2017)

Until very recently, the generation of punctuation marks for automatic speech recognition (ASR) output has been mostly done by looking at the syntactic structure of the recognized utterances. Prosodic cues such as breaks, ...
Audio clip classification using social tags and the effect of tag expansion

Font Corbera, Frederic; Serrà Julià, Joan; Serra, Xavier (Audio Engineering Society, 2014)

Methods for automatic sound and music classification are of great value when trying to organise the large amounts of unstructured, user-contributed audio content uploaded to online sharing platforms. Currently, most of ...
Audio Commons: Bringing Creative Commons audio content to the creative industries

Font Corbera, Frederic; Brookes, Tim; Fazekas, George; Guerber, Martin; La Burthe, Amaury; Plans, David; Plumbley, Mark D.; Shaashua, Meir; Wang, Wenwu; Serra, Xavier (Audio Engineering Society, 2016)

Significant amounts of user-generated audio content, such as sound effects, musical samples and music pieces, are uploaded to online repositories and made available under open licenses. Moreover, a constantly increasing ...
Audio descriptors and descriptor schemes in the context of MPEG-7

Herrera Boyer, Perfecto, 1964-; Serra, Xavier; Peeters, Geoffroy (International Computer Music Conference, 1999)

Sound content description is one of the aims of the MPEG-7 initiative. Although MPEG-7 focuses on indexing and retrieval of audio, there are other sound content-based processing applications waiting to be developed once ...
Audio feature extraction for exploring Turkish makam music

Atli, Hasan Sercan; Uyar, Burak; Sentürk, Sertan; Bozkurt, Baris; Serra, Xavier (2014)

For Turkish makam music, there exist several analysis tools which generally use only the audio as the input to extract the features of the audio. This study aims at extending such approach by using additional features such ...
Audio to score matching by combining phonetic and duration information

Gong, Rong; Pons Puig, Jordi; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)

We approach the singing phrase audio to score matching problem by using phonetic and duration information – with a focus on studying the jingju a cappella singing case. We argue that, due to the existence of a basic melodic ...
Audio, video and multimodal person identification in a smart room

Luque, Jordi; Morros, R.; Garde, I.; Anguita, Jan; Farrús, Mireia; Macho, D.; Marqués López, Fernando; Martínez, C.; Vilaplana, Verónica; Hernando, Javier (Springer, 2007)

In this paper, we address the modality integration issue on the example of a smart room environment aiming at enabling person identification by combining speech and 2D face images. First we introduce the monomodal audio ...
Audio-aligned jazz harmony dataset for automatic chord transcription and corpus-based research

Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2018)

In this paper we present a new dataset of time-aligned jazz harmony transcriptions. This dataset is a useful resource for content-based analysis, especially for training and evaluating chord transcription algorithms. Most ...
Aural-based detection and assessment of real versus artificially synchronized string quartet performance

Papiotis, Panagiotis, 1985-; Herrera Boyer, Perfecto, 1964-; Marchini, Marco, 1984-; Maestre Gómez, Esteban (University of Jyväskylä, 2013)

In a musical ensemble musicians can influence each other’s performance in terms not only of timing but also in other aspects of the performance such as dynamics, intonation, and timbre. The goal of this work is to test ...
Automatic alignment of long syllables in a cappella Beijing opera

Dzhambazov, Georgi Bogomilov; Yang, Yile; Caro Repetto, Rafael; Serra, Xavier (International Workshop on Folk Music Analysis, 2016)

In this study we propose how to modify a standard approach for text-to-speech alignment to apply in the case of alignment of lyrics and singing voice. We model phoneme durations by means of a duration-explicit hidden Markov ...
Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems

Costa-jussà, Marta R.; Farrús, Mireia; Mariño Acebal, José B.; Fonollosa, José A Rodriguez (European Language Resources Association (ELRA), 2010)

Machine translation systems can be classified into rule-based and corpus-based approaches, in terms of their core technology. Since both paradigms have largely been used during the last years, one of the aims in the research ...