Browsing Conferences (Department of Information and Communication Technologies) by title

Now showing items 171-190 of 1060

Automatic extraction of parallel speech corpora from dubbed movies

Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)

This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
Automatic generation of high-level state features for generalized planning

Lotinac, Damir; Segovia-Aguas, Javier; Jiménez, Sergio; Jonsson, Anders, 1973- (Association for the Advancement of Artificial Intelligence (AAAI), 2016)

In many domains generalized plans can only be computed if certain high-level state features, i.e. features that capture key concepts to accurately distinguish between states and make good decisions, are available. In ...
Automatic labeling of vascular structures with topological constraints via HMM

Wang, Xingce; Liu, Yue; Wu, Zhongke; Mou, Xiao; Zhou, Mingquan; González Ballester, Miguel Ángel, 1973-; Zhang, Chong (Springer, 2017)

Identification of anatomical vessel branches is a prerequisite task for diagnosis, treatment and inter-subject comparison. We propose a novel graph labeling approach to anatomically label vascular structures of interest. ...
Automatic lyrics-to-audio alignment in classical Turkish music

Dzhambazov, Georgi Bogomilov; Sentürk, Sertan; Serra, Xavier (Computer Engineering Department, Bogaziçi University, 2014)

We apply a lyrics-to-audio alignment state-of-the-art approach to polyphonic pieces from classical Turkish repertoire. A phonetic recognizer is employed, whereby each phoneme is assigned a hidden Markov model (HMM). Initially ...
Automatic Makam recognition using chroma features

Demirel, Emir; Bozkurt, Baris; Serra, Xavier (Aristotle University of Thessaloniki, 2018)

This work focuses on the automatic makam recognition task for Turkish Makam Music using chroma features. Chroma features are widely used for music identification and tonal recognition tasks such as key estimation or chord ...
Automatic modification of communication style in dialogue management

Pragst, Louisa; Miehle, Juliana; Ultes, Stefan; Minker, Wolfgang (ACL (Association for Computational Linguistics), 2016)

In task-oriented dialogues, there is often only one right answer the system can give. However, a lack of variation can seem repetitive and unnatural. Humans change the way they express something, e.g. by being more or less ...
Automatic music tagging with Harmonic CNN

Won, Minz; Chun, Sanghyuk; Nieto Caballero, Oriol; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2019)

In this paper, we introduce the Harmonic Convolutional Neural Network (Harmonic CNN), a music representation model that exploits the inherent harmonic structure of audio signals. The proposed model outperforms previous ...
Automatic musical instrument recognition in audiovisual recordings by combining image and audio classification strategies

Slizovskaia, Olga; Gómez Gutiérrez, Emilia, 1975-; Haro Ortega, Gloria (Zentrum für Mikrotonale Musik und Multimediale Komposition (ZM4), Hochschule für Musik und Theater Hamburg, 2016)

The goal of this work is to incorporate the visual modality into a musical instrument recognition system. For that, we first evaluate state-of-the-art image recognition techniques in the context of music instrument ...
Automatic paragraph segmentation with lexical and prosodic features

Lai, Catherine; Farrús, Mireia; Moore, Johanna D. (International Speech Communication Association (ISCA), 2016)

As long-form spoken documents become more ubiquitous in everyday life, so does the need for automatic discourse segmentation in spoken language processing tasks. Although previous work has focused on broad topic segmentation, ...
Automatic piano fingering from partially annotated scores using autoregressive neural networks

Ramoneda, Pedro; Jeong, Dasaem; Nakamura, Eita; Serra, Xavier; Miron, Marius (ACM (Association for Computing Machinery), 2022)

Piano fingering is a creative and highly individualised task acquired by musicians progressively in their first music education years. Pianists must learn to choose the order of fingers to play the piano keys because ...
Automatic playlist continuation using a hybrid recommender system combining features from text and audio

Ferraro, Andrés; Bogdanov, Dmitry; Yoon, Jisang; Kim, KwangSeob; Serra, Xavier (ACM (Association for Computing Machinery), 2018)

The ACM RecSys Challenge 2018 focuses on music recommendation in the context of automatic playlist continuation. In this paper, we describe our approach to the problem and the final hybrid system that was submitted to the ...
Automatic transcription of flamenco guitar falsetas

Rodríguez, Sonia; Gómez Gutiérrez, Emilia, 1975-; Cuesta, Helena (Folk Music Analysis, 2018)

This work deals with the automatic transcription and characterization of flamenco guitar, with a focus on short melodic interludes improvised between sung verses. These are called falsetas in the flamenco argot and are ...
Automatic transcription of Turkish makam music

Benetos, Emmanouil; Holzapfel, Andre (International Society for Music Information Retrieval (ISMIR), 2013)

In this paper we propose an automatic system for transcribing makam music of Turkey. We document the specific traits of this music that deviate from properties that were targeted by transcription tools so far and we ...
Automatic viseme vocabulary construction to enhance continuous lip-reading

Fernandez-Lopez, Adriana; Sukno, Federico Mateo (SCITEPRESS, 2017)

Speech is the most common communication method between humans and involves the perception of both auditory and visual channels. Automatic speech recognition focuses on interpreting the audio signals, but it has been ...
Automatic, fast and perceptually accurate gamut mapping based on vision science models

Zamir, Syed Waqas; Vazquez-Corral, Javier; Bertalmío, Marcelo (Society of Motion Picture & Television Engineers (SMPTE), 2017)

Gamut mapping transforms colors of the original (image or video) content to the color palette of the display device with the simultaneous goals of (a) reproducing content accurately while preserving the artistic intent of ...
Autonomous development of turn-taking behaviors in agent populations: a computational study

Moulin-Frier, Clément; Sánchez Fibla, Martí; Verschure, Paul F. M. J. (Institute of Electrical and Electronics Engineers (IEEE), 2015)

We provide an original computational model showing how turn-taking behaviors can self-organize out of sensorimotor interactions between vocalizing agents. These agents are equipped with a cognitive architecture based on ...
Awareness tools for monitoring socio-emotional regulation during collaboration in settings outside school without teacher supervision

Velamazán, Mariano; Santos, Patricia; Hernández Leo, Davinia (Springer, 2021)

There are several awareness tools developed to research how to support different phases and modes of socio-emotional regulation of learning. Most of these tools have focused on only one mode of regulation (self-, co- or ...
BAF: an audio fingerprinting dataset for broadcast monitoring

Cortès, Guillem; Ciurana, Alex; Molina, Emilio; Miron, Marius; Meyers, Owen; Six, Joren; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2022)

Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for continuous ...
Baixant de la tarima: una experiència a l'aula on l'alumne pren la paraula

Farrús, Mireia; Serra, Montse; Basart i Muñoz, Josep M.; Nadeu Camprubí, Climent (CIDUI Congrés Internacional de Docència Universitària i Innovació, 2016)

In this paper we present a series of elements that give students a leading role, developed in a computer science classroom, with the aim of encouraging student participation in class. These elements, which include ...
Balanced-mixup for highly imbalanced medical image classification

Galdran, Adrian; Carneiro, Gustavo; González Ballester, Miguel Ángel, 1973- (Springer, 2021)

Highly imbalanced datasets are ubiquitous in medical image classification problems. In such problems, it is often the case that rare classes associated to less prevalent diseases are severely under-represented in labeled ...