Benvinguts al Repositori Digital de la UPF

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Ordena per: Ordre: Resultats:

  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (ACL (Association for Computational Linguistics), 2017)
    This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. ...
  • Lotinac, Damir; Segovia-Aguas, Javier; Jiménez, Sergio; Jonsson, Anders, 1973- (Association for the Advancement of Artificial Intelligence (AAAI), 2016)
    In many domains generalized plans can only/nbe computed if certain high-level state features,/ni.e. features that capture key concepts to accurately/ndistinguish between states and make good decisions,/nare available. In ...
  • Wang, Xingce; Liu, Yue; Wu, Zhongke; Mou, Xiao; Zhou, Mingquan; González Ballester, Miguel Ángel, 1973-; Zhang, Chong (Springer, 2017)
    Identifcation of anatomical vessel branches is a prerequisite task for diagnosis, treatment and inter-subject comparison. We propose a novel graph labeling approach to anatomically label vascular structures of interest. ...
  • Dzhambazov, Georgi Bogomilov; Sentürk, Sertan; Serra, Xavier (Computer Engineering Department, Bogaziçi University, 2014)
    We apply a lyrics-to-audio alignment state-of-the-art approach to polyphonic pieces from classical Turkish repertoire. A phonetic recognizer is employed, whereby each phoneme is assigned a hidden Markov model (HMM). Initially ...
  • Demirel, Emir; Bozkurt, Baris; Serra, Xavier (Aristotle University of Thessaloniki, 2018)
    This work focuses on the automatic makam recognition task for Turkish Makam Music using chroma features. Chroma features are widely used for music identification and tonal recognition tasks such as key estimation or chord ...
  • Pragst, Louisa; Miehle, Juliana; Ultes, Stefan; Minker, Wolfgang (ACL (Association for Computational Linguistics), 2016)
    In task-oriented dialogues, there is often only one right answer the system can give. However, a lack of variation can seem repetitive and unnatural. Humans change the way they express something, e.g. by being more or less ...
  • Won, Minz; Chun, Sanghyuk; Nieto Caballero, Oriol; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2019)
    In this paper, we introduce the Harmonic Convolutional Neural Network (Harmonic CNN), a music representation model that exploits the inherent harmonic structure of audio signals. The proposed model outperforms previous ...
  • Slizovskaia, Olga; Gómez Gutiérrez, Emilia, 1975-; Haro Ortega, Gloria (Zentrum für Mikrotonale Musik und Multimediale Komposition (ZM4), Hochschule für Musik und Theater Hamburg, 2016)
    The goal of this work is to incorporate the visual modality into a musical instrument recognition system. For that, we first evaluate state-of-the-art image recognition techniques in the context of music instrument ...
  • Lai, Catherine; Farrús, Mireia; Moore, Johanna D. (International Speech Communication Association (ISCA), 2016)
    As long-form spoken documents become more ubiquitous in everyday life, so does the need for automatic discourse segmentation in spoken language processing tasks. Although previous work has focused on broad topic segmentation, ...
  • Ramoneda, Pedro; Jeong, Dasaem; Nakamura, Eita; Serra, Xavier; Miron, Marius (ACM Association for Computer Machinery, 2022)
    Piano fingering is a creative and highly individualised task acquired by musicians progressively in their first music education years. Pianists must learn to choose the order of fingers to play the piano keys because ...
  • Ferraro, Andrés; Bogdanov, Dmitry; Yoon, Jisang; Kim, KwangSeob; Serra, Xavier (ACM Association for Computer Machinery, 2018)
    The ACM RecSys Challenge 2018 focuses on music recommendation in the context of automatic playlist continuation. In this paper, we describe our approach to the problem and the final hybrid system that was submitted to the ...
  • Rodríguez, Sonia; Gómez Gutiérrez, Emilia, 1975-; Cuesta, Helena (Folk Music Analysis, 2018)
    This work deals with the automatic transcription and characterization of flamenco guitar, with a focus on short melodic interludes improvised between sung verses. These are called falsetas in the flamenco argot and are ...
  • Benetos, Emmanouil; Holzapfel, Andre (International Society for Music Information Retrieval (ISMIR), 2013)
    In this paper we propose an automatic system for transcribing/nmakam music of Turkey. We document the specific/ntraits of this music that deviate from properties that/nwere targeted by transcription tools so far and we ...
  • Fernandez-Lopez, Adriana; Sukno, Federico Mateo (SCITEPRESS, 2017)
    Speech is the most common communication method between humans and involves the perception of both auditory and visual channels. Automatic speech recognition focuses on interpreting the audio signals, but it has been ...
  • Zamir, Syed Waqas; Vazquez-Corral, Javier; Bertalmío, Marcelo (Society of Motion Picture & Television Engineers (SMPTE), 2017)
    Gamut mapping transforms colors of the original (image or video) content to the color palette of the display device with the simultaneous goals of (a) reproducing content accurately while preserving the artistic intent of ...
  • Moulin-Frier, Clément; Sánchez Fibla, Martí; Verschure, Paul F. M. J. (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    We provide an original computational model showing how turn-taking behaviors can self-organize out of sensorimotor/ninteractions between vocalizing agents. These agents are equipped with a cognitive architecture based on ...
  • Velamazán, Mariano; Santos, Patricia; Hernández Leo, Davinia (Springer, 2021)
    There are several awareness tools developed to research how to support different phases and modes of socio-emotional regulation of learning. Most of these tools have focused on only one mode of regulation (self-, co- or ...
  • Cortès, Guillem; Ciurana, Alex; Molina, Emilio; Miron, Marius; Meyers, Owen; Six, Joren; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2022)
    Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for continuous ...
  • Farrús, Mireia; Serra, Montse; Basart i Muñoz, Josep M.; Nadeu Camprubí, Climent (CIDUI Congrés Internacional de Docència Universitària i Innovació, 2016)
    En aquesta comunicació presentem una sèrie d'elements de protagonisme de l'alumne, treballats en una aula d'informàtica, amb l'objectiu de fomentar la participació de l'alumne a la classe. Aquests elements, que inclouen ...
  • Galdran, Adrian; Carneiro, Gustavo; González Ballester, Miguel Ángel, 1973- (Springer, 2021)
    Highly imbalanced datasets are ubiquitous in medical image classification problems. In such problems, it is often the case that rare classes associated to less prevalent diseases are severely under-represented in labeled ...

Cerca


Cerca avançada

Visualitza

El meu compte

Amb col·laboració de Complim Participem