Welcome to the UPF Digital Repository

Browsing Conferences (Department of Information and Communication Technologies) by publication date

  • Bonada, Jordi, 1973-; Loscos, Àlex; Cano, Pedro; Serra, Xavier; Kenmochi, Hideki (Audio Engineering Society, 2001)
    In this paper we present two different approaches to the modeling of the singing voice. Each of these approaches has been thought to fit in the specific requirements of two applications. These are an automatic voice impersonator ...
  • Bonada, Jordi, 1973-; Celma Herrada, Òscar; Loscos, Àlex; Ortolà, Jaume; Serra, Xavier; Yoshioka, Yasuo; Kayama, Hiraku; Hisaminato, Yuji; Kenmochi, Hideki (International Computer Music Conference, 2001)
    This paper presents an approach to the modeling of the singing voice with a particular emphasis on the naturalness of the resulting synthetic voice. The underlying analysis/synthesis technique is based on the Spectral ...
  • Amatriain, Xavier; Bonada, Jordi, 1973-; Loscos, Àlex; Serra, Xavier (2001)
    When designing audio effects for music processing, we are always aiming at providing higher-level representations that may somehow fill in the gap between the signal processing world and the end-user. Spectral models in ...
  • Serra, Xavier (International Computer Music Conference, 2004)
    The Music Technology Group, MTG, integrated in the Audiovisual Institute of the Universitat Pompeu Fabra of Barcelona, specializes in audio processing technologies and their music and multimedia applications. With more ...
  • Wright, Matthew; Dannenberg, Roger; Pope, Stephen; Rodet, Xavier; Serra, Xavier; Wessel, David (International Computer Music Conference, 2004)
    This panel discussion will review the standards that the computer music community has produced and how these standards were created, followed by a guided interactive group discussion about future directions for our ...
  • Farrús, Mireia; Anguita, Jan; Anguera Miró, Xavier, 1978-; Crego, Josep Maria; de Gispert, A.; Hernando, Javier; Nadeu Camprubí, Climent (2004)
    The current view of the information society revolves mainly around written language. However, it is clear that the most natural and spontaneous form of communication between human beings is speech, and not ...
  • Anguera Miró, Xavier, 1978-; Farrús, Mireia; Hernando, Javier (2004)
    The evolution of the information society has led to a steady increase in audiovisual content broadcast continuously on local and national television channels and radio stations in Catalan. ...
  • Cano, Pedro; Koppenberger, Markus; Wack, Nicolas (ACM Association for Computing Machinery, 2005)
    We present the MusicSurfer, a metadata free system for the interaction with massive collections of music. MusicSurfer automatically extracts descriptions related to instrumentation, rhythm and harmony from music audio ...
  • Cano, Pedro; Koppenberger, Markus; Wack, Nicolas (ACM Association for Computing Machinery, 2005)
    We present a metadata free system for the interaction with massive collections of music, the MusicSurfer. MusicSurfer automatically extracts descriptions related to instrumentation, rhythm and harmony from music audio ...
  • Herrera Boyer, Perfecto; Bello, Juan; Widmer, Gerhard; Sandler, Mark; Celma Herrada, Òscar; Vignoli, Fabio; Pampalk, Elias; Cano, Pedro; Pauws, Steffen; Serra, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2005)
    The SIMAC project addresses the study and development of innovative components for a music information retrieval system. The key feature is the usage and exploitation of semantic descriptors of musical content that are ...
  • Serra, Xavier (International Computer Music Conference, 2005)
    In this article I give a personal view on what could be a roadmap for the research in Music Technology. I will describe the context of this research, the current state of the art, the challenges that lie ahead and I will ...
  • Hernando, Javier; Farrús, Mireia; Ejarque, Pascual; Garde, Ainara; Luque, Jordi; Paper presented at: SECRYPT 2006, International Conference on Security and Cryptography, held in Setúbal, Portugal, 7-10 August 2006 (INSTICC Press, 2006)
    Prosodic information can be used successfully for automatic speaker recognition, although most of the speaker recognition systems use only short-term spectral features as voice information. In this work, prosody information ...
  • Farrús, Mireia; Luque, Jordi; Morros, R.; Anguita, Jan; Macho, D.; Marqués, M.; Martínez, C.; Vilaplana, V.; Hernando, Javier (Universidad de Zaragoza, 2006)
    In this paper we present a person identification system based on a combination of acoustic features and 2D face images. We address the modality integration issue on the example of a smart room environment. In order to ...
  • Farrús, Mireia; Garde, Ainara; Ejarque, Pascual; Luque, Jordi; Hernando, Javier (International Speech Communication Association (ISCA), 2006)
    Paper presented at: 9th International Conference on Spoken Language Processing; 17-21 September 2006, Pittsburgh, United States of America
  • Celma Herrada, Òscar; Herrera Boyer, Perfecto; Serra, Xavier (CEUR Workshop Proceedings, 2006)
    In this paper we present the music information plane and the different levels of information extraction that exist in the musical domain. Based on this approach we propose a way to overcome the existing semantic gap in ...
  • Luque, Jordi; Morros, R.; Garde, I.; Anguita, Jan; Farrús, Mireia; Macho, D.; Marqués, F.; Martínez, C.; Vilaplana, V.; Hernando, Javier (Springer, 2007)
    In this paper, we address the modality integration issue on the example of a smart room environment aiming at enabling person identification by combining speech and 2D face images. First we introduce the monomodal audio ...
  • Farrús, Mireia; Ejarque, Pascual; Temko, Andrey; Hernando, Javier (Springer, 2007)
    It has been shown that prosody helps to improve voice spectrum based speaker recognition systems. Therefore, prosodic features can also be used in multimodal person verification in order to achieve better results. In this ...
  • Serra, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2007)
    Sound synthesis and processing has been the most active research topic in the field of sound and music computing for more than 40 years. Quite a number of the early research results are now standard components of many audio ...
  • Serra, Xavier (International Conference on Digital Audio Effects, 2007)
    DAFX is an established conference that has become a reference gathering for the researchers working on audio signal processing. In this presentation I will go back ten years to the beginning of this conference and to the ...
  • Cerdà, Ramon; Farrús, Mireia; Hernando, Javier; Veyrat Rigat, Montserrat (Dirección Xeral de Creación e Difusión Cultural, 2007)
    This paper does not yet report results from any completed experiment but, as its title indicates, offers a series of comments on the notion of a "phonemic dispersion field" (or "phonological" dispersion field) or ...