Welcome to the UPF Digital Repository

Browsing Congressos (Departament de Tecnologies de la Informació i les Comunicacions) by Issue Date

  • Herrera Boyer, Perfecto; Amatriain, Xavier; Batlle, Eloi; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2000)
    A system capable of describing the musical content of any kind of sound file or sound stream, as it is supposed to be done in MPEG7-compliant applications, should provide an account of the different moments where a certain ...
  • Bonada, Jordi, 1973-; Loscos, Àlex; Cano, Pedro; Serra, Xavier; Kenmochi, Hideki (Audio Engineering Society, 2001)
    In this paper we present two different approaches to the modeling of the singing voice. Each of these approaches has been thought to fit in the specific requirements of two applications. These are an automatic voice impersonator ...
  • Bonada, Jordi, 1973-; Celma Herrada, Òscar; Loscos, Àlex; Ortolà, Jaume; Serra, Xavier; Yoshioka, Yasuo; Kayama, Hiraku; Hisaminato, Yuji; Kenmochi, Hideki (International Computer Music Conference, 2001)
    This paper presents an approach to the modeling of the singing voice with a particular emphasis on the naturalness of the resulting synthetic voice. The underlying analysis/synthesis technique is based on the Spectral ...
  • Amatriain, Xavier; Bonada, Jordi, 1973-; Loscos, Àlex; Serra, Xavier (2001)
    When designing audio effects for music processing, we are always aiming at providing higher-level representations that may somehow fill in the gap between the signal processing world and the end-user. Spectral models in ...
  • Wright, Matthew; Dannenberg, Roger; Pope, Stephen; Rodet, Xavier; Serra, Xavier; Wessel, David (International Computer Music Conference, 2004)
    This panel discussion will review the standards that the computer music community has produced and how these standards were created, followed by a guided interactive group discussion about future directions for our ...
  • Serra, Xavier (International Computer Music Conference, 2004)
    The Music Technology Group, MTG, integrated in the Audiovisual Institute of the Universitat Pompeu Fabra of Barcelona, specializes in audio processing technologies and their music and multimedia applications. With more ...
  • Anguera Miró, Xavier, 1978-; Farrús, Mireia; Hernando, Javier (2004)
    The evolution of the information society has led to a steady increase in audiovisual content that is constantly broadcast on local and national Catalan-language television channels and radio stations. ...
  • Farrús, Mireia; Anguita, Jan; Anguera Miró, Xavier, 1978-; Crego, Josep Maria; de Gispert, A.; Hernando, Javier; Nadeu Camprubí, Climent (2004)
    The current view of the information society revolves fundamentally around written language. However, it is evident that the most natural and spontaneous form of communication between human beings is speech, and not ...
  • Serra, Xavier (International Computer Music Conference, 2005)
    In this article I give a personal view on what could be a roadmap for the research in Music Technology. I will describe the context of this research, the current state of the art, the challenges that lie ahead and I will ...
  • Cano, Pedro; Koppenberger, Markus; Wack, Nicolas (ACM Association for Computing Machinery, 2005)
    We present the MusicSurfer, a metadata free system for the interaction with massive collections of music. MusicSurfer automatically extracts descriptions related to instrumentation, rhythm and harmony from music audio ...
  • Herrera Boyer, Perfecto; Bello, Juan; Widmer, Gerhard; Sandler, Mark; Celma Herrada, Òscar; Vignoli, Fabio; Pampalk, Elias; Cano, Pedro; Pauws, Steffen; Serra, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2005)
    The SIMAC project addresses the study and development of innovative components for a music information retrieval system. The key feature is the usage and exploitation of semantic descriptors of musical content that are ...
  • Parés, Narcís, 1966- (ACM Association for Computing Machinery, 2005)
    This paper identifies and contextualizes the limitations and problems found in interactive installations that require a massive flux of users. It then presents a set of solutions for these practical problems and shows how ...
  • Cano, Pedro; Koppenberger, Markus; Wack, Nicolas (ACM Association for Computing Machinery, 2005)
    We present a metadata free system for the interaction with massive collections of music, the MusicSurfer. MusicSurfer automatically extracts descriptions related to instrumentation, rhythm and harmony from music audio ...
  • Farrús, Mireia; Luque, Jordi; Morros, R.; Anguita, Jan; Macho, D.; Marqués, Marta; Martínez, C.; Vilaplana, V.; Hernando, Javier (Universidad de Zaragoza, 2006)
    In this paper we present a person identification system based on a combination of acoustic features and 2D face images. We address the modality integration issue on the example of a smart room environment. In order to ...
  • Farrús, Mireia; Garde, Ainara; Ejarque, Pascual; Luque, Jordi; Hernando, Javier (International Speech Communication Association (ISCA), 2006)
    Paper presented at: 9th International Conference on Spoken Language Processing; 17-21 September 2006, Pittsburgh, United States of America
  • Hernando, Javier; Farrús, Mireia; Ejarque, Pascual; Garde, Ainara; Luque, Jordi (INSTICC Press, 2006). Paper presented at: SECRYPT 2006, International Conference on Security and Cryptography, held in Setúbal, Portugal, 7-10 August 2006.
    Prosodic information can be used successfully for automatic speaker recognition, although most of the speaker recognition systems use only short-term spectral features as voice information. In this work, prosody information ...
  • Celma Herrada, Òscar; Herrera Boyer, Perfecto; Serra, Xavier (CEUR Workshop Proceedings, 2006)
    In this paper we present the music information plane and the different levels of information extraction that exist in the musical domain. Based on this approach we propose a way to overcome the existing semantic gap in ...
  • Puiggròs, Montserrat; Gómez Gutiérrez, Emilia, 1975-; Ramírez, Rafael, 1966-; Serra, Xavier; Bresin, Roberto (Bononia University Press, 2006)
    Expressive performance characterization is traditionally based on the analysis of the main differences between performances, players, playing styles and emotional intentions. This work addresses the characterization of ...
  • Luque, Jordi; Morros, R.; Garde, I.; Anguita, Jan; Farrús, Mireia; Macho, D.; Marqués López, Fernando; Martínez, C.; Vilaplana, V.; Hernando, Javier (Springer, 2007)
    In this paper, we address the modality integration issue on the example of a smart room environment aiming at enabling person identification by combining speech and 2D face images. First we introduce the monomodal audio ...
  • Farrús, Mireia; Ejarque, Pascual; Temko, Andrey; Hernando, Javier (Springer, 2007)
    It has been shown that prosody helps to improve voice spectrum based speaker recognition systems. Therefore, prosodic features can also be used in multimodal person verification in order to achieve better results. In this ...
