Benvinguts al Repositori Digital de la UPF

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Visualitzant Congressos (Departament de Tecnologies de la Informació i les Comunicacions) per títol

Ordena per: Ordre: Resultats:

  • Farrús, Mireia; Lai, Catherine; Moore, Johanna D. (International Speech Communication Association, 2016)
    Speech synthesis has improved in both expressiveness and voice quality in recent years. However, obtaining full expressiveness when dealing with large multi-sentential synthesized discourse is still a challenge, since ...
  • Smith, Julius O.; Serra, Xavier (International Computer Music Conference, 1987)
    This paper describes a peak-tracking spectrum analyzer, called PARSHL, which is useful for extracting additive synthesis parameters from inharmonic sounds such as the piano. PARSHL is based on the Short-Time Fourier Transform ...
  • Cyriac, Praveen; Kane, David; Bertalmío, Marcelo (The British Machine Vision Association (BMVA), 2015)
    Digital cameras apply a non-linearity to the captured sensor values prior to quantisation. This process is known as perceptual linearisation and ensures that the quantisation rate is approximately proportional to human ...
  • Bertalmío, Marcelo; Vazquez-Corral, Javier (Color Science Association of Japan (CSAJ), 2015)
    Gamut mapping transforms the color of an input image within the range of a target device. A huge amount of research has been devoted to two subproblems that arise from this general one: gamut reduction and gamut extension. ...
  • Moreno, Verónica; Bellalta, Boris; Infante, Jorge; Piella Fenoy, Gemma; Frangi Caregnato, Alejandro (2009-07-03)
    L’Escola Superior Politècnica (ESUP) de la Universitat Pompeu Fabra (UPF) ha realitzat una anàlisi per conèixer el perfil dels estudiants que s’incorporen a les Enginyeries de Telecomunicacions i Informàtica contemplant ...
  • Mungara, Ratheesh K.; Zhang, Xinchen; Lozano Solsona, Angel; Heath, R. W., Jr. (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    We present a performance evaluation of ITLinQ/nand FlashLinQ, the two most popular schemes proposed to date/nto channelize D2D transmissions, i.e., to parse transmissions/ninto noninterfering sets to be allocated to separate ...
  • Hernando, Javier; Farrús, Mireia; Ejarque, Pascual; Garde, Ainara; Luque, Jordi; Comunicació presentada a: SECRYPT 2006, International Conference on Security and Cryptography, celebrada a Setúbal, Portugal, del 7 al 10 d'agost de 2006 (INSTICC Press, 2006)
    Prosodic information can be used successfully for automatic speaker recognition, although most of the speaker recognition systems use only short-term spectral features as voice information. In this work, prosody information ...
  • Serra, Xavier (International Computer Music Conference, 1994)
    Phonos Foundation is a center located in the city of Barcelona. It promotes the use of technology in music, combining creation and research. Phonos offers musicians, engineers and scientists a working environment based on ...
  • Gulati, Sankalp; Serrà Julià, Joan; Ishwar, Vignesh; Sentürk, Sertan; Serra, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Automatic raga recognition is one of the fundamental computational tasks in Indian art music. Motivated by the way seasoned listeners identify ragas, we propose a raga recognition approach based on melodic phrases. Firstly, ...
  • Gong, Rong; Yang, Yile; Serra, Xavier (Zentrum für Mikrotonale Musik und Multimediale Komposition (ZM4) Hochschule für Musik und Theate, 2016)
    Imitation is the main approach of jingju (also known as Beijing opera) singing training through its inheritance of nearly 200 years. Students learn singing by receiving auditory and gestural feedback cues. The aim of ...
  • McFee, Brian; Humphrey, Eric J.; Urbano, Julián (International Society for Music Information Retrieval (ISMIR), 2016)
    The Music Information Retrieval Evaluation eXchange (MIREX) is a valuable community service, having established standard datasets, metrics, baselines, methodologies, and infrastructure for comparing MIR methods. While MIREX ...
  • Bonet, Blai; Geffner, Héctor (Association for the Advancement of Artificial Intelligence (AAAI), 2011)
    Planning with partial observability can be formulated as a non-deterministic search problem in belief space. The problem is harder than classical planning as keeping track of beliefs is harder than keeping track of states, ...
  • Gómez, Vicenç; Kappen, Hilbert J.; Peters, Jan; Neumann, Gerhard (Springer, 2014)
    Path integral (PI) control defines a general class of control problems for which the optimal control computation is equivalent to an inference problem that can be solved by evaluation of a path integral over state trajectories. ...
  • Domínguez Bajo, Mónica; Latorre, Iván; Farrús, Mireia; Codina-Filbà, Joan; Wanner, Leo (COLING, 2016)
    This paper presents an implementation of the widely used speech analysis tool Praat as a web application with an extended functionality for feature annotation. In particular, Praat on the Web addresses some of the central ...
  • Moreno, Verónica; Hernández Leo, Davinia; Daza, Vanesa (2012-07)
    L’Escola Superior Politècnica (ESUP) de la Universitat Pompeu Fabra, amb el suport del Consell Social va iniciar el curs 2009-2010 un Pla de Mentors (EnginyCat) adreçat als estudiants de primer curs dels Graus TIC. Aquesta ...
  • Becerra-Fajardo, Laura; Ivorra Cano, Antoni, 1974- (2012)
    For several years, researchers have developed techniques to replace and enhance the capabilities of our neural system by means of implantable electrical stimulation technologies. Even though important work has been done ...
  • Cerdà, Ramon; Farrús, Mireia; Hernando, Javier; Veyrat Rigat, Montserrat (Dirección Xeral de Creación e Difusión Cultural, 2007)
    La presente comunicación no ofrece todavía resultados de ningún experimento ya ejecutado, sino, com su título indica, una serie de comentarios en torno a la noción de "campo de dispersión fonemática" (o "fonológica") o ...
  • Öktem, Alp; Farrús, Mireia; Wanner, Leo (2017)
    This paper presents an open-source tool that has been developed to visualize a speech corpus with its transcript and prosodic features aligned at word level. In particular, the tool is aimed at providing a simple and clear ...
  • Bogdanov, Dmitry; Serra, Xavier (International Society for Music Information Retrieval (ISMIR), 2017)
    While a vast amount of editorial metadata is being actively gathered and used by music collectors and enthusiasts, it is often neglected by music information retrieval and musicology researchers. In this paper we propose ...
  • Caro Repetto, Rafael; Zhang, Shuo; Serra, Xavier (2017)
    When lyrics of tonal languages are set to music, the pitch contour of the tones has to agree to a certain extent with the melodic contour to assure intelligibility. The relationship between the linguistic tones of the ...