Welcome to the UPF Digital Repository

On the fusion of prosody, voice spectrum and face features for multimodal person verification

Show simple item record

dc.contributor.author Farrús, Mireia
dc.contributor.author Garde, Ainara
dc.contributor.author Ejarque, Pascual
dc.contributor.author Luque, Jordi
dc.contributor.author Hernando, Javier
dc.date.accessioned 2017-03-17T15:22:15Z
dc.date.available 2017-03-17T15:22:15Z
dc.date.issued 2006
dc.identifier.citation Farrús M, Garde A, Ejarque P, Luque J, Hernando J. On the fusion of prosody, voice spectrum and face features for multimodal person verification. In: 9th International Conference on Spoken Language Processing; 2006 Sept. 17-21; Pittsburgh (PA, USA). [place unknown]: ISCA; 2006. p. 2106-9.
dc.identifier.uri http://hdl.handle.net/10230/28256
dc.description.abstract Comunicació presentada a: 9th International Conference on Spoken Language Processing; 17-21 de setembre de 2006 a Pittsburgh, Estats Units d'Amèrica
dc.description.abstract Multimodal person recognition systems normally use shortterm spectral features as voice information. In this paper prosodic information is added to a system based on face and voice spectrum features. By using two fusion techniques, support vector machines and matcher weighting, different fusion strategies based on the fusion of monomodal scores in several steps are proposed. The performance of the system is clearly improved when the prosodic information is added and the best results are achieved when prosodic scores are previously fused and the resulting scores are fused again with spectral and facial scores. Speech and face scores have been obtained upon Switchboard-I and XM2VTS databases respectively.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher International Speech Communication Association (ISCA)
dc.relation.ispartof 9th International Conference on Spoken Language Processing; 2006 Sept. 17-21; Pittsburgh (PA, USA). [place unknown]: ISCA; 2006. p. 2106-9.
dc.rights © ISCA.
dc.title On the fusion of prosody, voice spectrum and face features for multimodal person verification
dc.type info:eu-repo/semantics/conferenceObject
dc.subject.keyword Speaker recognition
dc.subject.keyword Multimodality
dc.subject.keyword Fusion
dc.subject.keyword Prosody
dc.subject.keyword Voice spectrum
dc.subject.keyword Face
dc.relation.projectID info:eu-repo/grantAgreement/EC/FP6/506909
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion


This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

Compliant to Partaking