Robustness of prosodic features to voice imitation

Citació

Farrús M, Wagner M, Anguita J, Hernando J. Robustness of prosodic features to voice imitation. In: 9th Annual Conference of the International Speech Communication Association; 2008 Sept. 22-26; Brisbane (Australia). [place unknown]: ISCA; 2008. p. 613-6.

Enllaç permanent

Descripció

Resum
Prosody plays an important role in the human recognition process; therefore, prosodic elements are normally used by impersonators aiming to resemble someone else. Since such voice imitation is one of the potential threats to security systems relying on automatic speaker recognition, and prosodic features have been considered for state-of-the-art recognition systems in recent years, the question arises as to what extent a mimicker is able to get close the prosodic characteristics of a target speaker. To this end, two experiments are conducted for twelve individual features in order to determine how a prosodic speaker identification system would perform against professionally imitated voices. The results show that the identification error rate increases for all the features except F0 range when the impersonators’ modified voices are used instead of the impersonators natural voices. Moreover, it seems easier to copy prosody on the basis of a whole sentence than for a specific word.
Descripció
Comunicació presentada a 9th Annual Conference of the International Speech Communication Association celebrada a Brisbane (Australia) del 22 al 26 de setembre de 2008.
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)

Fitxers