Vocal caricatures reveal signatures of speaker identity

López, S.; Riera, P.; Assaneo, M.F.; Eguía, M.; Sigman, M.; Trevisan, M.A.

doi:10.1038/srep03407

Navegar

Documento Últimos Documentos Autor FCEN - Año Autor FCEN - Revista Año - Revista Revista - Año SubjectPcEn Colores Type

Colección

Artículo

López, S.; Riera, P.; Assaneo, M.F.; Eguía, M.; Sigman, M.; Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" (2013) Scientific Reports. 3

https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_20452322_v3_n_p_Lopez

Estamos trabajando para incorporar este artículo al repositorio

Consulte el artículo en la página del editor

Consulte la política de Acceso Abierto del editor

Abstract:

What are the features that impersonators select to elicit a speaker's identity? We built a voice database of public figures (targets) and imitations produced by professional impersonators. They produced one imitation based on their memory of the target (caricature) and another one after listening to the target audio (replica). A set of naive participants then judged identity and similarity of pairs of voices. Identity was better evoked by the caricatures and replicas were perceived to be closer to the targets in terms of voice similarity. We used this data to map relevant acoustic dimensions for each task. Our results indicate that speaker identity is mainly associated with vocal tract features, while perception of voice similarity is related to vocal folds parameters. We therefore show the way in which acoustic caricatures emphasize identity features at the cost of loosing similarity, which allows drawing an analogy with caricatures in the visual space.

Registro:

Documento:	Artículo
Título:	Vocal caricatures reveal signatures of speaker identity
Autor:	López, S.; Riera, P.; Assaneo, M.F.; Eguía, M.; Sigman, M.; Trevisan, M.A.
Filiación:	Dynamical Systems Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina Acoustics and Sound Perception Lab, Universidad of Quilmes, Roque Sáenz Peña 352, Bernal, Buenos Aires B1876BXD, Argentina Integrative Neuroscience Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina Torcuato Di Tella University, Almirante Juan Saenz Valiente 1010, C1428BIJ Buenos Aires, Argentina
Año:	2013
Volumen:	3
DOI:	http://dx.doi.org/10.1038/srep03407
Título revista:	Scientific Reports
Título revista abreviado:	Sci. Rep.
ISSN:	20452322
Registro:	https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_20452322_v3_n_p_Lopez

Referencias:

Latinus, M., Belin, P., Human voice perception (2011) Curr. Biol., 21, pp. R143-R145
Eriksson, E., (2007) That Voice Sounds Familiar: Factors in Speaker Recognition, , http://umu.diva-portal.org/smash/record.jsf?pid5diva2:140217, Accessed 29 October 2013
Hardcastle, W.J., Mackenzie Beck, J., (2005) A Figure of Speech. A Festschrift for John Laver, , Lawrence Erlbaum Associates
Hauser, M.D., Chomsky, N., Fitch, W.T., Neuroscience: The faculty of language: What is it, who has it, and how did it evolve? (2002) Science, 298 (5598), pp. 1569-1579. , DOI 10.1126/science.298.5598.1569
Markham, D., (2013) Phonetic Imitation, Accent, and the Learner, , (Lund University Press, 1997). Accessed 29 October
Assaneo, M.F., Nichols, J.I., Trevisan, M.A., The anatomy of onomatopoeia (2011) PLoS One, 6, pp. e28317
Kitamura, T., (2008) Acoustic Analysis of Imitated Voice Produced by A Professional Impersonator, , http://basil.is.konan-u.ac.jp/pub/is2008.pdf, INTERSPEECH 813-816 Accessed 21 July 2013
Zetterholm, E., Same speaker - Different voices. A study of one impersonator and some of his different imitations (2006) Proc. 11st. Aust. Int. Conf. Speech Sci. Technol., pp. 70-75. , (Warren, P. & C. I.)
Titze, I.R., (1994) Principles of Voice Production, p. 354. , Prentice Hall
Fant, G., (1970) Acoustic Theory of Speech Production, , Mouton De Gruyter
Titze, I., The physics of small-amplitude oscillation of the vocal folds (1988) J. Acoust. Soc. Am., pp. 1536-1552. , http://link.aip.org/link/jasman/v83/i4/p1536/s1, Accessed 16 May 2013
Zetterholm, E., The same but different-three impersonators imitate the same target voices (2003) Proc. 15th Int. Congr. Phonetic Sci., pp. 2205-2208
Baumann, O., Belin, P., Perceptual scaling of voice identity: Common dimensions for different vowels and speakers (2010) Psychol. Res., 74, pp. 110-120
Murry, T., Singh, S., Multidimensional analysis of male and female voices (1980) J. Acoust. Soc. Am., 68, pp. 1294-1300
Collins, S.A., Men's voices and women's choices (2000) Animal Behaviour, 60 (6), pp. 773-780. , DOI 10.1006/anbe.2000.1523
Fitch, W.T., Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques (1997) Journal of the Acoustical Society of America, 102 (2), pp. 1213-1222. , DOI 10.1121/1.421048
Samson, S., Zatorre, R.J., Ramsay, J.O., Multidimensional scaling of synthetic musical timbre: Perception of spectral and temporal characteristics (1997) Canadian Journal of Experimental Psychology, 51 (4), pp. 307-315
Zetterholm, E. in Speak. Classif. II SE - 16 (Muller, C.) 4441, 192-205 (Springer Berlin Heidelberg, 2007); Pasley, B.N., Reconstructing speech from human auditory cortex (2012) PLoS Biol., 10, pp. e1001251
Boersma, P., Weenink, D., (2013) Praat: Doing Phonetics by Computer, , http://www.praat.org/, Accessed 30 October 2013
Brainard, H.D., The psychophysics toolbox (1997) Spat. Vis., 10, pp. 433-436
Eriksson, E.J., Detection of imitated voices, who are reliable earwitnesses? (2010) Int. J. Speech Lang. Law, 17, pp. 25-44
Boersma, P., Weenink, D., (2013) Praat: Doing Phonetics by Computer, , http://www.praat.org/, Accessed 30 October 2013
Caclin, A., McAdams, S., Smith, B.K., Winsberg, S., Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tones (2005) J. Acoust. Soc. Am., 118, p. 471
Farrus, M., Wagner, M., Erro, D., Hernando, J., Automatic speaker recognition as ameasurement of voice imitation and conversion (2010) Int. J. Speech Lang. Law, 17, pp. 119-142
Farrus Cabeceran, M., (2013) Fusing Prosodic and Acoustic Information for Speaker Recognition, , (Universitat Politecnica de Catalunya 2008). Accessed 30 October
Jones, D.L., (2013) FATHOM for Matlab, , http://www.marine.usf.edu/user/djones/matlab/matlab.html, Accessed 30 October

Citas:

---------- APA ----------

López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M. & Trevisan, M.A. (2013) . Vocal caricatures reveal signatures of speaker identity. Scientific Reports, 3.
http://dx.doi.org/10.1038/srep03407

---------- CHICAGO ----------

López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" . Scientific Reports 3 (2013).
http://dx.doi.org/10.1038/srep03407

---------- MLA ----------

López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" . Scientific Reports, vol. 3, 2013.
http://dx.doi.org/10.1038/srep03407

---------- VANCOUVER ----------

López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. Vocal caricatures reveal signatures of speaker identity. Sci. Rep. 2013;3.
http://dx.doi.org/10.1038/srep03407