Artículo

López, S.; Riera, P.; Assaneo, M.F.; Eguía, M.; Sigman, M.; Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" (2013) Scientific Reports. 3
Estamos trabajando para incorporar este artículo al repositorio
Consulte el artículo en la página del editor
Consulte la política de Acceso Abierto del editor

Abstract:

What are the features that impersonators select to elicit a speaker's identity? We built a voice database of public figures (targets) and imitations produced by professional impersonators. They produced one imitation based on their memory of the target (caricature) and another one after listening to the target audio (replica). A set of naive participants then judged identity and similarity of pairs of voices. Identity was better evoked by the caricatures and replicas were perceived to be closer to the targets in terms of voice similarity. We used this data to map relevant acoustic dimensions for each task. Our results indicate that speaker identity is mainly associated with vocal tract features, while perception of voice similarity is related to vocal folds parameters. We therefore show the way in which acoustic caricatures emphasize identity features at the cost of loosing similarity, which allows drawing an analogy with caricatures in the visual space.

Registro:

Documento: Artículo
Título:Vocal caricatures reveal signatures of speaker identity
Autor:López, S.; Riera, P.; Assaneo, M.F.; Eguía, M.; Sigman, M.; Trevisan, M.A.
Filiación:Dynamical Systems Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina
Acoustics and Sound Perception Lab, Universidad of Quilmes, Roque Sáenz Peña 352, Bernal, Buenos Aires B1876BXD, Argentina
Integrative Neuroscience Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina
Torcuato Di Tella University, Almirante Juan Saenz Valiente 1010, C1428BIJ Buenos Aires, Argentina
Año:2013
Volumen:3
DOI: http://dx.doi.org/10.1038/srep03407
Título revista:Scientific Reports
Título revista abreviado:Sci. Rep.
ISSN:20452322
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_20452322_v3_n_p_Lopez

Referencias:

  • Latinus, M., Belin, P., Human voice perception (2011) Curr. Biol., 21, pp. R143-R145
  • Eriksson, E., (2007) That Voice Sounds Familiar: Factors in Speaker Recognition, , http://umu.diva-portal.org/smash/record.jsf?pid5diva2:140217, Accessed 29 October 2013
  • Hardcastle, W.J., Mackenzie Beck, J., (2005) A Figure of Speech. A Festschrift for John Laver, , Lawrence Erlbaum Associates
  • Hauser, M.D., Chomsky, N., Fitch, W.T., Neuroscience: The faculty of language: What is it, who has it, and how did it evolve? (2002) Science, 298 (5598), pp. 1569-1579. , DOI 10.1126/science.298.5598.1569
  • Markham, D., (2013) Phonetic Imitation, Accent, and the Learner, , (Lund University Press, 1997). Accessed 29 October
  • Assaneo, M.F., Nichols, J.I., Trevisan, M.A., The anatomy of onomatopoeia (2011) PLoS One, 6, pp. e28317
  • Kitamura, T., (2008) Acoustic Analysis of Imitated Voice Produced by A Professional Impersonator, , http://basil.is.konan-u.ac.jp/pub/is2008.pdf, INTERSPEECH 813-816 Accessed 21 July 2013
  • Zetterholm, E., Same speaker - Different voices. A study of one impersonator and some of his different imitations (2006) Proc. 11st. Aust. Int. Conf. Speech Sci. Technol., pp. 70-75. , (Warren, P. & C. I.)
  • Titze, I.R., (1994) Principles of Voice Production, p. 354. , Prentice Hall
  • Fant, G., (1970) Acoustic Theory of Speech Production, , Mouton De Gruyter
  • Titze, I., The physics of small-amplitude oscillation of the vocal folds (1988) J. Acoust. Soc. Am., pp. 1536-1552. , http://link.aip.org/link/jasman/v83/i4/p1536/s1, Accessed 16 May 2013
  • Zetterholm, E., The same but different-three impersonators imitate the same target voices (2003) Proc. 15th Int. Congr. Phonetic Sci., pp. 2205-2208
  • Baumann, O., Belin, P., Perceptual scaling of voice identity: Common dimensions for different vowels and speakers (2010) Psychol. Res., 74, pp. 110-120
  • Murry, T., Singh, S., Multidimensional analysis of male and female voices (1980) J. Acoust. Soc. Am., 68, pp. 1294-1300
  • Collins, S.A., Men's voices and women's choices (2000) Animal Behaviour, 60 (6), pp. 773-780. , DOI 10.1006/anbe.2000.1523
  • Fitch, W.T., Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques (1997) Journal of the Acoustical Society of America, 102 (2), pp. 1213-1222. , DOI 10.1121/1.421048
  • Samson, S., Zatorre, R.J., Ramsay, J.O., Multidimensional scaling of synthetic musical timbre: Perception of spectral and temporal characteristics (1997) Canadian Journal of Experimental Psychology, 51 (4), pp. 307-315
  • Zetterholm, E. in Speak. Classif. II SE - 16 (Muller, C.) 4441, 192-205 (Springer Berlin Heidelberg, 2007); Pasley, B.N., Reconstructing speech from human auditory cortex (2012) PLoS Biol., 10, pp. e1001251
  • Boersma, P., Weenink, D., (2013) Praat: Doing Phonetics by Computer, , http://www.praat.org/, Accessed 30 October 2013
  • Brainard, H.D., The psychophysics toolbox (1997) Spat. Vis., 10, pp. 433-436
  • Eriksson, E.J., Detection of imitated voices, who are reliable earwitnesses? (2010) Int. J. Speech Lang. Law, 17, pp. 25-44
  • Boersma, P., Weenink, D., (2013) Praat: Doing Phonetics by Computer, , http://www.praat.org/, Accessed 30 October 2013
  • Caclin, A., McAdams, S., Smith, B.K., Winsberg, S., Acoustic correlates of timbre space dimensions: A confirmatory study using synthetic tones (2005) J. Acoust. Soc. Am., 118, p. 471
  • Farrus, M., Wagner, M., Erro, D., Hernando, J., Automatic speaker recognition as ameasurement of voice imitation and conversion (2010) Int. J. Speech Lang. Law, 17, pp. 119-142
  • Farrus Cabeceran, M., (2013) Fusing Prosodic and Acoustic Information for Speaker Recognition, , (Universitat Politecnica de Catalunya 2008). Accessed 30 October
  • Jones, D.L., (2013) FATHOM for Matlab, , http://www.marine.usf.edu/user/djones/matlab/matlab.html, Accessed 30 October

Citas:

---------- APA ----------
López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M. & Trevisan, M.A. (2013) . Vocal caricatures reveal signatures of speaker identity. Scientific Reports, 3.
http://dx.doi.org/10.1038/srep03407
---------- CHICAGO ----------
López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" . Scientific Reports 3 (2013).
http://dx.doi.org/10.1038/srep03407
---------- MLA ----------
López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. "Vocal caricatures reveal signatures of speaker identity" . Scientific Reports, vol. 3, 2013.
http://dx.doi.org/10.1038/srep03407
---------- VANCOUVER ----------
López, S., Riera, P., Assaneo, M.F., Eguía, M., Sigman, M., Trevisan, M.A. Vocal caricatures reveal signatures of speaker identity. Sci. Rep. 2013;3.
http://dx.doi.org/10.1038/srep03407