Conference paper

Gálvez, R.H.; Benus, S.; Gravano, A.; Trnka, M.; Lacerda, F.; Strombergsson, S.; Wlodarczak, M.; Heldner, M.; Gustafson, J.; House, D. "Prosodic facilitation and interference while judging on the veracity of synthesized statements" (2017) 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. 2017-August:2331-2335

Abstract:

Two primary sources of information are provided in human speech. On the one hand, the verbal channel encodes linguistic content, while on the other hand, the vocal channel transmits paralinguistic information, mainly through prosody. In line with several studies that induce a conflict between these two channels to better understand the role of prosody, we conducted an experiment in which subjects had to listen to a series of statements synthesized with varying prosody and indicate if they believed them to be true or false. We find evidence suggesting that acoustic/prosodic (a/p) features of the synthesized statements affect response times (a well-known proxy for cognitive load). Our results suggest that prosody in synthesized speech may play a role of either facilitation or interference when subjects judge the truthfulness of a statement. Furthermore, we find that this pattern is amplified when the a/p features of the synthesized statements are analyzed relative to the subjects' own a/p features. This suggests that the entrainment of TTS voices has serious implications in the perceived trustworthiness of the system's skills. Copyright © 2017 ISCA.

Record:

Document: Conference paper
Title: Prosodic facilitation and interference while judging on the veracity of synthesized statements
Author: Gálvez, R.H.; Benus, S.; Gravano, A.; Trnka, M.; Lacerda, F.; Strombergsson, S.; Wlodarczak, M.; Heldner, M.; Gustafson, J.; House, D.
Affiliation: Departamento de Computacion, FCEyN, Universidad de Buenos Aires, Argentina
Constantine the Philosopher University in Nitra, Slovakia
Institute of Informatics, Slovak Academy of Sciences, Slovakia
Instituto de Investigacion en Ciencias de la Computacion, CONICET-UBA, Buenos Aires, Argentina
Keywords: Entrainment; Prosodic interference/facilitation; Text to speech; Trustworthiness; Air entrainment; Linguistics; Speech; Cognitive loads; Human speech; Paralinguistic information; Primary sources; Synthesized speech; Speech communication
Year: 2017
Volume: 2017-August
First page: 2331
Last page: 2335
DOI: http://dx.doi.org/10.21437/Interspeech.2017-453
Journal title: 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017
Abbreviated journal title: Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN: 2308457X
Record: https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v2017-August_n_p2331_Galvez

References:

  • Apple, W., Streeter, L.A., Krauss, R.M., Effects of pitch and speech rate on personal attributions (1979) Journal of Personality and Social Psychology, 37 (5), p. 715
  • Schuller, B., Steidl, S., Batliner, A., Burkhardt, F., Devillers, L., Müller, C., Narayanan, S., Paralinguistics in speech and language - State-of-the-art and the challenge (2013) Computer Speech & Language, 27 (1), pp. 4-39
  • Huang, X., Acero, A., Hon, H.-W., (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development, Prentice Hall PTR (foreword by R. Reddy)
  • Burns, K.L., Beier, E.G., Significance of vocal and visual channels in the decoding of emotional meaning (1973) Journal of Communication, 23 (1), pp. 118-130, http://dx.doi.org/10.1111/j.1460-2466.1973.tb00936.x
  • Cutler, A., Dahan, D., Van Donselaar, W., Prosody in the comprehension of spoken language: A literature review (1997) Language and Speech, 40 (2), pp. 141-201
  • Fairbanks, G., Pronovost, W., An experimental study of the pitch characteristics of the voice during the expression of emotion (1939) Communications Monographs, 6 (1), pp. 87-104
  • Crumpton, J., Bethel, C.L., A survey of using vocal prosody to convey emotion in robot speech (2016) International Journal of Social Robotics, 8 (2), pp. 271-285, http://dx.doi.org/10.1007/s12369-015-0329-4
  • Kjelgaard, M.M., Speer, S.R., Prosodic facilitation and interference in the resolution of temporary syntactic closure ambiguity (1999) Journal of Memory and Language, 40 (2), pp. 153-194
  • Mitchell, R.L., Does incongruence of lexicosemantic and prosodic information cause discernible cognitive conflict? (2006) Cognitive, Affective, & Behavioral Neuroscience, 6 (4), pp. 298-305
  • Wittfoth, M., Schröder, C., Schardt, D.M., Dengler, R., Heinze, H.-J., Kotz, S.A., On emotional conflict: Interference resolution of happy and angry prosody reveals valence-specific effects (2009) Cerebral Cortex, 20 (2), p. 383, http://dx.doi.org/10.1093/cercor/bhp106
  • Nass, C., Jonsson, I.-M., Harris, H., Reaves, B., Endo, J., Brave, S., Takayama, L., Improving automotive safety by pairing driver emotion and car voice emotion (2005) CHI '05 Extended Abstracts on Human Factors in Computing Systems, Ser. CHI EA '05, pp. 1973-1976, New York, NY, USA: ACM
  • D'Mello, S., Graesser, A., AutoTutor and affective AutoTutor: Learning by talking with cognitively and emotionally intelligent computers that talk back (2013) ACM Trans. Interact. Intell. Syst., 2 (4), pp. 23:1-23:39, Jan
  • Smith, S.M., Shaffer, D.R., Celerity and cajolery: Rapid speech may promote or inhibit persuasion through its impact on message elaboration (1991) Personality and Social Psychology Bulletin, 17 (6), pp. 663-669, http://dx.doi.org/10.1177/0146167291176009
  • Levitan, R., Benuš, S., Gravano, A., Hirschberg, J., Entrainment and turn-taking in human-human dialogue (2015) AAAI Spring Symposium on Turn-Taking and Coordination in Human-Machine Interaction
  • Ward, A., Litman, D., Measuring convergence and priming in tutorial dialog (2007) University of Pittsburgh, Tech. Rep
  • Levitan, R., Hirschberg, J., Measuring acoustic-prosodic entrainment with respect to multiple levels and dimensions (2011) Interspeech 2011, pp. 3081-3084
  • Gravano, A., Benuš, S., Levitan, R., Hirschberg, J., Backward mimicry and forward influence in prosodic contour choice in Standard American English (2015) Proceedings of Interspeech
  • Brennan, S.E., Clark, H.H., Conceptual pacts and lexical choice in conversation (1996) Journal of Experimental Psychology: Learning, Memory, and Cognition, 22 (6), p. 1482
  • B. Litwak, personal communication, 2016-08-05; Levitan, R., Benuš, S., Gálvez, R.H., Gravano, A., Savoretti, F., Trnka, M., Weise, A., Hirschberg, J., Implementing acoustic-prosodic entrainment in a conversational avatar (2016) Interspeech 2016, pp. 1166-1170
  • Violante, L., Zivic, P.R., Gravano, A., Improving speech synthesis quality by reducing pitch peaks in the source recordings (2013) HLT-NAACL, pp. 502-506
  • Boersma, P., Weenink, D., (2016) Praat: Doing Phonetics by Computer, http://www.praat.org
  • Jones, C., Berry, L., Stevens, C., Synthesized speech intelligibility and persuasion: Speech rate and non-native listeners (2007) Computer Speech & Language, 21 (4), pp. 641-651
  • Heathcote, A., Popiel, S.J., Mewhort, D., Analysis of response time distributions: An example using the Stroop task (1991) Psychological Bulletin, 109 (2), p. 340
  • Pérez, J.M., Gálvez, R.H., Gravano, A., Disentrainment may be a positive thing: A novel measure of unsigned acoustic-prosodic synchrony, and its relation to speaker engagement (2016) Interspeech 2016, pp. 1270-1274
  • Levitan, R., Benuš, S., Gravano, A., Hirschberg, J., Acoustic-prosodic entrainment in Slovak, Spanish, English and Chinese: A cross-linguistic comparison (2015) Proceedings of SIGdial, pp. 325-334
  • DePaulo, B.M., Lindsay, J.J., Malone, B.E., Muhlenbruck, L., Charlton, K., Cooper, H., Cues to deception (2003) Psychological Bulletin, 129 (1), p. 74
  • Cheng, J.T., Tracy, J.L., Ho, S., Henrich, J., Listen, follow me: Dynamic vocal signals of dominance predict emergent social rank in humans (2016) Journal of Experimental Psychology: General, 145 (5), p. 536
  • Klofstad, C.A., Anderson, R.C., Peters, S., Sounds like a winner: Voice pitch influences perception of leadership capacity in both men and women (2012) Proceedings of the Royal Society of London B: Biological Sciences, 279 (1738), pp. 2698-2704
  • Rockwell, P., Buller, D.B., Burgoon, J.K., The voice of deceit: Refining and expanding vocal cues to deception (1997) Communication Research Reports, 14 (4), pp. 451-459
  • Ohala, J.J., Cross-language use of pitch: An ethological view (1983) Phonetica, 40 (1), pp. 1-18
  • Hirschberg, J., The pragmatics of intonational meaning (2002) Proceedings of Speech Prosody
  • Gussenhoven, C., Intonation and interpretation: Phonetics and phonology (2002) Proceedings of Speech Prosody

Citations:

---------- APA ----------
Gálvez, R.H., Benus, S., Gravano, A., Trnka, M., Lacerda, F., Strombergsson, S., Wlodarczak, M., ..., House, D. (2017). Prosodic facilitation and interference while judging on the veracity of synthesized statements. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 2017-August, 2331-2335.
http://dx.doi.org/10.21437/Interspeech.2017-453
---------- CHICAGO ----------
Gálvez, R.H., Benus, S., Gravano, A., Trnka, M., Lacerda, F., Strombergsson, S., et al. "Prosodic facilitation and interference while judging on the veracity of synthesized statements". 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 2017-August (2017): 2331-2335.
http://dx.doi.org/10.21437/Interspeech.2017-453
---------- MLA ----------
Gálvez, R.H., Benus, S., Gravano, A., Trnka, M., Lacerda, F., Strombergsson, S., et al. "Prosodic facilitation and interference while judging on the veracity of synthesized statements". 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, vol. 2017-August, 2017, pp. 2331-2335.
http://dx.doi.org/10.21437/Interspeech.2017-453
---------- VANCOUVER ----------
Gálvez, R.H., Benus, S., Gravano, A., Trnka, M., Lacerda F., Strombergsson S., et al. Prosodic facilitation and interference while judging on the veracity of synthesized statements. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2017;2017-August:2331-2335.
http://dx.doi.org/10.21437/Interspeech.2017-453