Conferencia

Levitan, R.; Beňuš, Š.; Gálvez, R.H.; Gravano, A.; Savoretti, F.; Trnka, M.; Weise, A.; Hirschberg, J.; Morgan N.; Georgiou P.; Morgan N.; Narayanan S.; Metze F.; Amazon Alexa; Apple; eBay; et al.; Google; Microsoft "Implementing acoustic-prosodic entrainment in a conversational avatar" (2016) 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016. 08-12-September-2016:1166-1170
Estamos trabajando para incorporar este artículo al repositorio
Consulte el artículo en la página del editor
Consulte la política de Acceso Abierto del editor

Abstract:

Entrainment, aka accommodation or alignment, is the phenomenon by which conversational partners become more similar to each other in behavior. While there has been much work on some behaviors there has been little on entrainment in speech and even less on how Spoken Dialogue Systems (SDS) which entrain to their users' speech can be created. We present an architecture and algorithm for implementing acoustic-prosodic entrainment in SDS and show that speech produced under this algorithm conforms to the feature targets, satisfying the properties of entrainment behavior observed in human-human conversations. We present results of an extrinsic evaluation of this method, comparing whether subjects are more likely to ask advice from a conversational avatar that entrains vs. one that does not, in English, Spanish and Slovak SDS. Copyright © 2016 ISCA.

Registro:

Documento: Conferencia
Título:Implementing acoustic-prosodic entrainment in a conversational avatar
Autor:Levitan, R.; Beňuš, Š.; Gálvez, R.H.; Gravano, A.; Savoretti, F.; Trnka, M.; Weise, A.; Hirschberg, J.; Morgan N.; Georgiou P.; Morgan N.; Narayanan S.; Metze F.; Amazon Alexa; Apple; eBay; et al.; Google; Microsoft
Filiación:Department of Computer and Information Science, Brooklyn College CUNY, United States
Constantine the Philosopher University in Nitra, Slovakia
Institute of Informatics, Slovak Academy of Sciences, Slovakia
Departamento de Computación, FCEyN, Universidad de Buenos Aires, Argentina
Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
Department of Computer Science, Columbia University, United States
Palabras clave:Alignment; Avatars; Entrainment; Virtual agents; Air entrainment; Alignment; Behavioral research; Speech; Speech processing; Avatars; Spoken dialogue system; Virtual agent; Speech communication
Año:2016
Volumen:08-12-September-2016
Página de inicio:1166
Página de fin:1170
DOI: http://dx.doi.org/10.21437/Interspeech.2016-985
Título revista:17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
Título revista abreviado:Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN:2308457X
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v08-12-September-2016_n_p1166_Levitan

Referencias:

  • Lee, C.-C., Black, M., Katsamanis, A., Lammert, A., Baucom, B., Christensen, A., Georgiou, P.G., Narayanan, S., Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples (2010) Proceedings of Interspeech
  • Levitan, R., Hirschberg, J., Measuring acoustic-prosodic entrainment with respect to multiple levels and dimensions (2011) Proceedings of Interspeech
  • Chartrand, T.L., Bargh, J.A., The chameleon effect: The perception-behavior link and social interaction (1999) Journal of Personality and Social Psychology, 76 (6), pp. 893-910
  • Manson, J.H., Bryant, G.A., Gervais, M.M., Kline, M.A., Convergence of speech rate in conversation predicts cooperation (2013) Evolution and Human Behavior, 34 (6), pp. 419-426
  • Nass, C., Steuer, J., Tauber, E.R., Computers are social actors (1994) Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 72-78. , ACM
  • Levitan, R., (2014) Acoustic-prosodic Entrainment in Human-human and Human-computer Dialogue, , Ph.D. dissertation, Columbia University
  • Natale, M., Convergence of mean vocal intensity in dyadic communication as a function of social desirability (1975) Journal of Personality and Social Psychology, 32 (5), pp. 790-804
  • Gregory, S., Webster, S., Huang, G., Voice pitch and amplitude convergence as a metric of quality in dyadic interviews (1993) Language & Communication, 13 (3), pp. 195-217
  • Ward, A., Litman, D., (2007) Measuring Convergence and Priming in Tutorial Dialog, , University of Pittsburgh, Tech. Rep
  • Brennan, S.E., Lexical entrainment in spontaneous dialog (1996) Proceedings of ISSD, pp. 41-44
  • Branigan, H.P., Pickering, M.J., Cleland, A.A., Syntactic coordination in dialogue (2000) Cognition, 75 (2), pp. B13-B25
  • Reitter, D., Moore, J.D., Keller, F., Priming of syntactic rules in task-oriented dialogue and spontaneous conversation (2006) Proceedings of the 28th Annual Conference of the Cognitive Science Society, p. 685690
  • Niederhoffer, K.G., Pennebaker, J.W., Linguistic style matching in social interaction (2002) Journal of Language and Social Psychology, 21 (4), pp. 337-360
  • Danescu-Niculescu-Mizil, C., Gamon, M., Dumais, S., Mark my words! linguistic style accommodation in social media (2011) Proceedings of WWW
  • Michael, L., Otterbacher, J., Write like i write: Herding in the language of online reviews (2014) Proceedings of the Eigth International AAAI Conference on Weblogs and Social Media
  • Giles, H., Coupland, N., Coupland, J., Accommodation theory: Communication, context, and consequence (1991) Contexts of Accommodation: Developments in Applied Sociolinguistics, 1
  • Bourhis, R.Y., Giles, H., The language of intergroup distinctiveness (1977) Language, Ethnicity and Intergroup Relations, 13, p. 119
  • Street, R.L., Speech convergence and speech evaluation in factfinding interviews (1984) Human Communication Research, 11 (2), pp. 139-169
  • Reitter, D., Moore, J.D., Predicting success in dialogue (2007) Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 808-815
  • Ireland, M.E., Slatcher, R.B., Eastwick, P.W., Scissors, L.E., Finkel, E.J., Pennebaker, J.W., Language style matching predicts relationship initiation and stability (2011) Psychological Science, 22 (1), pp. 39-44
  • Thomason, J., Nguyen, H.V., Litman, D., Prosodic entrainment and tutoring dialogue success (2013) Artificial Intelligence in Education, pp. 750-753. , Springer
  • Lubold, N., Pon-Barry, H., Walker, E., Naturalness and rapport in a pitch adaptive learning companion (2015) IEEE Automatic Speech Recognition and Understanding Workshop
  • Hu, Z., Halberg, G., Jimenez, C.R., Walker, M.A., Entrainment in pedestrian direction giving: How many kinds of entrainment? (2014) Proceedings of 5th International Workshop on Spoken, , Dialog System
  • Lopes, J., Eskenazi, M., Trancoso, I., Automated two-way entrainment to improve spoken dialog system performance (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8372-8376. , IEEE
  • Boersma, P., Weenink, D., (2012) Praat: Doing Phonetics by Computer [Computer Program], , http://www.praat.org, version 5.3.23, retrieved 21 August 2012 from
  • Mertens, P., The prosogram: Semi-automatic transcription of prosody based on a tonal perception model (2004) Speech Prosody
  • Rosenberg, A., AuToBI-A tool for automatic ToBI annotation (2010) Proceedings of Interspeech, pp. 146-149
  • Levitan, R., BeňuŠ, S., Gravano, A., Hirschberg, J., Entrainment in slovak, Spanish, english, and Chinese: A cross-linguistic comparison (2015) Proceedings of SIGdial
  • Levitan, R., Gravano, A., Willson, L., Benus, S., Hirschberg, J., Nenkova, A., Acoustic-prosodic entrainment and social behavior (2012) Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 11-19. , http://www.aclweb.org/anthology/N12-1002, Montréal, Canada: Association for Computational Linguistics, June
  • Levitan, R., Benus, S., Gravano, A., Hirschberg, J., Entrainment and turn-taking in human-human dialogue (2015) AAAI 2015 Spring Symposium on Turn-taking and Coordination in Human-Machine Interaction
  • Schröder, M., Trouvain, J., The German text-to-speech synthesis system Mary: A tool for research, development, and teaching (2001) SSW
  • Gosling, S., Rentfrow, P., Swann, W., A very brief measure of the big-five personality domains (2003) Journal of Research in Personality, 37 (6), pp. 504-528
  • Huggins-Daines, D., Kumar, M., Chan, A., Black, A.W., Ravishankar, M., Rudnicky, A.I., Pocketsphinx: A free, real-time continuous speech recognition system for hand-held devices (2006) Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings 2006 IEEE International Conference on, , IEEE
  • Violante, L., Zivic, P.R., Gravano, A., Improving speech synthesis quality by reducing pitch peaks in the source recordings (2013) HLT-NAACL, pp. 502-506
  • Wu, C.-H., Hsia, C.-C., Liu, T.-H., Wang, J.-F., Voice conversion using duration-embedded bi-hmms for expressive speech synthesis (2006) Audio, Speech, and Language Processing, IEEE Transactions on, 14 (4), pp. 1109-1116A4 - Amazon Alexa; Apple; eBay; et al.; Google; Microsoft

Citas:

---------- APA ----------
Levitan, R., Beňuš, Š., Gálvez, R.H., Gravano, A., Savoretti, F., Trnka, M., Weise, A.,..., Amazon Alexa; Apple; eBay; et al.; Google; Microsoft (2016) . Implementing acoustic-prosodic entrainment in a conversational avatar. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, 08-12-September-2016, 1166-1170.
http://dx.doi.org/10.21437/Interspeech.2016-985
---------- CHICAGO ----------
Levitan, R., Beňuš, Š., Gálvez, R.H., Gravano, A., Savoretti, F., Trnka, M., et al. "Implementing acoustic-prosodic entrainment in a conversational avatar" . 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016 08-12-September-2016 (2016) : 1166-1170.
http://dx.doi.org/10.21437/Interspeech.2016-985
---------- MLA ----------
Levitan, R., Beňuš, Š., Gálvez, R.H., Gravano, A., Savoretti, F., Trnka, M., et al. "Implementing acoustic-prosodic entrainment in a conversational avatar" . 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016, vol. 08-12-September-2016, 2016, pp. 1166-1170.
http://dx.doi.org/10.21437/Interspeech.2016-985
---------- VANCOUVER ----------
Levitan, R., Beňuš, Š., Gálvez, R.H., Gravano, A., Savoretti, F., Trnka, M., et al. Implementing acoustic-prosodic entrainment in a conversational avatar. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2016;08-12-September-2016:1166-1170.
http://dx.doi.org/10.21437/Interspeech.2016-985