Implementing acoustic-prosodic entrainment in a conversational avatar

Levitan, R.; Beňuš, Š.; Gálvez, R.H.; Gravano, A.; Savoretti, F.; Trnka, M.; Weise, A.; Hirschberg, J.

doi:10.21437/Interspeech.2016-985

Levitan, R.; Beňuš, Š.; Gálvez, R.H.; Gravano, A.; Savoretti, F.; Trnka, M.; Weise, A.; Hirschberg, J. "Implementing acoustic-prosodic entrainment in a conversational avatar" (2016) 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016. 08-12-September-2016:1166-1170

https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v08-12-September-2016_n_p1166_Levitan

Abstract:

Entrainment, also known as accommodation or alignment, is the phenomenon by which conversational partners become more similar to each other in behavior. While there has been much work on some behaviors, there has been little on entrainment in speech and even less on how Spoken Dialogue Systems (SDS) that entrain to their users' speech can be created. We present an architecture and algorithm for implementing acoustic-prosodic entrainment in SDS and show that speech produced under this algorithm conforms to the feature targets, satisfying the properties of entrainment behavior observed in human-human conversations. We present results of an extrinsic evaluation of this method, comparing whether subjects are more likely to ask advice from a conversational avatar that entrains vs. one that does not, in English, Spanish, and Slovak SDS. Copyright © 2016 ISCA.

Record:

Document:	Conference
Title:	Implementing acoustic-prosodic entrainment in a conversational avatar
Author:	Levitan, R.; Beňuš, Š.; Gálvez, R.H.; Gravano, A.; Savoretti, F.; Trnka, M.; Weise, A.; Hirschberg, J.
Affiliation:	Department of Computer and Information Science, Brooklyn College CUNY, United States; Constantine the Philosopher University in Nitra, Slovakia; Institute of Informatics, Slovak Academy of Sciences, Slovakia; Departamento de Computación, FCEyN, Universidad de Buenos Aires, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina; Department of Computer Science, Columbia University, United States
Keywords:	Alignment; Avatars; Entrainment; Virtual agents; Air entrainment; Behavioral research; Speech; Speech processing; Spoken dialogue system; Virtual agent; Speech communication
Year:	2016
Volume:	08-12-September-2016
First page:	1166
Last page:	1170
DOI:	http://dx.doi.org/10.21437/Interspeech.2016-985
Journal title:	17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
Abbreviated journal title:	Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN:	2308457X
Record URL:	https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v08-12-September-2016_n_p1166_Levitan
