Conferencia

Brusco, P.; Perez, J.M.; Gravano, A.; Lacerda F.; Strombergsson S.; Wlodarczak M.; Heldner M.; Gustafson J.; House D.; Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft "Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish" (2017) 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. 2017-August:2351-2355
Estamos trabajando para incorporar este artículo al repositorio
Consulte el artículo en la página del editor
Consulte la política de Acceso Abierto del editor

Abstract:

We present the results of a series of machine learning experiments aimed at exploring the differences and similarities in the production of turn-taking cues in American English and Argentine Spanish. An analysis of prosodic features automatically extracted from 21 dyadic conversations (12 En, 9 Sp) revealed that, when signaling Holds, speakers of both languages tend to use roughly the same combination of cues, characterized by a sustained final intonation, a shorter duration of turn-final inter-pausal units, and a distinct voice quality. However, in speech preceding Smooth Switches or Backchannels, we observe the existence of the same set of prosodic turn-taking cues in both languages, although the ways in which these cues are combined together to form complex signals differ. Still, we find that these differences do not degrade below chance the performance of cross-linguistic systems for automatically detecting turn-taking signals. These results are relevant to the construction of multilingual spoken dialogue systems, which need to adapt not only their ASR modules but also the way prosodic turn-taking cues are synthesized and recognized. Copyright © 2017 ISCA.

Registro:

Documento: Conferencia
Título:Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish
Autor:Brusco, P.; Perez, J.M.; Gravano, A.; Lacerda F.; Strombergsson S.; Wlodarczak M.; Heldner M.; Gustafson J.; House D.; Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft
Filiación:Departamento de Computacion, FCEyN, Universidad de Buenos Aires, Argentina
Instituto de Investigacion en Ciencias de la Computacion, CONICET-UBA, Buenos Aires, Argentina
Palabras clave:Cross-linguistic; Dialogue; Prosody; Turn-taking; Learning systems; Linguistics; Speech processing; American English; Complex signal; Dialogue; Prosodic features; Prosody; Spoken dialogue system; Turn-taking; Voice quality; Speech communication
Año:2017
Volumen:2017-August
Página de inicio:2351
Página de fin:2355
DOI: http://dx.doi.org/10.21437/Interspeech.2017-124
Título revista:18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017
Título revista abreviado:Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN:2308457X
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v2017-August_n_p2351_Brusco

Referencias:

  • Sacks, H., Schegloff, E.A., Jefferson, G., A simplest systematics for the organization of turn-taking for conversation (1974) Language, pp. 696-735
  • Duncan, S., Fiske, D., (1977) Face-to-face Interaction: Research, Methods and Theory
  • Ford, C.E., Thompson, S.A., Interactional units in conversation: Syntactic, intonational, and pragmatic resources for the management of turns (1996) Studies in Interactional Sociolinguistics, 13, pp. 134-184
  • Wennerstrom, A., Siegel, A.F., Keeping the floor in multiparty conversations: Intonation, syntax, and pause (2003) Discourse Processes, 36 (2), pp. 77-107
  • Stolcke, A., Ferrer, L., Shriberg, E., (2002) Is the Speaker Done Yet? Faster and More Accurate End-of-utterance Detection Using Prosody
  • Gravano, A., Hirschberg, J., Turn-taking cues in task-oriented dialogue (2011) Computer Speech & Language, 25 (3), pp. 601-634
  • Hjalmarsson, A., The additive effect of turn-taking cues in human and synthetic voice (2011) Speech Communication, 53, pp. 23-25
  • Koiso, H., Horiuchi, Y., Tutiya, S., Ichikawa, A., Den, Y., An analysis of turn-taking and backchannels based on prosodic and syntactic features in Japanese map task dialogs (1998) Language and Speech, 41 (3-4), pp. 295-321
  • Schlangen, D., From reaction to prediction: Experiments with computational models of turn-taking (2006) Proceedings of Interspeech 2006
  • Bauman, R., Sherzer, J., (1989) Explorations in the Ethnography of Speaking, (8). , Cambridge University Press
  • Schegloff, E.A., Interaction: The infrastructure for social institutions, the natural ecological niche for language, and the arena in which culture is enacted (2006) Roots of Human Sociality: Culture, Cognition and Interaction, pp. 70-96
  • Stivers, T., Enfield, N.J., Brown, P., Englert, C., Hayashi, M., Heinemann, T., Hoymann, G., Yoon, K.-E., Universals and cultural variation in turn-taking in conversation (2009) Proceedings of the National Academy of Sciences, 106 (26), pp. 10587-10592
  • Gravano, A., Brusco, P., Beňuš, S., Who do you think will speak next? Perception of turn-taking cues in slovak and argentine Spanish (2016) Interspeech, 2016, pp. 1265-1269
  • Gravano, A., Hirschberg, J., Beňuš, S., Affirmative cue words in task-oriented dialogue (2012) Computational Linguistics, 38 (1), pp. 1-39
  • Breiman, L., Random forests (2001) Machine Learning, 45 (1), pp. 5-32
  • Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Duchesnay, E., Scikit-learn: Machine learning in python (2011) Journal of Machine Learning Research, 12, pp. 2825-2830
  • Cohen, J., Cohen, P., West, S.G., Aiken, L.S., (2013) Applied Multiple Regression/correlation Analysis for the Behavioral Sciences, , Routledge
  • Ward, N., Tsukahara, W., Prosodic features which cue backchannel responses in English and Japanese (2000) Journal of Pragmatics, 32 (8), pp. 1177-1207A4 - Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft

Citas:

---------- APA ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., Heldner M.,..., Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft (2017) . Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 2017-August, 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124
---------- CHICAGO ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al. "Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish" . 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 2017-August (2017) : 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124
---------- MLA ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al. "Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish" . 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, vol. 2017-August, 2017, pp. 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124
---------- VANCOUVER ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al. Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2017;2017-August:2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124