Abstract:
We present the results of a series of machine learning experiments aimed at exploring the differences and similarities in the production of turn-taking cues in American English and Argentine Spanish. An analysis of prosodic features automatically extracted from 21 dyadic conversations (12 En, 9 Sp) revealed that, when signaling Holds, speakers of both languages tend to use roughly the same combination of cues, characterized by a sustained final intonation, a shorter duration of turn-final inter-pausal units, and a distinct voice quality. However, in speech preceding Smooth Switches or Backchannels, we observe the existence of the same set of prosodic turn-taking cues in both languages, although the ways in which these cues are combined together to form complex signals differ. Still, we find that these differences do not degrade below chance the performance of cross-linguistic systems for automatically detecting turn-taking signals. These results are relevant to the construction of multilingual spoken dialogue systems, which need to adapt not only their ASR modules but also the way prosodic turn-taking cues are synthesized and recognized. Copyright © 2017 ISCA.
Registro:
Documento: |
Conferencia
|
Título: | Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish |
Autor: | Brusco, P.; Perez, J.M.; Gravano, A.; Lacerda F.; Strombergsson S.; Wlodarczak M.; Heldner M.; Gustafson J.; House D.; Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft |
Filiación: | Departamento de Computacion, FCEyN, Universidad de Buenos Aires, Argentina Instituto de Investigacion en Ciencias de la Computacion, CONICET-UBA, Buenos Aires, Argentina
|
Palabras clave: | Cross-linguistic; Dialogue; Prosody; Turn-taking; Learning systems; Linguistics; Speech processing; American English; Complex signal; Dialogue; Prosodic features; Prosody; Spoken dialogue system; Turn-taking; Voice quality; Speech communication |
Año: | 2017
|
Volumen: | 2017-August
|
Página de inicio: | 2351
|
Página de fin: | 2355
|
DOI: |
http://dx.doi.org/10.21437/Interspeech.2017-124 |
Título revista: | 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017
|
Título revista abreviado: | Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
|
ISSN: | 2308457X
|
Registro: | https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v2017-August_n_p2351_Brusco |
Referencias:
- Sacks, H., Schegloff, E.A., Jefferson, G., A simplest systematics for the organization of turn-taking for conversation (1974) Language, pp. 696-735
- Duncan, S., Fiske, D., (1977) Face-to-face Interaction: Research, Methods and Theory
- Ford, C.E., Thompson, S.A., Interactional units in conversation: Syntactic, intonational, and pragmatic resources for the management of turns (1996) Studies in Interactional Sociolinguistics, 13, pp. 134-184
- Wennerstrom, A., Siegel, A.F., Keeping the floor in multiparty conversations: Intonation, syntax, and pause (2003) Discourse Processes, 36 (2), pp. 77-107
- Stolcke, A., Ferrer, L., Shriberg, E., (2002) Is the Speaker Done Yet? Faster and More Accurate End-of-utterance Detection Using Prosody
- Gravano, A., Hirschberg, J., Turn-taking cues in task-oriented dialogue (2011) Computer Speech & Language, 25 (3), pp. 601-634
- Hjalmarsson, A., The additive effect of turn-taking cues in human and synthetic voice (2011) Speech Communication, 53, pp. 23-25
- Koiso, H., Horiuchi, Y., Tutiya, S., Ichikawa, A., Den, Y., An analysis of turn-taking and backchannels based on prosodic and syntactic features in Japanese map task dialogs (1998) Language and Speech, 41 (3-4), pp. 295-321
- Schlangen, D., From reaction to prediction: Experiments with computational models of turn-taking (2006) Proceedings of Interspeech 2006
- Bauman, R., Sherzer, J., (1989) Explorations in the Ethnography of Speaking, (8). , Cambridge University Press
- Schegloff, E.A., Interaction: The infrastructure for social institutions, the natural ecological niche for language, and the arena in which culture is enacted (2006) Roots of Human Sociality: Culture, Cognition and Interaction, pp. 70-96
- Stivers, T., Enfield, N.J., Brown, P., Englert, C., Hayashi, M., Heinemann, T., Hoymann, G., Yoon, K.-E., Universals and cultural variation in turn-taking in conversation (2009) Proceedings of the National Academy of Sciences, 106 (26), pp. 10587-10592
- Gravano, A., Brusco, P., Beňuš, S., Who do you think will speak next? Perception of turn-taking cues in slovak and argentine Spanish (2016) Interspeech, 2016, pp. 1265-1269
- Gravano, A., Hirschberg, J., Beňuš, S., Affirmative cue words in task-oriented dialogue (2012) Computational Linguistics, 38 (1), pp. 1-39
- Breiman, L., Random forests (2001) Machine Learning, 45 (1), pp. 5-32
- Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Duchesnay, E., Scikit-learn: Machine learning in python (2011) Journal of Machine Learning Research, 12, pp. 2825-2830
- Cohen, J., Cohen, P., West, S.G., Aiken, L.S., (2013) Applied Multiple Regression/correlation Analysis for the Behavioral Sciences, , Routledge
- Ward, N., Tsukahara, W., Prosodic features which cue backchannel responses in English and Japanese (2000) Journal of Pragmatics, 32 (8), pp. 1177-1207A4 - Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft
Citas:
---------- APA ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., Heldner M.,..., Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft
(2017)
. Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 2017-August, 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124---------- CHICAGO ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al.
"Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish"
. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 2017-August
(2017) : 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124---------- MLA ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al.
"Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish"
. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, vol. 2017-August, 2017, pp. 2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124---------- VANCOUVER ----------
Brusco, P., Perez, J.M., Gravano, A., Lacerda F., Strombergsson S., Wlodarczak M., et al. Cross-linguistic study of the production of turn-taking cues in American English and Argentine Spanish. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2017;2017-August:2351-2355.
http://dx.doi.org/10.21437/Interspeech.2017-124