Conferencia

Kleinhans, J.; Farrús, M.; Gravano, A.; Pérez, J.M.; Lai, C.; Wanner, L.; Lacerda F.; Strombergsson S.; Wlodarczak M.; Heldner M.; Gustafson J.; House D.; Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft "Using prosody to classify discourse relations" (2017) 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. 2017-August:3201-3205
Estamos trabajando para incorporar este artículo al repositorio
Consulte el artículo en la página del editor
Consulte la política de Acceso Abierto del editor

Abstract:

This work aims to explore the correlation between the discourse structure of a spoken monologue and its prosody by predicting discourse relations from different prosodic attributes. For this purpose, a corpus of semi-spontaneous monologues in English has been automatically annotated according to the Rhetorical Structure Theory, which models coherence in text via rhetorical relations. From corresponding audio files, prosodic features such as pitch, intensity, and speech rate have been extracted from different contexts of a relation. Supervised classification tasks using Support Vector Machines have been performed to find relationships between prosodic features and rhetorical relations.Preliminary results show that intensity combined with other features extracted from intra- and intersegmental environments is the feature with the highest predictability for a discourse relation. The prediction of rhetorical relations from prosodic features and their combinations is straightforwardly applicable to several tasks such as speech understanding or generation. Moreover, the knowledge of how rhetorical relations should be marked in terms of prosody will serve as a basis to improve speech synthesis applications and make voices sound more natural and expressive. Copyright © 2017 ISCA.

Registro:

Documento: Conferencia
Título:Using prosody to classify discourse relations
Autor:Kleinhans, J.; Farrús, M.; Gravano, A.; Pérez, J.M.; Lai, C.; Wanner, L.; Lacerda F.; Strombergsson S.; Wlodarczak M.; Heldner M.; Gustafson J.; House D.; Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft
Filiación:TALN Research Group, DTIC, Universitat Pompeu Fabra, Barcelona, Spain
Departamento de Computación, FCEyN, Universidad de Buenos Aires, Argentina
Instituto de Investigación en Ciencias de la Computación, CONICET-UBA, Buenos Aires, Argentina
School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
Catalan Institute for Research and Advanced Studies, Barcelona, Spain
Palabras clave:Discourse structure; Prosody; RST; Speech synthesis; Support vector machines; Continuous speech recognition; Speech; Speech synthesis; Support vector machines; Text processing; Discourse structure; Prosodic features; Prosody; Rhetorical relations; Rhetorical structure theory; Speech rates; Speech understanding; Supervised classification; Speech communication
Año:2017
Volumen:2017-August
Página de inicio:3201
Página de fin:3205
DOI: http://dx.doi.org/10.21437/Interspeech.2017-710
Título revista:18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017
Título revista abreviado:Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH
ISSN:2308457X
Registro:https://bibliotecadigital.exactas.uba.ar/collection/paper/document/paper_2308457X_v2017-August_n_p3201_Kleinhans

Referencias:

  • Hirschberg, J., Litman, D., Now let's talk about now: Identifying cue phrases intonationally (1987) Proceedings of the 25th Annual Meeting on Association for Computational Linguistics, pp. 163-171. , Association for Computational Linguistics
  • Hirschberg, J., Litman, D., Pierrehumbert, J.B., Ward, G., Intonation and the intentional structure of discourse (1987) Proceedings of the 10th International Joint Conference on Artificial Intelligence, 2 (1), pp. 636-639
  • Murray, G., Renals, S., Taboada, M., Prosodic correlates of rhetorical relations Proceedings of the HLT-NAACL 2006 Workshop on Analyzing Conversations in Text and Speech, 2006, pp. 1-7. , June
  • Mann, W.C., Thompson, S.A., (1988) Rhetorical Structure Theory: Toward A Functional Theory of Text Organization, pp. 243-281
  • Zwicky, A., Clitics and particles (1985) Language, 61 (2), pp. 283-305
  • Fraser, B., What are discourse markers? (1999) Journal of Pragmatics, 31, pp. 931-952
  • Louwerse, M.M., Mitchell, H., Towards a taxonomy of a set of discourse markers in dialog: A theoretical and computational linguistic account (2003) Discourse Processes, 35 (1), pp. 199-239
  • Schiffrin, D., (1988) Discourse Markers, , Cambridge University Press
  • Fries, C.C., (1973) The Structure of English: An Introduction to the Construction of English Sentences, , Longman
  • Knott, A., Dale, R., Using linguistic phenomena to motivate a set of coherence relations (1994) Discourse Processes, 18, pp. 35-62
  • Taboada, M., Discourse markers as signals (or not) of rhetorical relations (2006) Journal of Pragmatics, 38, pp. 567-592
  • Janin, A., Baron, D., Edwards, J., Ellis, D., Gelbart, D., Morgan, N., Peskin, B., Stolcke, A., The icsi meeting corpus. Acoustics, speech, and signal processing, 2003 (2003) Proceedings.(ICASSP03). 2003 IEEE International Conference on, 1
  • Farrús, M., Lai, C., Moore, J.D., Paragraph-based prosodic cues for speech synthesis applications (2016) Proceedings of the 8th International Conference on Speech Prosody (SP 2016)
  • Feng, V.W., Hirst, G., A linear-time bottom-up discourse parser with constraints and post-editing (2014) Acl, pp. 511-521
  • Heilman, M., Sagae, K., (2015) Fast Rhetorical Structure Theory Discourse Parsing, , http://arxiv.org/abs/1505.02425
  • Surdeanu, M., Hicks, T., Valenzuela-Escárcega, M.A., Two practical rhetorical structure theory parsers (2015) Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 1-5
  • Hernault, H., Prendinger, H., DuVerle, D.A., Ishizuka, M., HILDA: A discourse parser using Support Vector Machine classification (2010) Dialogue & Discourse, 1 (3), pp. 1-33
  • Liu, Y., Chawla, N.V., Harper, M.P., Shriberg, E., Stolcke, A., A study in machine learning from imbalanced data for sentence boundary detection in speech (2006) Computer Speech & Language, 20 (4), pp. 469-494
  • Witten, I., Frank, E., Hall, M., Pal, C., The WEKA workbench. Online appendix for data mining: Practical machine learning tools and techniques (2016) Ser. The Morgan Kaufmann Series in Data Management Systems, , Fourth Edition Elsevier Science
  • Gravano, A., Benus, S., Hirschberg, J., Mitchell, S., Vovsha, I., Classification of discourse functions of affirmative words in spoken dialogue (2007) Interspeech, pp. 1613-1616
  • Lai, C., What do you mean, you're uncertain?: The interpretation of cue words and rising intonation in dialogue (2010) Interspeech, pp. 1-4
  • Domínguez, M., Farrús, M., Burga, A., Wanner, L., The information structureprosody language interface revisited (2014) Proceedings of the 7th International Conference on Speech Prosody (SP2014), pp. 539-543. , Dublin, IrelandA4 - Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft

Citas:

---------- APA ----------
Kleinhans, J., Farrús, M., Gravano, A., Pérez, J.M., Lai, C., Wanner, L., Lacerda F.,..., Amazon Alexa; Apple; DiDi; et al.; Furhat Robotics; Microsoft (2017) . Using prosody to classify discourse relations. 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, 2017-August, 3201-3205.
http://dx.doi.org/10.21437/Interspeech.2017-710
---------- CHICAGO ----------
Kleinhans, J., Farrús, M., Gravano, A., Pérez, J.M., Lai, C., Wanner, L., et al. "Using prosody to classify discourse relations" . 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 2017-August (2017) : 3201-3205.
http://dx.doi.org/10.21437/Interspeech.2017-710
---------- MLA ----------
Kleinhans, J., Farrús, M., Gravano, A., Pérez, J.M., Lai, C., Wanner, L., et al. "Using prosody to classify discourse relations" . 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, vol. 2017-August, 2017, pp. 3201-3205.
http://dx.doi.org/10.21437/Interspeech.2017-710
---------- VANCOUVER ----------
Kleinhans, J., Farrús, M., Gravano, A., Pérez, J.M., Lai, C., Wanner, L., et al. Using prosody to classify discourse relations. Proc. Annu. Conf. Int. Speech. Commun. Assoc., INTERSPEECH. 2017;2017-August:3201-3205.
http://dx.doi.org/10.21437/Interspeech.2017-710