Abstract
Voice assistants on mobile devices and smart speakers make it possible for conversational agents to act as storytelling peers for children, especially those with limited proficiency in spelling and grammar. Despite the prevalence of these assistants, however, their built-in automatic speech recognition has been shown to perform poorly on children’s speech, which may affect child-agent interaction. In this paper, we describe our experiments in deploying a conversational storytelling agent on two popular commercial voice interfaces: Google Assistant and Amazon Alexa. Through post-validation feedback from children and analysis of the captured conversation logs, we compare the challenges children encountered when sharing their stories with these voice assistants. We also used the Bilingual Evaluation Understudy (BLEU) to quantitatively assess the speech-to-text transcription quality. We found that the voice assistants’ short waiting time and frequent yet misplaced interruptions during pauses disrupt children’s thinking process. Furthermore, disfluencies and grammatical errors that naturally occur in children’s speech affected the transcription quality.
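As an illustration of how a BLEU score can compare what a child said against what a voice interface transcribed, the following is a minimal sketch and not the authors' evaluation script. It assumes a single human-written reference per utterance, lowercased whitespace tokenization, n-grams up to order 4, and no smoothing; the function names `ngrams` and `bleu` and the sample sentences are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(reference, hypothesis, max_n=4):
    """Sentence-level BLEU of one hypothesis against one reference.

    Unsmoothed: the score is 0.0 if any n-gram order has no match.
    Tokenization (lowercase + whitespace split) is an assumption.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not hyp:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = ngrams(hyp, n)
        ref_counts = ngrams(ref, n)
        total = sum(hyp_counts.values())
        # Clip each hypothesis n-gram count by its count in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        if total == 0 or matched == 0:
            return 0.0
        log_precisions.append(math.log(matched / total))

    # Brevity penalty: penalize hypotheses shorter than the reference.
    bp = 1.0 if len(hyp) > len(ref) else math.exp(1 - len(ref) / len(hyp))
    return bp * math.exp(sum(log_precisions) / max_n)


# Hypothetical example: the transcription drops one word of the story.
spoken = "the dragon flew over the big castle"
transcribed = "the dragon flew over the castle"
score = bleu(spoken, transcribed)
print(round(score, 3))
```

A score of 1.0 means the transcription matches the reference exactly, and lower scores indicate dropped, inserted, or substituted words. Because short utterances often have no matching 4-gram, a practical evaluation would likely apply smoothing or a lower maximum n-gram order.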
Supported by DOST-PCIEERD.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Ureta, J., Brito, C.I., Dy, J.B., Santos, K.A., Villaluna, W., Ong, E. (2020). At Home with Alexa: A Tale of Two Conversational Agents. In: Sojka, P., Kopeček, I., Pala, K., Horák, A. (eds) Text, Speech, and Dialogue. TSD 2020. Lecture Notes in Computer Science, vol 12284. Springer, Cham. https://doi.org/10.1007/978-3-030-58323-1_53
DOI: https://doi.org/10.1007/978-3-030-58323-1_53
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58322-4
Online ISBN: 978-3-030-58323-1
eBook Packages: Computer Science, Computer Science (R0)