Abstract
This article gives an overview of our symbiotic human-robot interaction project, which aims at an autonomous android who behaves and interacts just like a human. A conversational android ERICA is designed to conduct several social roles focused on spoken dialogue, such as attentive listening (similar to counseling) and job interview. Design principles in developing these spoken dialogue systems are described, in particular focused on the attentive listening system. Generation of backchannels, fillers and laughter is also addressed to make human-like conversation behaviors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Glas DF, Minato T, Ishi CT, Kawahara T, Ishiguro H (2016) ERICA: the ERATO intelligent conversational android. In: Proceedings of RO-MAN, pp 22–29
Inoue K, Milhorat P, Lala D, Zhao T, Kawahara T (2016) Talking with ERICA, an autonomous android. In: Proceedings of SIGdial meeting discourse & dialogue, volume Demo. Paper, pp 212–215
Milhorat P, Lala D, Inoue K, Tianyu Z, Ishida M, Takanashi K, Nakamura S, Kawahara T (2017) A conversational dialogue manager for the humanoid robot ERICA. In: Proceedings of international workshop spoken dialogue systems (IWSDS)
Fujie S, Matsuyama Y, Taniyama H, Kobayashi T (2009) Conversation robot participating in and activating a group communication. In: Proceedings of InterSpeech, pp 264–267
Bohus D, Horvitz E (2009) Models for multiparty engagement in open-world dialog. In: Proceedings of SIGdial
Lala D, Milhorat P, Inoue K, Ishida M, Takanashi K, Kawahara T (2017) Attentive listening system with backchanneling, response generation and flexible turn-taking. In: Proceedings of SIGdial meeting discourse & dialogue, pp 127–136
DeVault D, Artstein R, Benn G, Dey T, Fast E, Gainer A, Georgila K, Gratch J, Hartholt A, Lhommet M, Lucas G, Marsella S, Morbini F, Nazarian A, Scherer S, Stratou G, Suri A, Traum D, Wood R, Xu Y, Rizzo A, Morency L-P (2014) SimSensei Kiosk: avirtual human interviewer for healthcare decision support. In: Proceedings of AAMAS
Kobori T, Nakano M, Nakamura T (2016) Small talk improves user impressions of interview dialogue systems. In: Proceedings of SIGDial, pp 370–380
Ranganath R, Jurafsky D, McFarland D (2009) It’s not you, it’s me: detecting flirting and its misperception in speed-dates. In: Proceedings of EMNLP
Ueno S, Inaguma H, Mimura M, Kawahara T (2018) Acoustic-to-word attention-based model complemented with character-level CTC-based model. In: Proceedings of IEEE-ICASSP, pp 5804–5808
Levitan R, Hirschberg J (2011) Measuring acoustic-prosodic entrainment with respect to multiple levels and dimensions. In: Proceedings of InterSpeech, pp 3081–3085
Xiao B, Georgiou PG, Imel ZE, Atkins D, Narayanan S (2013) Modeling therapist empathy and vocal entrainment in drug addiction counseling. In: Proceedings of InterSpeech, pp 2861–2864
Kitaoka N, Takeuchi M, Nishimura R, Nakagawa S (2005) Response timing detection using prosodic and linguistic information for human-friendly spoken dialog systems. J Jpn Soc Artif Intell 20(3):220–228
Kitaoka N, Takeuchi M, Nishimura R, Nakagawa S (2005) Response timing detection using prosodic and linguistic information for human-friendly spoken dialog systems. J Jpn Soc Artif Intell 20(3):220–228
Kawahara T, Uesato M, Yoshino K, Takanashi K (2015) Toward adaptive generation of backchannels for attentive listening agents. In: Proceedings of international workshop spoken dialogue systems (IWSDS)
Kawahara T, Yamaguchi T, Inoue K, Takanashi K, Ward N (2016) Prediction and generation of backchannel form for attentive listening systems. In: Proceedings of INTERSPEECH, pp 2890–2894
Ward N (1996) Using prosodic clues to decide when to produce back-channel utterances. In: Proceedings of ICSLP, pp 1728–1731
Ward N, Tsukahara W (2000) Prosodic features which cue back-channel responses in English and Japanese. J Pragmat 32(8):1177–1207
Koiso H, Horiuchi Y, Tutiya S, Ichikawa A, Den Y (1998) An analysis of turn-taking and backchannels based on prosodic and syntactic features in Japanese map task dialogs. Lang Speech 41(3–4):295–321
Andersson S, Georgila K, Traum D, Aylett M, Clark RAJ (2010) Prediction and realisation of conversational characteristics by utilising spontaneous speech for unit selection. In: Proceedings of speech prosody
Nakanishi R, Inoue K, Nakamura S, Takanashi K, Kawahara T (2018) Generating fillers based on dialog act pairs for smooth turn-taking by humanoid robot. In: Proceedings of international workshop spoken dialogue systems (IWSDS)
Turker BB, Bucinca Z, Erzin E, Yemez Y, Sezgin M (2017) Analysis of engagement and user experience with a laughter responsive social robot. In: Proceedings of InterSpeech, pp 844–848
Skantze G, Hjalmarsson A, Oertel C (2014) Turn-taking, feedback and joint attention in situated human-robot interaction. Speech Commun 65:50–66
Inoue K, Lala D, Takanashi K, Kawahara T (2018) Latent character model for engagement recognition based on multimodal behaviors. In: Proceedings of international workshop spoken dialogue systems (IWSDS)
Acknowledgements
This work was supported by JST ERATO Ishiguro Symbiotic Human-Robot Interaction program (Grant Number JPMJER1401), Japan.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kawahara, T. (2019). Spoken Dialogue System for a Human-like Conversational Robot ERICA. In: D'Haro, L., Banchs, R., Li, H. (eds) 9th International Workshop on Spoken Dialogue System Technology. Lecture Notes in Electrical Engineering, vol 579. Springer, Singapore. https://doi.org/10.1007/978-981-13-9443-0_6
Download citation
DOI: https://doi.org/10.1007/978-981-13-9443-0_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9442-3
Online ISBN: 978-981-13-9443-0
eBook Packages: Literature, Cultural and Media StudiesLiterature, Cultural and Media Studies (R0)