Abstract
The authors propose a system that allows presenters to control presentations in a natural way by their body gestures and vocal commands. Thus a presentation no longer follows strictly a rigid sequential structure but can be delivered in various flexible and content adapted scenarios. Our proposed system fuses three interaction modules: gesture recognition with Kinect 3D skeletal data, key concepts detection by context analysis from natural speech, and small-scaled hand gesture recognition with haptic data from smart phone sensors. Each module can process in realtime with the accuracy of 95.0%, 91.2%, and 90.1% respectively. The system uses events generated from the three modules to trigger pre-defined scenarios in a presentation to enhance the exciting experience for audiences.
Chapter PDF
Similar content being viewed by others
Keywords
References
Adams, R.: Sourcebook of automatic identification and data collection. Van Nostrand Reinhold (1990)
Azimi, M.: Skeletal Joint Smoothing White Paper (accessed August 13, 2012), http://msdn.microsoft.com/en-us/library/jj131429.aspx
Bednarik, R., Vrzakova, H., Hradis, M.: What do you want to do next: a novel approach for intent prediction in gaze-based interaction. In: ETRA, pp. 83–90 (2012)
Hilbe, J.: Logistic regression models. CRC Press, Boca Raton (2009)
Holzinger, A., Softic, S., Stickel, C., Ebner, M., Debevc, M.: Intuitive e-teaching by using combined hci devices: Experiences with wiimote applications. In: Stephanidis, C. (ed.) UAHCI 2009, Part III. LNCS, vol. 5616, pp. 44–52. Springer, Heidelberg (2009)
Jaimes, A., Sebe, N.: Multimodal Human-Computer Interaction: A survey. Computer Vision and Image Understanding 108(1-2), 116–134 (2007)
Juang, B.H., Rabiner, L.R.: Automatic speech recognition - a brief history of the technology development. Elsevier Encyclopedia of Language and Linguistics (2005)
Jurafsky, D., Martin, J.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice Hall Series in Artificial Intelligence. Pearson Prentice Hall (2009)
Karray, F., Alemzadeh, M., Saleh, J.A., Arab, M.N.: Human-Computer Interaction: Overview on State of the Art. International Journal on Smart Sensing and Intelligent Systems 1(1), 137–159 (2008)
Lou, Y., Wu, W., Zhang, H., Zhang, H., Chen, Y.: A multi-user interaction system based on kinect and wii remote. In: ICME Workshops, p. 667 (2012)
Moeller, J., Kerne, A.: Zerotouch: an optical multi-touch and free-air interaction architecture. In: CHI, pp. 2165–2174 (2012)
Myers, C., Rabiner, L.: A Comparative Study of Several Dynamic Time-Warping Algorithms for Connected-Word Recognition. The Bell System Technical Journal 60(7), 1389–1409 (1981)
Rehm, M., Bee, N., André, E.: Wave like an egyptian: accelerometer based gesture recognition for culture specific interactions. In: Proceedings of the 22nd British HCI Group Annual Conference on People and Computers: Culture, Creativity, Interaction, BCS-HCI 2008. British Computer Society, vol. 1, pp. 13–22 (2008)
Vu, Q., Demuynck, K., Van Compernolle, D.: Vietnamese automatic speech recognition: The fLavor approach. In: Huo, Q., Ma, B., Chng, E.-S., Li, H. (eds.) ISCSLP 2006. LNCS (LNAI), vol. 4274, pp. 464–474. Springer, Heidelberg (2006)
Zhang, T., Wang, J., Liu, P., Hou, J.: Fall detection by embedding an accelerometer in cellphone and using kfd algorithm. International Journal of Computer Science and Network Security 6(10) (October 2006)
Zhang, Z.: Microsoft kinect sensor and its effect. IEEE MultiMedia 19(2), 4–10 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Le, HA., Mac, KN.C., Pham, TA., Nguyen, VT., Tran, MT. (2013). Multimodal Smart Interactive Presentation System. In: Kurosu, M. (eds) Human-Computer Interaction. Interaction Modalities and Techniques. HCI 2013. Lecture Notes in Computer Science, vol 8007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39330-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-39330-3_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39329-7
Online ISBN: 978-3-642-39330-3
eBook Packages: Computer ScienceComputer Science (R0)