Bimodal Emotion Recognition Based on Speech Signals and Facial Expression

Tu, Binbin; Yu, Fengqin

doi:10.1007/978-3-642-25664-6_81

Binbin Tu⁴ &
Fengqin Yu⁴

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 122))

1720 Accesses
2 Citations

Abstract

Voice signals and facial expression changes are synchronized under the different emotions, the recognition algorithm based audio-visual feature fusion is proposed to identify emotional states more accurately. Prosodic features were extracted for speech emotional features, and local Gabor binary patterns were adopted for facial expression features. Two types of features were modeled with SVM respectively to obtain the probabilities of anger, disgust fear, happiness, sadness and surprise, and then fused the probabilities to gain the final decision. Simulation results demonstrate that the average recognition rates of the single modal classifier based on speech signals and based on facial expression reach 60% and 57% respectively, while the multimodal classifier with the feature fusion of speech signals and facial expression achieves 72%.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Audiovisual emotion recognition in wild

Article Open access 19 July 2018

Fusing facial and speech cues for enhanced multimodal emotion recognition

Article 24 January 2024

Multi-modal Emotion Recognition Based on Speech and Image

Keywords

References

Jinjing, X., Yiqiang, C., Junfa, L.: Multi-expression Facial Animation based on Speech Emotion Recognition. Journal of Computer-aided Design & Computer Graphics 20(4), 520–525 (2008)
Google Scholar
Kapoor, A., Picard, R.W.: Multimodal Affect Recognition in Learning Environments. In: Proc. of the 13th Annual International Conference on Multimedia, Singapore, pp. 677–682 (2005)
Google Scholar
Danning, J., Lianhong, C.: Speech Emotion Recognition using Acoustic Features. J. Tsinghua Univ (Sci. & Tech.) 46(1), 86–89 (2006)
Google Scholar
Koolagudi, S.G., Nandy, S., Rao, K.S.: Spectral Features for Emotion Classification. In: 2009 IEEE International Advance Computing Conference, Patiala, pp. 1292–1296 (2009)
Google Scholar
Ojala, T., Pietikainen, M., Maenpaa, T.: Multi-resolution Gray-scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(7), 971–987 (2002)
Article Google Scholar
Ahonen, T., Hadid, A., Pietikainen, M.: Face Description with Local Binary Patterns: Application to Face Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 28(12), 2037–2041 (2006)
Article Google Scholar
Wenchao, Z., Shiguang, S., Hongming, Z.: Histogram Sequence of Local Gabor Binary Pattern for Face Description and Identification. Journal of Software 17(12), 2508–2517 (2006)
Article MATH Google Scholar
Kittler, J., Hatef, M., Duin, R.P.: On Combining Classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(3), 226–239 (1998)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Internet of Things Engineering, Jiangnan University, Wuxi, 214122, China
Binbin Tu & Fengqin Yu

Authors

Binbin Tu
View author publications
You can also search for this author in PubMed Google Scholar
Fengqin Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240, Shanghai, China
Yinglin Wang
School of Information Science and Technology, Southwest Jiaotong University, 610031, Chengdu, Sichuan Province, China
Tianrui Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tu, B., Yu, F. (2011). Bimodal Emotion Recognition Based on Speech Signals and Facial Expression. In: Wang, Y., Li, T. (eds) Foundations of Intelligent Systems. Advances in Intelligent and Soft Computing, vol 122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25664-6_81

Download citation

DOI: https://doi.org/10.1007/978-3-642-25664-6_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25663-9
Online ISBN: 978-3-642-25664-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Bimodal Emotion Recognition Based on Speech Signals and Facial Expression

Abstract

Chapter PDF

Similar content being viewed by others

Audiovisual emotion recognition in wild

Fusing facial and speech cues for enhanced multimodal emotion recognition

Multi-modal Emotion Recognition Based on Speech and Image

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Bimodal Emotion Recognition Based on Speech Signals and Facial Expression

Abstract

Chapter PDF

Similar content being viewed by others

Audiovisual emotion recognition in wild

Fusing facial and speech cues for enhanced multimodal emotion recognition

Multi-modal Emotion Recognition Based on Speech and Image

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation