Abstract
Understanding human emotion is vital to communicate effectively with others, monitor patients, analyse behaviour, and keep an eye on those who are vulnerable. Emotion recognition is essential to achieve a complete human-machine interoperability experience. Artificial intelligence, mainly machine learning (ML), have been used in recent years to improve the model for recognising emotions from a single type of data. A multimodal system has been proposed that uses text, facial expressions, and speech signals to identify emotions in this work. The MobileNet architecture is used to predict emotion from facial expressions, and different ML classifiers are used to predict emotion from text and speech signals in the proposed model. The Facial Expression Recognition 2013 (FER2013) dataset has been used to recognise emotion from facial expressions, whilst the Interactive Emotional Dyadic Motion Capture (IEMOCAP) dataset was used for both text and speech emotion recognition. The proposed ensemble technique consisting of random forest, extreme gradient boosting, and multi-layer perceptron achieves an accuracy of 70.67%, which is better than the unimodal approaches used.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Al Banna MH et al (2021) Attention-based bi-directional long-short term memory network for earthquake prediction. IEEE Access 9:56589–56603
Al Nahian MJ, Ghosh T et al (2020) Towards artificial intelligence driven emotion aware fall monitoring framework suitable for elderly people with neurological disorder. In: Proceeding of Brain Information, pp 275–286
Al Nahian MJ et al (2021) Towards an accelerometer-based elderly fall detection system using cross-disciplinary time series features. IEEE Access 9:39413–31
Basu S, Chakraborty J, Aftabuddin M (2017) Emotion recognition from speech using convolutional neural network with recurrent neural network architecture. In: Proceeding of ICCES, pp 333–336
Bertero D, Fung P (2017) A first look into a convolutional neural network for speech emotion detection. In: Proceeding of ICASSP, pp 5115–5119
Biswas M, Tania MH, Kaiser MS et al (2021) ACCU3RATE: a mobile health application rating scale based on user reviews. PloS one 16(12):e0258050
Biswas M et al (2021) An xai based autism detection: the context behind the detection. In: Proceeding of Brain Information, pp 448–459
Choi WY, Song KY, Lee CW (2018) Convolutional attention networks for multimodal emotion recognition from speech and text data. In: Proceeding of challenge-HML, pp 28–34
Deepa B et al (2022) Pattern descriptors orientation and map firefly algorithm based brain pathology classification using hybridized machine learning algorithm. IEEE Access 10:3848–3863
Fabietti M, Mahmud M, Lotfi A (2021) Anomaly detection in invasively recorded neuronal signals using deep neural network: effect of sampling frequency. In: Proceeding of AII, pp 79–91 (2021)
Fabietti M, Mahmud M, Lotfi A (2022) Channel-independent recreation of artefactual signals in chronically recorded local field potentials using machine learning. Brain Inform 9(1):1–17
Fabietti M et al (2020) Artifact detection in chronically recorded local field potentials using long-short term memory neural network. Proceeding AICT 2020:1–6
Faria TH et al (2021) Smart city technologies for next generation healthcare. In: Data-driven mining, learning and analytics for secured smart cities, pp 253–274
Ghosh T et al (2021) Artificial intelligence and internet of things in screening and management of autism spectrum disorder. Sustain Cities Soc 74:103189
Ghosh T et al (2021) An attention-based mood controlling framework for social media users. In: Proceeding of brain information, pp 245–256
Ghosh T et al (2021) A hybrid deep learning model to predict the impact of covid-19 on mental health form social media big data. Preprints 2021(2021060654)
Herzig J et al (2017) Emotion detection from text via ensemble classification using word embeddings. In: Proceeding ICTIR, pp 269–272
Jain DK, Shamsolmoali P, Sehdev P (2019) Extended deep neural network for facial emotion recognition. Pattern Recognit Lett 120:69–74
Jain N et al (2018) Hybrid deep neural networks for face emotion recognition. Pattern Recognit. Lett. 115:101–106
Kumar I et al (2022) Dense tissue pattern characterization using deep neural network. Cogn Comput 1–24 (2022) [ePub ahead of print]
Lalotra GS, Kumar V, Bhatt A, Chen T, Mahmud M (2022) Iretads: an intelligent real-time anomaly detection system for cloud communications using temporal data summarization and neural network. Secur Commun Netw 2022:9149164
Mahmud F, Islam B, Hossain A, Goala PB (2018) Facial region segmentation based emotion recognition using k-nearest neighbors. In: Proceeding ICIET, pp 1–5 (2018)
Mahmud M, Kaiser MS, McGinnity TM, Hussain A (2021) Deep learning in mining biological data. Cognit Comput 13(1):1–33
Mahmud M et al (2018) Applications of deep learning and reinforcement learning to biological data. IEEE Trans Neural Netw Learn Syst 29(6):2063–2079
Mammoottil MJ, Kulangara LJ, Cherian AS, Mohandas P, Hasikin K, Mahmud M (2022) Detection of breast cancer from five-view thermal images using convolutional neural networks. J Healthc Eng 2022:4295221
Nawar A, Toma NT, Al Mamun S et al (2021) Cross-content recommendation between movie and book using machine learning. In: Proceeding AICT, pp 1–6
Patwardhan AS (2017) Multimodal mixed emotion detection. In: Proceeding of ICCES, pp 139–143
Paul A et al (2022) Inverted bell-curve-based ensemble of deep learning models for detection of covid-19 from chest x-rays. Neural Comput Appl 1–15
Prakash N et al (2021) Deep transfer learning covid-19 detection and infection localization with superpixel based segmentation. Sustain Cities Soc 75:103252
Satt A, Rozenberg S, Hoory R (2017) Efficient emotion recognition from speech using deep learning on spectrograms. In: Interspeech, pp 1089–1093
Satu M et al (2020) Towards improved detection of cognitive performance using bidirectional multilayer long-short term memory neural network. In: Proceeding of Brain Information, pp 297–306
Satu MS et al (2021) Tclustvid: a novel machine learning classification model to investigate topics and sentiment in covid-19 tweets. Knowl-Based Syst 226:107126
Seal D, Roy UK, Basak R (2020) Sentence-level emotion detection from text based on semantic rules. In: Information and communication technology for sustainable development, pp 423–430
Sebastian J, Pierucci P et al (2019) Fusion techniques for utterance-level emotion recognition combining speech and transcripts. In: Interspeech, pp 51–55
Shrivastava K, Kumar S, Jain DK (2019) An effective approach for emotion detection in multimedia text data using sequence based convolutional neural network. Multimed Tools Appl 78(20):29607–29639
Shu L, Xie J, Yang M, Li Z, Li Z, Liao D, Xu X, Yang X (2018) A review of emotion recognition using physiological signals. Sensors 18(7):2074
Watkins J, Fabietti M, Mahmud M (2020) Sense: a student performance quantifier using sentiment analysis. In: Proceeding of IJCNN, pp 1–6
Acknowledgements
MM and MSK are supported by the DIVERSASIA project (618615-EPP-1-2020-1-UKEPPKA2-CBHEJP) funded by the European Commission under the Erasmus+ programme.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shahriar, M.F., Arnab, M.S.A., Khan, M.S., Rahman, S.S., Mahmud, M., Kaiser, M.S. (2023). Towards Machine Learning-Based Emotion Recognition from Multimodal Data. In: Mandal, J.K., De, D. (eds) Frontiers of ICT in Healthcare . Lecture Notes in Networks and Systems, vol 519. Springer, Singapore. https://doi.org/10.1007/978-981-19-5191-6_9
Download citation
DOI: https://doi.org/10.1007/978-981-19-5191-6_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5190-9
Online ISBN: 978-981-19-5191-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)