Abstract
Sarcasm detection is one the most challenging task in natural language processing. Though sentiment semantics are necessary to improve sarcasm detection performance, existing DL-based sarcasm detection models do not fully incorporate them. This research suggested the Hybrid RNN and Optimized LSTM for Multimodal Sarcasm Detection (HROMSD) model. The model is processed under the four stages: preprocessing, feature extraction, feature level fusion, and classification. The initial stage of this proposed technique is preprocessing, here input of the multimodal data, which comprises of text, video, and audio are preprocessed. Here, the text will be preprocessed under tokenization and stemming, the video will be preprocessed under face detection and the audio will be preprocessed under filtering technique. Then, the stage of feature extraction takes place, where the features from preprocessed text, video, and audio are extracted, here, n-grams, TF-IDF, improved Bag of Visual Words, and emojis are extracted as the text features; then CLM and improved SLBT based video features are extracted from the video features, and chroma, MFCC, jitter and special features are extracted from the audio features. The resultant extracted features set are subjected for feature level fusion stage, which makes use of an improved multilevel CCA fusion technique. The classification is carried out using Hybrid RNN and Optimized LSTM for detection purpose, where Improved BES (IBES) method utilized to increase the detection system’s performance. When compared to earlier research, the proposed work is more accurate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Abbreviations
- NLP:
-
Natural language processing
- DL:
-
Deep learning
- ML:
-
Machine learning
- RF:
-
Random forest
- CCA:
-
Canonical correlation analysis
- IAC:
-
Internet argument corpus
- LSTM:
-
Long-short term memory
- CNN:
-
Convolutional neural network
- RNN:
-
Recurrent neural network
- CUU:
-
Context understanding unit
- AOA:
-
Arithmetic optimization algorithm
- BES:
-
Bald eagle search
References
Pandey R, Kumar A, Singh JP, Tripathi S (2021) Hybrid attention-based Long Short-Term Memory network for sarcasm identification. Appl Soft Comput 106:107348
Kumar A, Narapareddy VT, Srikanth VA, Malapati A, Neti LBM (2020) Sarcasm detection using multi-head attention based bidirectional LSTM. Ieee Access 8:6388–6397
Zhang Y, Liu Y, Li Q, Tiwari P, Wang B, Li Y, Pandey HM, Zhang P, Song D (2021) CFN: a complex-valued fuzzy network for sarcasm detection in conversations. IEEE Trans Fuzzy Syst 29(12):3696–3710
Razali MS, Halin AA, Ye L, Doraisamy S, Norowi NM (2021) Sarcasm detection using deep learning with contextual features. IEEE Access 9:68609–68618
Eke CI, Norman AA, Shuib L (2021) Context-based feature technique for sarcasm identification in benchmark datasets using deep learning and BERT model. IEEE Access. 9:48501–48518
Jain D, Kumar A, Garg G (2020) Sarcasm detection in mash-up language using soft-attention based bi-directional LSTM and feature-rich CNN. Appl Soft Comput 91:106198
Chia ZL, Ptaszynski M, Masui F, Leliwa G, Wroczynski M (2021) Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection. Inf Process Manage 58(4):102600
Zhu N, Wang Z (2020) The paradox of sarcasm: theory of mind and sarcasm use in adults. Personality Individ Differ 163:110035
Banerjee A, Bhattacharjee M, Ghosh K, Chatterjee S (2020) Synthetic minority oversampling in addressing imbalanced sarcasm detection in social media. Multimedia Tools Appl 79(47):35995–36031
Potamias RA, Siolas G, Stafylopatis AG (2020) A transformer-based approach to irony and sarcasm detection. Neural Comput Appl 32(23):17309–17320
Ren L, Xu B, Lin H, Liu X, Yang L (2020) Sarcasm detection with sentiment semantics enhanced multi-level memory network. Neurocomputing 401:320–326
Hiremath BN, Patil MM (2021) Sarcasm detection using cognitive features of visual data by learning model. Expert Syst Appl 184:115476
Ren L, Lin H, Xu B, Yang L, Zhang D (2021) Learning to capture contrast in sarcasm with contextual dual-view attention network. Int J Mach Learn Cybern 12(9):2607–2615
Nawaf Hazim B, Al-Dabbagh SS, Esam Matti WM, Naser AS (2016) Face detection and recognition using viola-jones with PCA-LDA and square euclidean distance. (IJACSA) Int J Adv Comput Sci Appl 7(5)
Kim D, Seo D, Cho S, Kang P (2018) Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec. Inf Sci
Chunping C, Yan Zhu L (2021) A semi-supervised deep learning image caption model based on Pseudo Label and N-gram. Int J Approximate Reasoning 131:93–107
Lakshmiprabha NS, Majumder S (2012) Face recognition system invariant to plastic surgery. In: 2012 12th international conference on intelligent systems design and applications (ISDA),pp 258–263
Chia Ai O, Hariharan M, Sin Chee L (2012) Classification of speech dysfluencies with MFCC and LPCC features. Expert Syst Appl 39(2):2157–2165
Ted Kronvall M, Juhlin A, Jakobsson A, Sparse modeling of chroma features. Signal Process 130:105–117
Kavitha M, Gayathri R, Alenezi F (2022) Performance evaluation of deep e-CNN with integrated spatial-spectral features in hyperspectral image classification. Measurement 191
Gill HS, Khehra BS (2022) An integrated approach using CNN-RNN-LSTM for classification of fruit images. Mater Today: Proc 51:591–595
Ji J, Chen B, Jiang H (2020) Fully-connected LSTM–CRF on medical concept extraction. Int J Mach Learn Cybern 11(9):1971–1979
Alsattar HA, Zaidan AA, Zaidan BB (2020) Novel meta-heuristic bald eagle search optimisation algorithm. Artif Intell Rev 53(3):2237–2264
Furqan M, Hartono H, Ongko E, Ikhsan M (2017) Performance of arithmetic crossover and heuristic crossover in genetic algorithm based on alpha parameter. IOSR J Comput Eng (IOSR-JCE) 19(1):31–36
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bavkar, D., Kashyap, R., Khairnar, V. (2023). Deep Hybrid Model with Trained Weights for Multimodal Sarcasm Detection. In: Ranganathan, G., Papakostas, G.A., Rocha, Á. (eds) Inventive Communication and Computational Technologies. ICICCT 2023. Lecture Notes in Networks and Systems, vol 757. Springer, Singapore. https://doi.org/10.1007/978-981-99-5166-6_13
Download citation
DOI: https://doi.org/10.1007/978-981-99-5166-6_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-5165-9
Online ISBN: 978-981-99-5166-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)