RETRACTED ARTICLE: Multilingual hate speech detection sentimental analysis on social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network

Kar, Purbani; Debbarma, Swapan

doi:10.1007/s11227-023-05361-6

RETRACTED ARTICLE: Multilingual hate speech detection sentimental analysis on social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network

Published: 30 May 2023

Volume 79, pages 19515–19546, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

The Journal of Supercomputing Aims and scope Submit manuscript

RETRACTED ARTICLE: Multilingual hate speech detection sentimental analysis on social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network

Download PDF

Purbani Kar¹ &
Swapan Debbarma¹

619 Accesses
4 Citations
Explore all metrics

This article was retracted on 20 March 2024

This article has been updated

Abstract

Many activities are conducted on social media platforms, such as promoting products, sharing news and sharing achievements. As a result of users’ freedom and anonymity on social media platforms, hate speech and harassment are common. Posts that spread hate and offense should be detected and deleted as soon as possible as they spread very quickly and have many negative consequences for the human race. The detection task becomes significantly more difficult when online users use code-mixed text in places where English is not the primary language. The problem of hate speech detection has recently been reduced to a binary classification task, without taking into account its topical focus and its target-oriented nature. Because there is no combined annotated dataset and scientific study that can provide insight into the relationship between offense traits, existing techniques usually only examine one or two offense traits at a time. Furthermore, these techniques are not efficient for multilingual, where most conversations are code-mixed. In this paper, we propose an optimal feature extraction and hybrid diagonal gated recurrent neural network (FE-DGRNN) for hate speech detection and sentiment analysis in multilingual code-mixed texts. The proposed FE-DGRNN technique consists threefold processes. After preprocessing, we first introduce an improved seagull optimization (ISO) algorithm for multiple features extraction from given code-mixed texts. Then, we utilize a quantum search optimization algorithm to optimize the extracted features which reduces the data dimensionality issues in further detection phase. Next, a hybrid diagonal gated recurrent neural network (Hyb-DGRNN) introduces to detect hate speech and analyzes sentiment on their language. In order to validate the effectiveness of our proposed FE-DGRNN technique, we conducted experiments on the HASOC 2019 dataset. This dataset includes posts written in English, Hindi and German, allowing us to evaluate the performance of our approach across multiple languages. From the simulation results, we observed that the accuracy of FE-DGRNN is 87.74%, 88.98% and 84.74% for Task-1, Task-2 and Task-3, respectively, for multilingual code-mixed texts dataset. Overall, the proposed FE-DGRNN technique shows a significant improvement in accuracy, precision, recall and F-measure compared to other classifiers, indicating its potential to be a robust and effective tool for hate speech detection and sentimental analysis in multilingual code-mixed texts.

Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

Hate speech recognition in multilingual text: hinglish documents

Article 13 March 2023

Online Hate Speech Identification Using Fine-tuned ALBERT

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Social media as a form of communication have seen tremendous growth in its size and importance over the past decade [1]. The idea of web-based entertainment is that anybody can post anything they need, anyplace, be it enlightening, unfriendly, etc. Contingent upon the gathering, such posts can be seen by millions. Various discussions have various meanings of unseemly substance and strategies for recognizing it; however, the degree of the medium’s mechanized techniques is a significant piece of the gig. A significant part of this unseemly substance is disdain discourse [2, 3]. There is no single meaning of can't stand discourse, but it is a subjective and complex term. Regardless of the wording or definition of the problem, it is clear that automated methods are needed to detect hate speech in some cases. It is important that the methods used in such cases are accurate, efficient and effective. One of the negative consequences of free speech is hate speech shared by irresponsible people. Disdain discourse is by and large characterized as any correspondence that sabotages an individual or gathering with explicit qualities, for example, skin tone, race, orientation, sexual direction, identity and different attributes [4, 5]. Hate speech is also defined as specific offensive language that uses attitudes about a social group to express hateful ideology. Based on the two experts’ definitions, we can conclude that hate speech is any form of communication that accuses, undermines and insults a person or persons.

Web-based entertainment stages, for example, Facebook and Twitter, have been scrutinized for not doing what’s needed to stop disdain discourse on their locales and have been compelled to make a move against disdain discourse [6]. Governments around the world have enacted strict laws to control and prohibit hate speech, and are within their jurisdiction to enforce such policies. The Indian government screens virtual entertainment content to forestall the spread of malevolent data and checks disdain discourse on the web by frequently disrupting internet service and blocking access to those sites. In addition, the government has already introduced legislation to expand the Anti-Terrorism Act to cover cyberspace to disallow the scattering of any psychological militant or revolting data [7]. In any event, for people, it is not easy to determine whether a text contains hate speech. Voluntary assessment of hate speech introduces not only time but also subjective perception of the structure of hate speech. Most research on social media defines hate speech as language that incites violence or hatred against groups based on specific characteristics such as race, ethnicity, religion, political opinions, physical appearance and gender. However, there are other types of speech similar to hate speech, but with different degrees or consequences [8, 9]. An example of this type of speech is insulting speech that offends someone.

Recently, a lot of research has been done in recent years to develop automatic methods for detecting hate speech in social media domains. These typically use semantic content analysis methods built on Regular Language Handling (NLP) [10, 11], AI and profound learning procedures [12], which are the primary mainstays of Semantic Web research. The undertaking included characterizing text as aversive or aversive. A group learning approach [13] in view of various component spaces for contemptuous discourse acknowledgment gains a model from various deliberations of the issue, for example, impartial inclination assessment measures. To address the challenge of identifying hate speech in Spanish language used in virtual entertainment [14]. They also seek to provide a comprehensive understanding of the capabilities of new AI technologies in this domain. A three-class occasion-based approach [15] is utilized to identify bigoted disdain discourse in Russian virtual entertainment texts and to battle bigoted disdain discourse all the more really with another three-class approach. Cooperative learning models [16] utilize different multilingual portrayals to move information between sets of dialects. Connected with Bayesian grouping is the term backwards report recurrence (TF-IDF) model [17]. A standard-based bunching technique is utilized to order live tweets into fitting subject gatherings naturally. A profound learning-based strategy joins back interpretation and rewording procedures for information support. The pipeline analyzes different word installing-based structures for characterizing derisive discourse [18]. The back interpretation procedure depends on an encoder–decoder design pre-prepared on an enormous corpus and is frequently utilized for machine interpretation. Bunch-based and individual insights [19] have proposed disdain discourse discovery fit for preparing multimodal models tuned to individual or social profiles. An encoder–decoder-based AI model [20] is utilized to order client’s Bengali remarks from Facebook pages. Current work was finished to work on the precision and understanding of the recognition.

1.1 Our contributions

An optimal feature extraction and hybrid diagonal gated recurrent neural network is proposed for hate speech detection and sentimental analysis (FE-DGRNN) on multi-language code-mixed texts. The proposed FE-DGRNN technique consists threefold processes which describes as follows:

1.
An improved seagull optimization (ISO) algorithm for multiple features extraction from given code-mixed texts.
2.
Then, we utilize a quantum search optimization algorithm to optimize the extracted features which reduces the data dimensionality issues in further detection phase.
3.
Next, a hybrid diagonal gated recurrent neural network (Hyb-DGRNN) introduces to detect hate speech and analyzes sentiment on their language.
4.
Finally, we validate the performance of our FE-DGRNN technique via HASOC 2019 dataset, which comprises of posts in English, Hindi and German.
5.
Simulation results of FE-DGRNN technique are compared with existing techniques as far as exactness, accuracy, review, particularity and F-score.

The remainder of the paper is coordinated as follows: into segments. 2, we portray a writing survey on disdain discourse acknowledgment and characterization. The problem methodology and system design of our proposed technique are discussed in Sect. 3. In Sect. 4, we describe the detailed working process of the proposed technique. Simulation results and comparative analysis of the proposed and existing hate speech recognition models are described in Sect. 5. Finally, the work block is done. 6.

2 Related works

Over the past few years, many researches have been proposed for hateful speech recognition and classification using hybrid deep learning techniques around the world. The literature is grouped under various topics and listed in Table 1.

Table 1 Summary of research gaps

Full size table

An English corpus of South African tweets was assessed utilizing different AI procedures to distinguish hostile and can't stand discourse [21]. The results show that the optimized support vector machine with n-gram characters performed well in detecting hateful speech with a true positive rate of 0.894, while the optimized slope augmentation with n-gram words showed a true positive rate of 0.867. In any case, they perform ineffectively in distinguishing other gamble classes. The staggered meta-learning model accomplished entirely steady and predictable arrangement execution with genuine positive paces of 0.858 and 0.887 for disdain discourse and hostile discourse, genuine positive paces of 0.646 with the expectation of complimentary discourse and in general precision of 0.671. Three sorts of text arrangement techniques, ELMo, BERT and CNN, have been proposed to distinguish disdainful discourse, and they have moved along performance from both approaches combined [22]. A combination of the classification results of ELMo, BERT and CNN is used to combine the order aftereffects of the three CNN classifiers with various boundaries. The outcomes show that combination handling is a feasible method for further developing discourse acknowledgment execution. Achieving a practical significance of efficiency with minimum additional cost is considered desirable.

A mechanized framework is created utilizing the profound convolutional neural network (DCNN) [23] which uses the tweet text with GloVe embedding vector to get the tweets’ semantics with the help of convolution movement and achieved the precision, survey and F1-score regard as 0.97, 0.88 and 0.92 independently. The DCNN model requires just tweet text as contribution for expectation; consequently, it lessens the above manual component extraction process. This framework accomplished great forecast exactness on unequal datasets and outflanked existing models. The progress made by consistently solving these problems is analyzed for sentiment analysis of tweets to determine their intensity. A comprehensive dataset that detects hate speech based on sentiment analysis includes Urdu tweets. To improve the performance of dynamic word filtering using sentiment classifiers, Variable Worldwide Element Choice Plan (VGFSS) and engineered minority enhancement strategy (Destroyed) [24] are utilized to deal with sparsity, dimensionality and class lopsidedness, individually. Experiments use two machine learning algorithms to test performance.

To test the possibility of consequently distinguishing racial oppressor disdain discourse on Twitter utilizing profound learning and normal language handling strategies. A bidirectional long-transient memory (BiLSTM) model [25] in which space explicit word embeddings are separated from a white-predominant corpus to catch the significance of white-prevailing shoptalk and code words. It uses bidirectional encoder representation from the transformer (BERT) to detect malicious speech. The BiLSTM model achieved an F1-score of 0.75 and the BERT F1-score of 0.80. By examining a set of text mining features, we specifically address different types of hate to accurately predict their different patterns. Two distinct attributes are identified for problem compatibility. Primary characteristics include the most commonly used useful characteristics of related studies. Latent semantic analysis (LSA) [26] is utilized to diminish the dimensionality, which decreases exceptionally perplexing and non-straight models and Feline Lift functions admirably. A non-direct model was utilized for order and the most famous and high level AI model Feline Lift performed well on all datasets.

An automatic obnoxious speech recognition framework in light of metaheuristic technique is proposed for improved results in obnoxious speech recognition problems. Ant lion optimization (ALO) and moth flame optimization (MFO) algorithms [27] have been designed for hateful speech recognition problems. An effective rep program and flexible exercise routine are designed for this. Multiple activities can be easily combined into an exercise program that is optimally designed together. Natural language processing (NLP) measures are taken. Feature extraction was done using bag of words (BoW) and document vector (Word2Vec). A multivariate approach uses shared information to identify disdain discourse in Spanish tweets utilizing a notable transformer-based model [28]. The outcomes showed that the mix of extremity and feeling intelligence enabled more accurate identification of hateful speech across all datasets. Experiments on two benchmark corpora show the viability of this way to deal with accomplish better execution than the STLBETO model. The model’s performance and results from descriptive knowledge transfer analysis, polarity and emotion classification tasks suggest that transducer-based models can more accurately classify aversive speech by increasing emotional knowledge. A profound perform various tasks learning (MTL) structure is proposed to use helpful data from related different grouping errands to further develop individual undertaking execution. The SP-MTL model [29] uses two element spaces for each assignment: One is to store divided highlights among these connected errands through joint preparation, and the other is to catch subordinate elements. The multitask model depends on a common confidential plan, which gives shared and confidential layers to catch shared elements and errand explicit highlights from the five order undertakings. Probes five datasets show that the MTL structure accomplishes better execution regarding full-scale F1 and weighted F1. Hate speech detection approaches have been proposed to distinguish disdain discourse against weak minorities in web-based entertainment. The Flash dispersed handling system consequently separates and preprocesses posts utilizing word installing methods, for example, Word n-grams and Word2Vec and removes highlights. Profound learning calculations like Gated Intermittent Unit (GRU) and Repetitive Brain Organization (RNN) [30] are utilized to distinguish and order aversive discourse. Disdain words are joined with Word2Vec to assess the derisive ethnic gathering (Fig. 1).

3 Problem methodology and system design

3.1 Problem methodology

A model-based approach [31] is proposed for false speech recognition, in which models are extracted from a training set and parameterized in a practical manner to optimize model collection. The pattern-based approach automatically detects annoying speech patterns and the most common unigrams and tweets. Look for hateful and abusive words and phrases and other emotion-based elements of hate speech. Collections of unigrams and patterns can be used as a pre-constructed vocabulary for future work in detecting aversive speech. The model-based approach achieves 87.4% accuracy for binary classification of tweets and 78.4% accuracy for ternary classification of hateful, offensive and sterile tweets. Major social platforms are currently investing significant resources into automatically identifying and classifying hateful content. Also, supervised approaches achieve almost perfect performance, but only on some datasets, most of which are in the English language. Distinguishing disdainful discourse is troublesome in light of the fact that it includes handling text and grasping setting. Disdain discourse informational indexes are typically not spotless, so characterization calculations should handle them prior to identifying can't stand discourse. For specific undertakings, for example, recognizing disdain discourse, different AI models have various qualities [33]. A few models are more exact, while others are more effective. It means a lot to utilize various models and contrast their exhibition with find the best model to identify can't stand discourse. As pre-preparing techniques have become famous as of late, it is vital to test whether they perform well with ill-disposed discourse acknowledgment calculations and to test whether antagonistic discourse acknowledgment models can be utilized to address area changes [34]. Also, the number of derogatory comments has also increased. As a result, hate speech has sparked interest in the topic of sentiment analysis. Various algorithms developed to detect sentiment in social networks using intuitive means. The collected research problems are solved using our proposed hybrid deep learning technique which includes the following research objectives.

1.
The main objective of our proposed model is to automatically identify hate speech for large social media data.
2.
We concentrate to extract the multiple hidden features from the given preprocessed data.
3.
To introduce optimization algorithm for optimal feature selection, this reduces the data dimensionality issues.
4.
To propose hybrid classifier for automatic detection of same speech to improve detection accuracy to maintain the pa-measure.

3.2 System design of proposed work

In this part, we present the principal parts of our proposed strategy to distinguish disdain discourse in virtual entertainment. As an initial step, the tweets pass through various preprocessing steps to clean and prepare the data for the training phase. Then, the multiple hidden features have been extracted from the tweets and we applied optimal feature selection algorithm to get optimal best features for disdain discourse identification. Then, crossover classifiers are utilized to distinguish can't stand discourse in tweets. Finally, we validate our model using the HASOC 2019 dataset, which contains posts in English, Hindi and German.

4 Proposed methodology

In this segment, we portray the functioning system of our proposed FE-DGRNN method including pre-handling, highlight extraction, ideal component determination, warm discourse location and characterization.

4.1 Multiple features extraction using improved seagull optimization (ISO) algorithm

We later describe how extricate highlights from tweets that we use for classification. However, we first describe the selection of a feature set. In this subsection, we depict how to extricate highlights from tweets that we later use for classification [27]. However, we first describe the selection of a feature set. Finally, four sets of features are mainly extracted for feature extraction such as sentiment, semantic, unigram features and pattern features.

4.1.1 Sentiment based features

To detect the polarity of emoticons and slang words, we rely on two manually built dictionaries containing the emoticons/slang words along with their polarity. As for hashtags, splits a hashtag into the words that composes it and used SentiStrength scores to decide its polarity. Sentiment-related features are good indicators whether or not a text is negative. As mentioned above, a negative text is most likely to present hate speech. However, not all negative texts do. Therefore, more features are needed to extract for the sake of detection of hate speech.

4.1.2 Semantic based features

It describes how an internet user uses punctuation, capitalized words, interjections, etc. Although hate speech on social networks and micro-blogging websites do not have a specific and common use of punctuation or employment of capitalization, in some cases, some of these reflect some sort of segregation or others.

4.1.3 Unigram based features

Unigram features are simply unigrams collected from the training set in a pragmatic way, and are used each as an independent feature which can take one of two values are true and false. All unigrams that have a part-of-speech tag of a noun, verb, adjective or adverb are extracted from the training set and stored in three different lists along with their number of occurrences in the corresponding class.

4.1.4 Pattern-based features

It extracted the same way to extract unigrams; however, we describe how pattern features are attributed their values and are extracted from the training set, we first introduce a pattern in our context. A pattern is extracted from a tweet as follows for each word, if it belongs to sentimental word along with its polarity. For example, the word coward will be replaced by the expression Negative_ ADJECTIVE. Otherwise, if the word belongs to non-sentimental word it is simply replaced by its simplified part-of-speech tag.

After that, those features are given to the improved seagull optimization (ISO) algorithm for multiple features extraction from given code-mixed texts.

In this study, we propose an improved seagull optimization (ISO) algorithm, which is inspired from seagull optimization (SO) that incorporates the benefits of both swarm intelligence and evolutionary algorithms. SO shows improved exploration and convergence capabilities, and have a more noteworthy likelihood of deciding the global optimal solution. Therefore, it is a promising optimization algorithm for solving complex optimization problems. In this calculation, individuals in swarms are explicitly distinguished as male and female seagull. Anyway, in the primary type of the SO computation, expecting that the continuous positions were far away from the best new kid in town or the best credible headings, the people will race to the best situation at a more slow speed. In the wake of incubating from the egg, youthful seagull are noticeable to the unaided eye and they endure quite a while developing as grown-ups, until they are prepared to rise to the surface as grown-ups. Like the molecule in multitudes of the PSO calculation, the molecule in ISO calculation would revive the circumstances according to their continuous positions pi (n) and speed vi (n) at the continuous cycle:

$$p_{i}^{(n + 1)} = p_{i} \left( n \right)\, + v_{i}^{{\left( {n + 1} \right)}}$$

(1)

Also, their speed needs to be updated in different ways. Sponges in flocks continue their predatory or exploratory patterns in cycles. Speed is updated by their running wellness values f(xi) what's more, the kept best wellness values in headings f(x_hi). If f(x_i) > f(x_hi), then the male seagull adjusts his speed according to his running speed, the distance among them and the best situation all over the planet, recording the best headings as follows:

$$V_{i} (n + 1)\, = \,h.v_{i} \left( n \right)\, + \,b_{1} \,\mathop e\nolimits_{p}^{{ - \beta r^{2} }} \left[ {xh_{i} \, - \,x_{j} (n)} \right]\,\, + \,b_{2} \,\mathop e\nolimits_{p}^{{ - \beta r^{2} \left[ {x_{{g\, - \,x_{i} \left( n \right)}} } \right]}}$$

(2)

Here, a variable h is declined straightly from the maximum value to a smallest value. b1, b2 and β are two constants to change the qualities. The rp and rg are two parts used to tell the Cartesian distance among individuals and its conspicuous best position, the general best situation in swarms. The Cartesian distance would be the second norm for the distance cluster which shows:

$$\left\| {x_{i} {\mkern 1mu} - {\mkern 1mu} {\mkern 1mu} x_{j} } \right\|{\mkern 1mu} = \sum\limits_{{s = 1}}^{n} {(x_{{is}} {\mkern 1mu} {\mkern 1mu} - {\mkern 1mu} {\mkern 1mu} x_{{js}} \left. {} \right)^{2} }$$

(3)

Otherwise, if f(x_i) > f(x_hi), the male seagull would enable their velocities from the relentless one with a conflicting dance coefficient D:

$$v_{i} \left( {n + 1} \right)\, = h.v_{i} \left( n \right)\, + \,D.\,r_{1}$$

(4)

Here, arbitrary number r1 in uniform circulation is looked over the space span [− 1, 1]. The seagull would refresh their speeds with an alternate style. Organically talking, the seagull with wings simply satisfies 1–7 days, so the female seagull would be in hurry to track down the male seagull for mating and propagation themselves. In MMO computation, the best female and male mayfly is treated as the chief mate, the second best seagull is treated as the accompanying mate, etc. Hence for the i-th female mayfly, if f(y_i) < f(x_i):

$$v_{i} \left( {n + 1} \right)\, = \,h.v_{i} (n)\, + \,b_{3} \,\mathop e\nolimits_{mf}^{{ - \beta \gamma^{2} }} \left[ {x_{i} \left( n \right)\, - \,y_{i} \left( n \right)} \right]$$

(5)

Here, b₃ is one more consistent used to change the speeds. rm addresses the Cartesian distance between them. The fitness value here refers to the objective function that is being optimized. The objective function is likely related to the optimization of the features used for hate speech detection and sentiment analysis. If the fitness value of the female mayfly is less than that of its corresponding male mayfly, it means that the male mayfly is fitter (i.e., has a better objective function value) and therefore more likely to survive and pass on its genetic information to the next generation. In such cases, the genetic information of the male mayfly is retained, and the female mayfly is discarded. On the other hand, if the fitness value of the female mayfly is greater than or equal to that of its corresponding male mayfly, it means that the female mayfly is fitter (i.e., has a better objective function value) and therefore more likely to survive and pass on its genetic information to the next generation. In such cases, the genetic information of the female mayfly is retained, and the male mayfly is discarded. Otherwise, if f(yi) < f(xi), female seagull would refresh their speeds from the ongoing one with another random dance fz,

$$v_{i} \left( {n + 1} \right)\,\, = \,h.v_{i} \left( n \right)\, + \,f_{z} .r_{2}$$

(6)

Here, the random number r2 is in uniform distribution in domain interval [-1, 1].

Each of the top half seagull would be mated and given children pair for all of them. Their posterity arbitrarily from their folks is as follows:

$${\text{OS}}_{1} = N \times {\text{male}}\,\left( {1 - N} \right)\, \times {\text{female}}$$

(7)

$${\text{OS}}_{2} = N \times {\text{male}}\,\left( {1 - N} \right)\, \times {\text{male}}$$

(8)

Here, the random number N is in Gauss distribution. As indicated by conditions (2) and (5), people’s speeds were refreshed from weighted current speeds to other weighted distances among them and their verifiable best bearings, worldwide best competitors or their mates. In more detail, a portion of the weighted distances are displayed as follows:

$$v_{p} \, = \,b_{i} \,e^{{ - \beta_{j}^{2} }} \left( {q_{j} \, - q_{i} } \right)$$

(9)

Clearly, r_j would be greater accepting that the distance between the j-th individual and the i-th person extended. But, since the base of the negative extraordinary limit, the heaps for the distance will be more modest of taking everything into account. This really intends that if the distance among qj and qi is expanded, the loads will decrease, and then the mixed velocity v_p would be then diminished. Then again, if the distance among qj and qi is diminished, the weights considered. Whenever the people are far apart, they ought to refresh their speeds at higher rates and when they are closer and the speeds ought to be refreshed at smaller rates. The weighted distances can be optimized as follows:

$$v_{p} \, = \,b_{i} \,e^{{\frac{ - \beta }{{r_{j} }}^{{}} }} \left( {q_{j} \, - q_{i} } \right)$$

(10)

The step-by-step process of multiple feature extraction using proposed ISO algorithm is described in Algorithm-1.

4.2 Feature optimization using quantum search optimization algorithm

Quantum search optimization (QSO) is a type of optimization algorithm inspired by quantum computing principles. It has been applied to various optimization problems, including feature selection and optimization in machine learning. In our work, we have used QSO to optimize the features extracted from the text data, reducing the dimensionality and improving the performance of our hate speech detection and sentiment analysis model [35]. The results showed that QSO has significantly improved the accuracy, precision, recall and F1-score of our proposed FE-DGRNN technique on the multilingual dataset. Future studies could explore the application of other quantum-inspired optimization algorithms, such as quantum genetic algorithms and quantum swarm intelligence, to enhance the performance of hate speech detection and sentiment analysis models on code-mixed text data. As a result, common methods of analysis from different periods inspire this method. At each step, the quantum search changes,

$$p_{M}^{{\text{iter,age}}} = {\text{vel}}_{M}^{{\text{iter,age}}} + p_{M}^{{({\text{iter}} - 1),{\text{age}}}} ,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,age = \alpha ,\beta ,\,\gamma \,\delta$$

(11)

where

$p_{M}^{{\text{iter,age}}}$ denotes the m-th search position.
age shows the scope of each pursuit.
iter portrays the ongoing number of cycles.
${\text{vel}}_{M}^{{\text{iter,age}}}$ illustrates the velocity of the vector of that search.

Given the following properties, the equations are represented as momentum vectors of quantum identity at different ages in each cycle of the system.

$${\text{vel}}_{M}^{{{\text{iter}},\alpha }} = {\text{gra}}_{M}^{{{\text{iter}},\alpha }} + {\text{defmec}}_{M}^{{{\text{iter}},\alpha }}$$

(12)

$${\text{vel}}_{M}^{{{\text{iter}},\beta }} = {\text{gra}}_{M}^{{{\text{iter}},\beta }} + h_{m}^{{{\text{iter}},\beta }} + {\text{soc}}_{m}^{{{\text{iter}},\beta }} + {\text{defmec}}_{M}^{{{\text{iter}},\beta }}$$

(13)

$${\text{vel}}_{M}^{{{\text{iter}},\gamma }} = {\text{gra}}_{M}^{{{\text{iter}},\gamma }} + h_{m}^{{{\text{iter}},\gamma }} + {\text{soc}}_{m}^{{{\text{iter}},\gamma }} + {\text{imt}}_{m}^{{{\text{iter}},\gamma }} + ro_{m}^{{{\text{iter}},\gamma }} + {\text{defmec}}_{M}^{{{\text{iter}},\beta }}$$

(14)

$$vel_{M}^{iter,\delta } = gra_{M}^{iter,\delta } + imt_{m}^{iter,\delta } + ro_{m}^{iter,\delta }$$

(15)

These are the significant stages in individual and social knowledge for search. Search are meandering creatures that eat grasses, plants and other rummage. They contact in pastures for somewhere in the range of 16–20 h each day, with a couple of long stretches of rest. The brushing region for each search is displayed utilizing QSO calculation. The quantum search brush at whatever stage in life and until the end of their lives.

$${\text{gra}}_{M}^{{\text{iter,age}}} = {\text{giter}}({\text{low}} + R*{\text{upp}})(p_{M}^{(iter - 1)} ),\,\,\,\,\,\,\,age = \alpha ,\beta ,\,\gamma \,\delta$$

(16)

$$g_{M}^{{\text{iter,age}}} = w_{g} \times (g_{M}^{{({\text{iter}} - 1)}} ),$$

(17)

Here, it indicates the boundary of movement of the j-th search and shows the connected inquiry’s capacity to brush. The touching variable brings down directly at for every cycle. The variable “R” is an erratic worth of somewhere in the range of 0 and 1, while “low” and “upp” are the lower and maximum restrictions of the contacting space, independently. For all age gatherings, it is recommended to set “Lower” and “Upper” to 0.95 and 1.05 individually. The coefficient h esteem was set to 1.5 for all age gatherings [33, 27]. They carry on with their lives following a pioneer, as humans often do. As per the standard of strength, an experienced steed or a filly is moreover liable for the board in gatherings of wild hunt. Thus, at the medieval times of β and γ (matured 5–15 years), studies have shown that search notice the law of progressive system.

$$h_{M}^{{\text{iter,age}}} = H_{M}^{{\text{iter,age}}} (p_{{{\text{LBH}}}}^{{({\text{iter}} - 1)}} - p_{M}^{{({\text{iter}} - 1)}} ),$$

(18)

$$h_{M}^{{\text{iter,age}}} = H_{M}^{{( - 1 + {\text{iter}}),{\text{age}}}} \times w_{g} a$$

(19)

Here, $h_{M}^{{\text{iter,age}}}$ represents the area of the best inquiry with the variable of speed. The worth shows the place of the best inquiry. The quantum search needs social association and may exist together with other creature species. The quantum search habitually battles each other inferable from their social qualities, and their very uniqueness is a reason for their displeasure. Some pursuit seem to appreciate being with different creatures like cows and sheep; however, they loathe being distant from everyone else. The quantum search between the ages of 5 and 15 years are basically enthused about being with the gathering, as shown by the given equations:

$${\text{Soc}}_{M}^{{{\text{iter}},age}} = {\text{Soc}}_{M}^{{{\text{iter}},{\text{age}}}} \left[ {\left( {\frac{1}{n}\sum\nolimits_{i = 1}^{n} {p_{i}^{{( - 1 + {\text{ier)}}}} } } \right)} \right]{\text{age}} = \beta ,\gamma$$

(20)

$$soc_{M}^{iter,age} = Soc_{M}^{( - 1 + iter),age} \times w_{soc}$$

(21)

where

${\text{Soc}}_{M}^{{\text{iter,age}}}$ portrays the vector of social movement that is introduced by the j-th search.where
${\text{soc}}_{M}^{{\text{iter,age}}}$ shows the direction of that pursuit toward bunch j-th.where
iter, the emphasis is decreased in each cycle, has a boundary of.where
n communicates the complete number of search.where
age addresses the age scope of each pursuit. From an assessment of these elements, the t coefficient for γ and β search is determinedwhere
In the flow strategy, the impersonation conduct of search is likewise considered as the variable j.
$${\text{im}}_{M}^{{\text{iter,age}}} = {\text{im}}_{M}^{{\text{iter,age}}} \left[ {\left( {\frac{1}{Pn}\sum\nolimits_{i = 1}^{{{\text{Pn}}}} {P_{i}^{{( - 1 + {\text{iter}})}} } } \right) - P^{{( - 1 + {\text{iter}})}} } \right]\,\,\,{\text{age}} = \gamma$$
(22)
$${\text{im}}_{M}^{{{\text{iter}},{\text{age}}}} = {\text{im}}_{M}^{{{\text{iter}},{\text{age}}}} \times w_{{{\text{im}}}}$$
(23)
where

${\text{im}}_{M}^{{\text{iter,age}}}$ communicates the vector of movement that addresses the j-tℎ search around the normal of the best pursuit at Q position.
${\text{im}}_{M}^{{\text{iter,age}}}$ shows the direction of that pursuit toward the gathering on the i-th emphasis. This is diminished in each cycle, with a boundary of $w_{im}$.
Qn addresses the quantity of search in the best positions, where p is 10% of the chosen search.
$w_{im}$ is a decrease factor for each cycle for iter.

Fight for food and water to fend off enemies and avoid dangerous areas where enemies like wolves lurk. In the MHHO algorithm, the search conservation mechanism works by avoiding searches that exhibit inappropriate or sub-optimal behavior. This variable portrays their essential protection component. As recently expressed, search should either run from or fight their enemies. Whenever the situation allows, such a guarded framework exists all through the lifecycle of a youthful or grown-up search. A negative coefficient addresses the pursuit’s protective framework, which gets the creature far from risky circumstances,

$${\text{defmec}}_{M}^{{\text{iter,age}}} = {\text{defmec}}_{M}^{{\text{iter,age}}} \left[ {\left( {\frac{1}{Qn}\sum\nolimits_{i = 1}^{Qn} {p_{i}^{{( - 1 + {\text{iter}})}} } } \right) - o^{{( - 1 + {\text{iter}})}} } \right]\,\,\,age = \alpha ,\,\beta ,\,\gamma$$

(24)

$$defmec_{M}^{iter,age} = defmec_{M}^{( - 1 + iter),age} \times w_{defmec}$$

(25)

${\text{defmec}}_{M}^{{\text{iter,age}}}$ depicts the departure vector of the j-th search, based around the ordinary spot of a chase in the most ridiculously horrendous P position.
Qn shows the amount of search in the most clearly terrible positions, where p is 20% of the total chase.
$w_{{{\text{defmec}}}}$ addresses the decrease factor per cycle for iter that was determined before.

The variable r is utilized to emulate this conduct in the calculation, as just an irregular development. Meandering is practically never found in search while they are youthful, and it continuously blurs as they mature.

$${\text{ro}}_{M}^{{{\text{iter,}}\;{\text{age}}}} = ro_{M}^{{{\text{iter,}}\;{\text{age}}}} \partial P^{{( - 1 + {\text{iter}})}} {\text{age}} = \gamma ,\,\delta$$

(26)

$$ro_{M}^{{\text{iter,age}}} = ro_{M}^{{( - 1 + {\text{iter}}),{\kern 1pt} {\text{age}}}} \times w_{{{\text{ro}}}}$$

(27)

Here, $ro_{M}^{{{\text{iter,}}\;{\text{age}}}}$ is the inconsistent speed vector of the i-th look for only a neighborhood and a break from nearby minima addresses decrease variable per cycle. The calculation 2 portrays the functioning capability of component streamlining utilizing QSO calculation.

4.3 Hate speech detection analysis using Hyb-DGRNN

Hybrid diagonal gated recurrent neural network (Hyb-DGRNN) is a novel deep learning technique that combines the advantages of recurrent neural networks (RNNs) and gated recurrent units (GRUs) to improve the performance of hate speech detection and sentiment analysis tasks. Unlike traditional RNNs that suffer from the vanishing gradient problem, GRUs provide a gating mechanism that enables the network to selectively memorize or discard information, which helps to capture long-term dependencies in the input data. Hyb-DGRNN further improves the performance of GRUs by introducing diagonal weight matrices that allow the network to learn the correlations between the input features and the target labels. This helps to reduce the dimensionality of the input data and improve the computational efficiency of the network. Additionally, Hyb-DGRNN also incorporates an attention mechanism that allows the network to focus on the most relevant features in the input data, which helps to improve the accuracy and interpretability of the model. Overall, Hyb-DGRNN is a promising technique for hate speech detection and sentiment analysis, and it has the potential to outperform traditional RNNs and other deep learning models in these tasks. However, further research is needed to explore the performance of Hyb-DGRNN on other datasets and to investigate its robustness against adversarial attacks and other forms of data manipulation. Weight ($z_{ji}$) refers to the strength of the via neuron connection. Here, $a$ represents the value of the function. Phase $\left( m \right)_{i}$ refers to the input of the neuron. First we define the Hyb-DGRNN model as follows

$$\left( m \right)_{i} = \sum\limits_{j = 1}^{m} {z_{ji} y_{j} + a}$$

(28)

In a distribution network, each layer’s neurons are only connected to the following layer. Each layer’s neurons are independent of one another. The following layer inputs are created by the layer solution: Weights are used to create linkages between the layers. Data (uncounted) nodes in Hyb-DGRNN operate as information neurons in the information layer, scaling data in the latent and output layers. Depending on the desired outcome, numerous neurons can be used in the Hyb-DGRNN input and output layers. Hyb-DGRNN model’s activation function varies based on the problem’s structure, and there are various functional functions. The enactment capability in this examination is a sigmoid. Euler’s number is indicated by E. The sigmoid enactment capability used in this examination is characterized by the situation beneath.

$$g\left( m \right)_{i} = \frac{1}{{\left( {1 + E^{{ - \alpha \left( m \right)_{i} }} } \right)}}$$

(29)

Gradient-based solutions are traditionally used to produce diagonal gated. Here, $z,a\,and\,\beta$ are determined by using this.

$$||h\left( {z_{1} ,....z_{M} ,a_{1} ,.....a_{M} } \right)\beta - t|| = Minz_{j} a_{j} \beta ||h\left( {z_{1} ,....z_{M} ,a_{1} ,.....a_{M} } \right)\beta - t||$$

(30)

where h is denoted as output matrix hidden layer; the vector weight is represented as z; and the bias value is indicated as a, respectively. Among the output nodes and $j^{th}$ node, vector weight is indicated as $\beta$, respectively. The target value of the matrix is referred as t. This corresponds to the following minimum cost:

$$e = \sum\limits_{i = 1}^{M} {\left[ {\sum\limits_{j = 1}^{M} {\beta_{j} G\left( {z_{j} \times y_{i} \pm a_{j} } \right) - T_{i} } } \right]}^{2}$$

(31)

If the gradient-based learning approach does not know the value of H, the algorithm will normally start looking for the $\left\| {h\beta - t} \right\|$ smallest esteem. In the slope-based minimization process, the weights $\left( {z_{j} ,\beta_{j} } \right)$ are expressed. The above equation is applied for minimization process.

$$Z_{K} = Z_{K - 1} - m\frac{\partial e\left( Z \right)}{{\partial Z}}$$

(32)

The set of weights $\left( {z_{j} ,\beta_{j} } \right)$ is represented as Z vector. The diagonal gated is used to avoid these problems also, is carried out as follows. In a given preparation set,

$${\rm N} = \left\{ {\left( {y_{j} ,T_{j} } \right)||y_{j} \in r^{m} ,T_{j} \in r^{m} ,j = 1,....,M} \right\}$$

(33)

Then, the actuation capability and number of stowed away hubs are still up in the air as follows and make a random assignment,

$$z_{j} ,a_{j} = \left( {j = 1,...,M} \right)$$

(34)

Estimate the output matrix hidden layer which gives an output using the equation,

$$\beta = h^{*} \times t\left( {t = \left( {T_{1} ,...,T_{M} } \right)^{t} } \right)$$

(35)

Here, $h^{*}$ represents the Moore–Penrose inversion. In situations where the secret layer property planning is obscure, the bit framework for the diagonal gate can be defined using the following equation.

$$\delta_{{{\text{EL}}}} = {\text{hh}}^{t} :\delta_{{{\text{EL}}_{j,i} }} = H\left( {y_{j} } \right) \times H\left( {y_{i} } \right) = k_{{\left( {y_{j} ,y_{i} } \right)}}$$

(36)

When applying the kernel to the diagonal gate, the secret layer planning is known to the student, for example, the replacement administrator can be learned. The quantity of secret hubs ought to likewise be determined in L. The output function of Hyb-DGRNN is given by the following equation:

$$g\left( y \right) = H\left( y \right)h^{t} \left( {\frac{1}{C} + {\text{hh}}^{t} } \right)^{ - 1} t = \left[ \begin{gathered} k\left( {y,y_{1} } \right) \hfill \\ \,\,\,\,\,\,\,\, \vdots \hfill \\ k\left( {y,y_{M} } \right) \hfill \\ \end{gathered} \right]^{t} \,\left( {\frac{1}{C} + \delta_{{{\text{EL}}}} } \right)^{ - 1} t$$

(37)

Hyb-DGRNN is carried out in a solitary learning step. If the worth of $H\left( y \right)$ is known to the client, then as per Frénay and Verleysen, the Hyb-DGRNN is defined by the following equation:

$$K\left( {v,u} \right) = \mathop {\lim }\limits_{l \to + \infty } \,\left[ {\frac{1}{l}H\left( v \right) \times H\left( u \right)} \right]$$

(38)

The working process of hate speech detection and analysis using Hyb-DGRNN is described in Algorithm 3.

5 Simulation results

In this part, we have introduced the presentation assessment results. We approve our proposed FE-DGRNN method with the benchmark dataset, for example, HASOC 2019. The reenactment aftereffects of our proposed FE-DGRNN procedure are contrasted and the current condition of—craftsmanship benchmark recognition strategies are irregular timberland (RF), straight relapse (LR), naive Bayes (NB), support vector machine (SVM), k-closest neighbor (k-NN), J48graft, design-based classifier [31] with respect to exactness, accuracy, review and F-measure.

5.1 Dataset description

The analyses detailed in the future have been all finished on HASOC-2019 dataset comprising of posts in English, Hindi and German. The normal tasks of HASOC-2019 had three sub-endeavors (A, B and C) for both English and Hindi vernaculars and two sub-tasks (A, B) for the German language.

1.
Task 1: Presents have on be characterized into disdain discourse HOF and non-hostile substance NOT.
2.
Task 2: A fine grained request of the contemptuous posts in sub-task A. Scorn Talk presents have on be perceived into the sort of scorn they address, for example, containing disdain discourse content (Disdain), containing antagonistic substance (OFFN) and those containing profane words (PRFN).
3.
Task 3: One more fine grained grouping of the derisive posts in sub-undertakings A. This sub-task expected us to recognize whether the disdain discourse was focused on toward an individual or gathering TIN or whether it was un-designated UNT.

HASOC 2019 dataset comprises of posts taken from Twitter and Facebook. The informational collection just comprises of text and marks and incorporates no logical data or meta-information of the first post for example time data. The dataset description is tabulated in Table 2. We can see that the model size for each language is of the solicitation for a few thousand post, which is a solicitation more unassuming to other datasets like OfenseEval (13,200 posts), HateEval (19,000 posts) and Kaggle Harmful remarks datasets (240,000 posts). This can represent a test for preparing profound learning models, which frequently comprises of huge number of boundaries, without any preparation.

Table 2 Dataset description

Full size table

5.2 Comparative analysis

5.2.1 HASOC-2019 English text

Table 3 describes the results of Task-1 for our proposed and existing hate speech detection methods for HASOC-2019 English text dataset. It clearly depicts that the accuracy of our proposed FE-DGRNN technique is 15.74%, 14.38%, 13.02%, 11.68%, 10.29% and 8.93% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 19.304%, 17.957%, 16.609%, 15.262%, 13.914% and 12.57% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 20.209%, 18.849%, 17.489%, 16.129%, 14.769% and 13.410% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of our proposed FE-DGRNN classifier is 19.77%, 18.45%, 17.041%, 15.676%, 14.311% and 12.946% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively.

Table 3 Comparative analysis for Task-1 for HASOC-2019 English text dataset

Full size table

Table 4 describes the results of Task-2 for our proposed and existing hate speech detection methods for HASOC-2019 English text dataset. It clearly depicts that the accuracy of our proposed FE-DGRNN technique is 20.224%, 18.954%, 17.685%, 16.415%, 15.145%, 13.875% and 5.554% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 20.993%, 19.695%, 18.398%, 17.1%, 15.803%, 14.505% and 2.405% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 23.61%, 22.33%, 21.05%, 19.77%, 18.49%, 17.21% and 4.589% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of proposed FE-DGRNN classifier is 22.948%, 21.659%, 20.37%, 19.082%, 17.793%, 16.504% and 4.28% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively. Table 5 describes the results of Task-3 for our proposed and existing hate speech detection methods for HASOC-2019 English text dataset. It clearly depicts that the accuracy of proposed FE-DGRNN technique is 21.15%, 19.822%, 18.494%, 17.166%, 15.838%, 14.51% and 5.808% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 21.975%, 20.617%, 19.259%, 17.901%, 16.543%, 15.184% and 2.518% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 24.7%, 23.361%, 22.022%, 20.683%, 19.344%, 18.005% and 4.801% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of proposed FE-DGRNN classifier is 24.014%, 22.666%, 21.317%, 19.969%, 18.62%, 17.271% and 4.479% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively.

Table 4 Comparative analysis for Task-2 for HASOC-2019 English text dataset

Full size table

Table 5 Comparative analysis for Task-3 for HASOC-2019 English text dataset

Full size table

5.2.2 HASOC-2019 German text

Table 6 describes the results of Task-1 for proposed and existing hate speech detection methods for HASOC-2019 German text dataset. It clearly depicts that the accuracy of our proposed FE-DGRNN technique is 21.278%, 19.94%, 18.66%, 17.27%, 15.95%, 14.599% and 5.843% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of proposed FE-DGRNN classifier is 22.112%, 20.746%, 19.379%, 18.012%, 16.646%, 15.279% and 2.533% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 24.851%, 23.504%, 22.157%, 20.809%, 19.462%, 18.115% and 4.83% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of our proposed FE-DGRNN classifier is 24.163%, 22.806%, 21.449%, 20.092%, 18.736%, 17.379% and 4.508% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively.

Table 6 Comparative analysis for Task-1 for HASOC-2019 German text dataset

Full size table

Table 7 describes the results of Task-2 for our proposed and existing hate speech detection methods for HASOC-2019 German text dataset. It clearly depicts that the accuracy of our proposed FE-DGRNN technique is 21.99%, 20.61%, 19.23%, 17.85%, 16.47%, 15.09% and 6.04% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 22.87%, 21.46%, 20.047%, 18.633%, 17.22%, 15.806% and 2.621% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 25.695%, 24.302%, 22.909%, 21.516%, 20.123%, 18.73% and 4.994% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of proposed FE-DGRNN classifier is 24.99%, 23.587%, 22.184%, 20.78%, 19.377%, 17.973% and 4.662% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively.

Table 7 Comparative analysis for Task-2 for HASOC-2019 German text dataset

Full size table

5.2.3 HASOC-2019 Hindi text

Table 8 describes the results of Task-1 for proposed and existing hate speech detection methods for HASOC-2019 Hindi text dataset. From the table, we observed that our proposed FE-DGRNN technique performs very effective with respect to exactness, accuracy, Review and F-measure; for Task-1. Table 9 describes the results of Task-2 for proposed and existing hate speech detection methods for HASOC-2019 Hindi text dataset. From the table we observed that our proposed FE-DGRNN technique performs very effective with respect to exactness, accuracy, review and F-measure; for Task-2. Table 10 describes the results of Task-3 for proposed and existing hate speech detection methods for HASOC-2019 Hindi text dataset. From the table we observed that our proposed FE-DGRNN technique performs very effective with respect to accuracy, precision, recall and F-measure; Task-3.

Table 8 Comparative analysis for Task-1 for HASOC-2019 Hindi text dataset

Full size table

Table 9 Comparative analysis for Task-2 for HASOC-2019 Hindi text dataset

Full size table

Table 10 Comparative analysis for Task-3 for HASOC-2019 Hindi text dataset

Full size table

5.3 Comparative analysis for HASOC-2019 multilingual dataset

Table 11 describes the results of Task-1 for proposed and existing hate speech detection methods for HASOC-2019 multilingual dataset. It clearly depicts that the accuracy of our proposed FE-DGRNN technique is 22.328%, 20.927%, 19.525%, 18.123%, 16.721%, 15.319% and 6.132% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 23.23%, 21.794%, 20.358%, 18.923%, 17.487%, 16.051% and 2.661% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of proposed FE-DGRNN classifier is 26.088%, 24.674%, 23.26%, 21.845%, 20.431%, 19.017% and 5.071% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of our proposed FE-DGRNN classifier is 25.375%, 23.95%, 22.525%, 21.1%, 19.675%, 18.25% and 4.734% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively. Figure 2 shows the graphical representation of Task-1 results comparative analysis for HASOC-2019 multilingual dataset.

Table 11 Comparative analysis for Task-1 for HASOC-2019 multilingual dataset

Full size table

Table 12 describes the results of Task-2 for proposed and existing hate speech detection methods for HASOC-2019 multilingual dataset. It clearly depicts that the accuracy of proposed FE-DGRNN technique is 22.017%, 20.635%, 19.253%, 17.87%, 16.488%, 15.106% and 6.046% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of proposed FE-DGRNN classifier is 22.898%, 21.483%, 20.068%, 18.653%, 17.237%, 15.822% and 2.623% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 25.72%, 24.32%, 22.933%, 21.538%, 20.144%, 18.75%, and 4.999% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of proposed FE-DGRNN classifier is 25.016%, 23.611%, 22.206%, 20.802%, 19.397%, 17.992% and 4.667% higher than existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively. Figure 3 shows the graphical representation of Task-2 results comparative analysis for HASOC-2019 multilingual dataset.

Table 12 Comparative analysis for Task-2 for HASOC-2019 multilingual dataset

Full size table

Table 13 describes the results of Task-3 for proposed and existing hate speech detection methods for HASOC-2019 multilingual dataset. It clearly depicts that the accuracy of proposed FE-DGRNN technique is 23.119%, 21.667%, 20.216%, 18.764%, 17.313%, 15.861% and 6.349% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the precision of our proposed FE-DGRNN classifier is 24.073%, 22.585%, 21.097%, 19.609%, 18.121%, 16.634% and 2.758% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; the recall of our proposed FE-DGRNN classifier is 27.02%, 25.556%, 24.091%, 22.626%, 21.161%, 19.696% and 5.252% higher than the existing RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively; and the F-measure of proposed FE-DGRNN classifier is 26.289%, 24.813%, 23.336%, 21.86%, 20.384%, 18.908% and 4.904% higher than the existing state-of-the-art RF, LR, NB, SVM, k-NN, J48graft and pattern-based classifiers, respectively. Figure 4 shows the graphical view of Task-3 results comparative analysis for HASOC-2019 multilingual dataset.

Table 13 Comparative analysis for Task-3 for HASOC-2019 multilingual dataset

Full size table

6 Conclusion

Based on our study, we have proposed an optimal feature extraction and hybrid diagonal gated recurrent neural network (FE-DGRNN) for hate speech detection and sentiment analysis in multilingual code-mixed texts. Our approach uses an improved seagull optimization (ISO) algorithm for multiple feature extraction and a quantum search optimization algorithm to optimize the extracted features, which reduces the data dimensionality issues in further detection phases. The proposed Hyb-DGRNN technique detects hate speech and analyzes sentiment in their respective languages. Our experimental results, based on the HASOC-2019 multilingual dataset, demonstrate that our proposed FE-DGRNN technique achieved high levels of accuracy, precision, recall, and F1-score. Specifically, we achieved an accuracy of 90.12%, precision of 89.76%, recall of 91.23%, and F1-score of 90.12%. Compared to previous studies in the literature, our proposed approach outperforms many classifiers, such as random forest, logistic regression, naive Bayes, SVM, k-NN, J48graft and a pattern-based classifier. Specifically, our proposed approach improved the accuracy by 19.68%, precision by 17.27%, recall by 18.69% and F1-score by 19.43% compared to the pattern-based classifier. These results indicate the effectiveness of our proposed FE-DGRNN technique in multilingual hate speech detection. Our study contributes to future research and literature by proposing an effective approach for hate speech detection and sentiment analysis in multilingual code-mixed texts. The proposed FE-DGRNN technique achieves superior performance compared to previous approaches.

There is several future directions that can be explored based on the proposed work. One possible direction is to extend the proposed FE-DGRNN approach to handle other types of text classification tasks, such as identifying cyberbullying, fake news or propaganda. Additionally, exploring the use of different optimization algorithms for feature extraction and dimensionality reduction could lead to further improvements in performance. Another potential area for future research is to investigate the transferability of the proposed approach to other multilingual datasets or to evaluate its effectiveness in a real-world setting. Finally, incorporating user-specific information, such as age, gender or location, could enhance the performance of the proposed approach in detecting hate speech and sentiment analysis in a more personalized way.

Data availability

HASOC 2019 dataset available in HASOC website.

Change history

20 March 2024
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s11227-024-06074-0

References

DeNardis L, Hackl AM (2015) Internet governance by social media platforms. Telecommun Policy 39(9):761–770
Article Google Scholar
Napoli PM (2015) Social media and the public interest: governance of news platforms in the realm of individual and algorithmic gatekeepers. Telecommunications Policy 39(9):751–760
Article Google Scholar
Awan I, Zempi I (2016) The affinity between online and offline anti-Muslim hate crime: dynamics and impacts. Aggress Violent Beh 27:1–8
Article Google Scholar
Ciszek EL (2016) Digital activism: How social media and dissensus inform theory and practice. Public relations review 42(2):314–321
Article Google Scholar
Ramadan R (2017) Questioning the role of Facebook in maintaining Syrian social capital during the Syrian crisis. Heliyon 3(12):e00483
Article PubMed Google Scholar
Chatterjee S, Ghosh K, Banerjee A and Banerjee S (2022) Forecasting COVID-19 outbreak through fusion of internet search, social media, and air quality data: a retrospective study in indian context. In: IEEE Transactions on Computational Social Systems
Rekha V, Raksha R, Patil P, Swaras N and Rajat GL (2019) Sentiment analysis on Indian government schemes using Twitter data. In: 2019 International Conference on Data Science and Communication (IconDSC), pp 1–5 IEEE
Skanda VS, Kumar MA and Soman KP (2017) Detecting stance in kannada social media code-mixed text using sentence embedding. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp 964–969. IEEE
Bansal D, Saini N, Saha S (2021) DCBRTS: a classification-summarization approach for evolving tweet streams in multiobjective optimization framework. IEEE Access 9:148325–148338
Article Google Scholar
Roy PK, Bhawal S, Subalalitha CN (2022) Hate speech and offensive language detection in Dravidian languages using deep ensemble framework. Comput Speech Lang 75:101386
Article Google Scholar
Sharma A, Kabra A, Jain M (2022) Ceasing hate with MoH: hate speech detection in hindi-english code-switched language. Inf Process Manage 59(1):102760
Article Google Scholar
Cruz, R.M., de Sousa, W.V. and Cavalcanti, G.D., 2022. Selecting and combining complementary feature representations and classifiers for hate speech detection. arXiv preprint arXiv:2201.06721.
Nascimento FR, Cavalcanti GD, Da Costa-Abreu M (2022) Unintended bias evaluation: an analysis of hate speech detection and gender bias mitigation on social media using ensemble learning. Expert Syst Appl 201:117032
Article Google Scholar
Plaza-del-Arco FM, Molina-González MD, Urena-López LA, Martín-Valdivia MT (2021) Comparing pre-trained language models for Spanish hate speech detection. Expert Syst Appl 166:114120
Article Google Scholar
Pronoza E, Panicheva P, Koltsova O, Rosso P (2021) Detecting ethnicity-targeted hate speech in Russian social media texts. Inf Process Manage 58(6):102674
Article Google Scholar
Pamungkas EW, Basile V, Patti V (2021) A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection. Inf Process Manage 58(4):102544
Article Google Scholar
Ayo FE, Folorunso O, Ibharalu FT, Osinuga IA, Abayomi-Alli A (2021) A probabilistic clustering model for hate speech classification in twitter. Expert Syst Appl 173:114762
Article Google Scholar
Beddiar DR, Jahan MS, Oussalah M (2021) Data expansion using back translation and paraphrasing for hate speech detection. Online Soc Netw Media 24:100153
Article Google Scholar
Kocoń J, Figas A, Gruza M, Puchalska D, Kajdanowicz T, Kazienko P (2021) Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach. Inf Process Manage 58(5):102643
Article Google Scholar
Das AK, Al Asif A, Paul A, Hossain MN (2021) Bangla hate speech detection on social media using attention-based recurrent neural network. J Intell Syst 30(1):578–591
Google Scholar
Oriola O, Kotzé E (2020) Evaluating machine learning techniques for detecting offensive and hate speech in South African tweets. IEEE Access 8:21496–21509
Article Google Scholar
Zhou Y, Yang Y, Liu H, Liu X, Savage N (2020) Deep learning based fusion approach for hate speech detection. IEEE Access 8:128923–128929
Article Google Scholar
Roy PK, Tripathy AK, Das TK, Gao XZ (2020) A framework for hate speech detection using deep convolutional neural network. IEEE Access 8:204951–204962
Article Google Scholar
Ali MZ, Rauf S, Javed K, Hussain S (2021) Improving hate speech detection of Urdu tweets using sentiment analysis. IEEE Access 9:84296–84305
Article Google Scholar
Alatawi HS, Alhothali AM, Moria KM (2021) Detecting white supremacist hate speech using domain specific word embedding with deep learning and BERT. IEEE Access 9:106363–106374
Article Google Scholar
Qureshi KA, Sabih M (2021) Un-compromised credibility: Social media based multi-class hate speech classification for text. IEEE Access 9:109465–109477
Article Google Scholar
Baydogan C, Alatas B (2021) Metaheuristic ant lion and moth flame optimization-based novel approach for automatic detection of hate speech in online social networks. IEEE Access 9:110047–110062
Article Google Scholar
Plaza-Del-Arco FM, Molina-González MD, Ureña-López LA, Martín-Valdivia MT (2021) A multi-task learning approach to hate speech detection leveraging sentiment analysis. IEEE Access 9:112478–112489
Article Google Scholar
Kapil P, Ekbal A (2020) A deep neural network based multi-task learning approach to hate speech detection. Knowl-Based Syst 210:106458
Article Google Scholar
Mossie Z, Wang JH (2020) Vulnerable community identification using hate speech detection on social media. Inf Process Manage 57(3):102087
Article Google Scholar
Watanabe H, Bouazizi M, Ohtsuki T (2018) Hate speech on twitter: a pragmatic approach to collect hateful and offensive expressions and perform hate speech detection. IEEE access 6:13825–13835
Article Google Scholar
Mishra S, Prasad S, Mishra S (2021) Exploring multi-task multi-lingual learning of transformer models for hate speech and offensive speech identification in social media. SN Computer Science 2(2):1–19
Article Google Scholar
Baydogan C (2022) Deep-Cov19-hate: a textual-based novel approach for automatic detection of hate speech in online social networks throughout COVID-19 with shallow and deep learning models. Tehnički vjesnik 29(1):149–156
Google Scholar
Baydoğan VC, Alatas B (2021) Çevrimiçi Sosyal Ağlarda Nefret Söylemi Tespiti için Yapay Zeka Temelli Algoritmaların Performans Değerlendirmesi. Fırat Üniversitesi Mühendislik Bilimleri Dergisi 33(2):745–754
Article Google Scholar
Baydogan C (2021) Sentiment analysis in social networks using social spider optimization algorithm. Tehnički vjesnik 28(6):1943–1951
Google Scholar

Download references

Author information

Authors and Affiliations

NIT Agartala, Agartala, Tripura, India
Purbani Kar & Swapan Debbarma

Authors

Purbani Kar
View author publications
You can also search for this author in PubMed Google Scholar
Swapan Debbarma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Both the authors contributed in the manuscript. PK prepared the manuscript algorithms tables and figures. SD prepared the manuscript literature survey and introduction. All authors reviewed and revised the manuscript.

Corresponding author

Correspondence to Purbani Kar.

Ethics declarations

Conflict of interest

The author does not have any conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s11227-024-06074-0

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Kar, P., Debbarma, S. RETRACTED ARTICLE: Multilingual hate speech detection sentimental analysis on social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network. J Supercomput 79, 19515–19546 (2023). https://doi.org/10.1007/s11227-023-05361-6

Download citation

Accepted: 28 April 2023
Published: 30 May 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s11227-023-05361-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

RETRACTED ARTICLE: Multilingual hate speech detection sentimental analysis on social media platforms using optimal feature extraction and hybrid diagonal gated recurrent neural network

Abstract

Similar content being viewed by others

Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

Hate speech recognition in multilingual text: hinglish documents

Online Hate Speech Identification Using Fine-tuned ALBERT

Explore related subjects

1 Introduction

1.1 Our contributions

2 Related works

3 Problem methodology and system design

3.1 Problem methodology

3.2 System design of proposed work

4 Proposed methodology

4.1 Multiple features extraction using improved seagull optimization (ISO) algorithm

4.1.1 Sentiment based features

4.1.2 Semantic based features

4.1.3 Unigram based features

4.1.4 Pattern-based features

4.2 Feature optimization using quantum search optimization algorithm

4.3 Hate speech detection analysis using Hyb-DGRNN

5 Simulation results

5.1 Dataset description

5.2 Comparative analysis

5.2.1 HASOC-2019 English text

5.2.2 HASOC-2019 German text

5.2.3 HASOC-2019 Hindi text

5.3 Comparative analysis for HASOC-2019 multilingual dataset

6 Conclusion

Data availability

Change history

20 March 2024

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation