1 Introduction

Text classification (text categorization) is the task of assigning a given text (document) to one of a set of predefined classes (categories) based on its contents. The classified document is then grouped with the class it belongs to. For example, a document that discusses sports topics is classified as a sports document, whereas a document that mainly discusses political subjects is classified as a political document. Text classification is an important task for many applications such as social media, sentiment analysis, data mining, and medical applications. Many methods have been used for text classification in the literature, such as Support Vector Machines (SVM), K-Nearest Neighbor (KNN), Naive Bayes, and deep learning [9].

The importance of text classification comes from the need to classify different types of documents, for example, the growing volume of web content that must be classified for different purposes. Text classification has been applied to different types of applications such as sentiment analysis. However, few works have addressed the classification of Hadith (the sayings of Prophet Mohammed (Peace and blessings of Allah be upon him (PBUH))) [52]. Most studies on text classification were conducted on languages such as English and Chinese, with few works on the Arabic language. Hadiths were written in Arabic and are considered the second main resource of Islam after the Quran. Hadith in Islam means a report of the sayings, deeds, and teachings of the Prophet Mohammed (PBUH). There are thousands of written Hadiths, organized into different classes. Automatic classification of Hadiths is considered an important phase in data mining and Natural Language Processing (NLP) tasks [52]. However, few works have been directed toward Hadith classification. Furthermore, to the best of our knowledge, there is no study in the literature that used an optimization algorithm for feature selection (FS) on Hadith classification. The importance of the optimization algorithm comes from its ability to select the most informative features and discard the irrelevant ones. Thus, it can be used as a preprocessing step to improve classification performance. FS has become imperative for dealing with the high number of irrelevant features in many applications. Hence, researchers have developed FS techniques and applied them in various fields, including computer vision, pattern recognition, and classification, to name but a few. Examples of such contributions can be found in [64] and [63]. These studies and their significant contributions motivated us to apply an optimization algorithm (OA) for FS on Hadith classification.

There are thousands of available Hadiths. The first step in classifying these Hadiths is to carry out preprocessing steps such as tokenization, stemming, and stop word removal. The next step is to build the term frequency–inverse document frequency (TF-IDF) matrix over the remaining terms. However, not all remaining terms are relevant. Thus, before training the machine learning classifier on the whole set of features, FS is required to select the most informative features. There are two main FS approaches: the filter-based approach and the wrapper-based approach. Examples of filter-based FS are the Gini Index (GI), Chi-square (CHI), and Information Gain (IG). However, these FS techniques do not interact directly with the machine learning classifier being used. On the other hand, wrapper-based FS approaches interact directly with both the features and the classifier. The wrapper-based FS idea is to use an OA to select a subset of features from the full set of features to train the classifier [39].

In this work, the Sine Cosine Algorithm (SCA), a recent OA, is proposed to select the most informative features in wrapper-based FS mode for Hadith classification. SCA is one of the recently developed metaheuristic algorithms, introduced by Mirjalili [43]. The main idea of SCA is to use the sine and cosine mathematical functions to update solutions' positions. The SCA algorithm begins by generating a number of random solutions (i.e., search agents) in the solution space. Iteratively, it updates the positions of these candidate solutions toward or away from the destination solution (i.e., the best solution) using sine and cosine functions [43]. SCA is conceptually simple, derivative-free, and easy to use. Therefore, the SCA algorithm has been evaluated on different benchmark functions and has achieved promising results in comparison with other well-known optimization algorithms [43]. Nowadays, SCA is able to tackle a wide range of optimization problems due to these characteristics. A notable merit of SCA is that it has almost no parameters compared to other well-known optimization algorithms [43].

As aforementioned, SCA has been adapted to solve several optimization problems. For example, the authors of [17, 18] improved SCA by using the opposition learning technique and applied it to engineering and global optimization problems. Elaziz, Ewees, Oliva, Duan, & Xiong [18] improved SCA by using Differential Evolution (DE) operators for the FS problem. Zhao, Zou, & Chen [71] applied SCA to the community detection problem. Belazzoug, Touahria, Nouioua, & Brahimi [14] improved SCA exploration by using a new equation for updating the solutions' positions for the FS problem. Ramteke, Gurjar, & Deshmukh [50] applied SCA to the FS problem. The authors of [57] improved SCA by using local search and heuristic crossover and applied it to the traveling salesman problem. Lan, Fan, Liu, & Yang [36] improved SCA by using variable neighborhood search (VNS), a local search algorithm, and applied it to scheduling problems. The authors of [25] improved SCA by using crossover and a greedy selection mechanism for the global optimization problem. The authors of [28] hybridized SCA with the Ant Lion Optimizer (ALO) for the FS problem. The authors of [22] improved SCA by using a quasi-opposition learning strategy, a random weighting agent, and an adaptive mutation strategy for the global optimization problem. Guo, Wang, Dai, & Xu [24] improved SCA by using an optimal neighborhood and quadratic interpolation strategy for the global optimization problem. Further, the authors of [26] hybridized SCA with the artificial bee colony (ABC) algorithm for global optimization and image segmentation problems.

However, according to the No Free Lunch (NFL) theorem [66], no single optimization algorithm is superior to all other optimization algorithms in solving all types of problems. Hence, this motivates the usage of the SCA algorithm for FS on Hadith classification in this work. However, the SCA algorithm, like other optimization algorithms, tends to fall into local optima and has a problem with solution diversity. This leaves room for improvement over the basic version of SCA to avoid the mentioned problems. One successful way to improve the convergence behavior of an OA is to utilize chaotic map concepts.

In chaotic map theory, chaos has a unique characteristic: its capability of generating numbers that are regular, ergodic, unpredictable, and non-repetitive [65, 70]. Chaos theory has been incorporated into many metaheuristic algorithms in the literature and has proved its capability to improve these optimization algorithms. Examples of algorithms improved by using chaotic theory are as follows. The authors of [70] used chaotic maps with the bean optimization algorithm (BOA) to improve its population diversity and global search. The authors of [65] used chaotic maps with moth-flame optimization (MFO) to improve the balance between exploitation and exploration, enhance the MFO convergence speed, and avoid getting stuck at local optima. The authors of [47] used chaotic maps with the whale optimization algorithm (WOA) to avoid local optima and to solve random distribution problems for WOA internal parameters. The authors of [21] used chaotic maps to improve the global optimization of monarch butterfly optimization (MBO). The authors of [44] used chaotic maps to improve the convergence speed of Gravitational Search Algorithms (GSA) and to avoid falling into local optima. Sharma, Kaur, Sharma, Sharma, & Bansal [55] improved the stochastic nature of the Spider Monkey Optimization (SMO) algorithm by employing chaotic theory. The authors of [53, 54] improved the convergence speed of the Dragonfly Algorithm (DA) by using chaotic theory. Similarly, the authors of [34] improved the convergence speed and achieved a balance between the exploitation and exploration processes of the firefly algorithm (FA) using chaotic theory. Also, the authors of [53, 54] improved the convergence speed of the Crow Search Algorithm (CSA) and mitigated solutions being stuck in local optima by employing chaos theory.

As outlined, many studies in the literature have employed chaotic maps, which proves their efficiency in improving the OA used. Thus, this advantage of chaotic maps motivated the current work to utilize them with the SCA algorithm. The first contribution is the integration of the standard SCA with a chaotic Singer map. The Singer map is used to improve the balance between SCA exploration and exploitation, which in turn improves its solution diversity.

The second contribution to the SCA is the hybridization of simulated annealing (SA) [35] at the end of each chaotic SCA iteration to improve its exploitation (local search) by improving the current best solution. The SA algorithm has been used by many researchers in the literature to improve other optimization algorithms and has proved its ability to improve their performance. Examples of algorithms improved by using the SA algorithm are as follows. The authors of [40] hybridized WOA with SA to improve WOA's exploitation. Azmi, Pishgoo, Norozi, Koohzadi, & Baesi [12] hybridized GA with SA to exploit the advantages of both algorithms. The authors of [58] hybridized FA with SA to improve FA's exploitation. The authors of [32] hybridized PSO with SA to improve PSO's local search. Afshar-Nadjafi, Yazdani, & Majlesi [3] hybridized SA with tabu search (TS) to take advantage of both algorithms. Potthuri, Shankar, & Rajesh [48] hybridized the differential evolution (DE) algorithm with SA for global optimization. The authors of [51] improved the exploitation ability of ACO by using SA as a local search operator. Furthermore, the authors of [68] improved the exploitation of Coral reefs optimization (CRO) by using SA as a local operator at the end of each CRO iteration.

From our findings, very few studies have been conducted on Hadith classification [52]. In addition, there is no study in the literature that applied an OA for FS on Hadith classification. Thus, an improved SCA (ISCA) is proposed in this study for Hadith classification. ISCA is used in this work to improve Hadith classification accuracy and to reduce the number of selected features.

In the proposed work, the main objectives can be summarized as follows:

  • The GI as a filter-based approach and the ISCA algorithm are combined to complement each other's advantages and overcome each other's shortcomings, and thus classify Hadith text more accurately.

  • The ISCA is proposed using chaos theory and the SA algorithm as follows:

    a. Singer chaotic map: incorporating the Singer chaotic map within SCA to improve the diversity of its solutions.

    b. SA algorithm: embedding the SA algorithm as a local search operator at the end of each SCA iteration to improve its exploitation and to avoid the local optima problem.

  • The proposed ISCA was tested on three Hadith datasets. The conducted experiments showed the superiority of the ISCA compared to the five other comparative methods (i.e., SCA, PSO, GA, GOA, and HHO) and to other Hadith classification baseline works.

  • The collection of three different Hadith datasets, D1, D2, and D3, where D2 represents a new type of dataset in Hadith research. The D3 dataset is the English translated version of Sahih Al-Bukhari; no research in the literature has applied classification tasks to the English translated version of Hadiths. In addition, the application of the ISCA to the English dataset confirms the ability of the proposed algorithm to work on different languages.

  • The proposed ISCA was tested on 14 benchmark datasets from the UCI repository to confirm the generality of the ISCA. Again, the conducted experiments proved the superiority of the ISCA against other comparative algorithms, including SCA, PSO, GA, GOA, and HHO.

The rest of this paper is organized as follows: Sect. 2 includes the related works. Section 3 presents the details of the proposed ISCA algorithm. Section 4 describes the conducted experiments and provides in-depth analysis for the results obtained. Section 5 presents discussion about ISCA algorithm. Finally, the conclusion of the study is presented in Sect. 6.

2 Related works

Several studies have been conducted on Arabic text classification. For example, the authors of [4] applied a first-order Markov model to the classification of hierarchical Arabic text. Bahassine, Madani, Al-Sarem, & Kissi [13] improved Chi-square filter feature selection and used it with the SVM classifier for Arabic document classification. The authors of [1] used linear discriminant analysis (LDA) for the classification of Arabic documents. As another example, the authors of [42] used classification to verify clustering results. Moreover, the authors of [41] proposed a technique for classifying Indic documents based on k-means clustering, latent semantic analysis, and Gaussian clustering.

The following are examples of studies conducted on Hadith classification. For example, the authors of [46] compared three machine learning classifiers for classifying Malay translated Hadith based on Sanad. The experiment was conducted on 100 Hadiths that were labeled manually. They reported that the SVM classifier outperformed the other classifiers with 82% accuracy. Al Faraby, Jasin, & Kusumaningrum [8] applied an SVM classifier with a kernel to classify Hadiths of the Al-Bukhari book into three categories. They collected 1650 Hadiths from the Bahasa translated version of the Al-Bukhari book. In addition, they labeled each Hadith manually into one of three classes: negative suggestion, positive suggestion, or information. They reported that the best result achieved was 88% using the F1-score. The authors of [6] used a dataset from the Sahih Al-Bukhari book containing 200 Hadiths grouped into eight books (classes). The developed system computes the term frequency–inverse document frequency (TF-IDF) matrix of the terms and then ranks the classified Hadiths by subject. They reported that the best accuracy achieved was 83.2%.

In a study by [7], four different classifiers were compared: the Rocchio algorithm, Naive Bayes (NB), SVM, and KNN. The experiment was conducted on 1500 Hadiths collected from Sahih Al-Bukhari and categorized into eight books (classes). TF-IDF was used to calculate the frequency of the terms. They reported that the highest precision was 67.11%, obtained using the Rocchio classifier. The authors of [45] used associative rule mining classification for classifying Hadiths into either Da'ief (rejected) or Sahih (accepted) Hadith. However, no results were reported in the study. The authors of [2] used TF-IDF with the Random Forest classifier to classify Hadiths. They applied the experiment to 1650 Hadiths collected from the Bahasa translated version of Sahih Al-Bukhari. They reported that the best result achieved was 90% in terms of the F1-score. Al-Kabi, Wahsheh, & Alsmadi [7] compared three classification algorithms, namely NB, LogiBoost, and Bagging, for Hadith classification. They applied the experiment to 227 Hadiths collected from the Sahih Al-Bukhari book. They reported that the precision and recall results of the NB classifier were the best, with 59.9% and 60.4%, respectively. The authors of [5] compared the use of NB, Euclidean, cosine, Jaccard, inner product, and Dice measures for Hadith classification. They applied the experiment to a dataset collected from the Sahih Al-Bukhari book. They reported that the NB classifier achieved the best results with an F1-score of 85%. Harrag, El-Qawasmah, & Al-Salman [30] compared three different stemming techniques (light stemming, dictionary-lookup stemming, and root stemming) with two classifiers (Artificial Neural Networks (ANN) and SVM) for Hadith classification. They applied the experiment to a dataset collected from the Prophetic encyclopedia with a total of 453 Hadiths. They reported that the use of dictionary-lookup stemming with the ANN classifier achieved the best results with an F1-score of 50%. The authors of [29] used an ANN classifier with singular value decomposition (SVD) to classify Hadiths. They applied the experiment to a dataset collected from the Prophetic encyclopedia with a total of 453 Hadiths. They reported that the use of ANN with SVD achieved the best result with an F1-score of 88%.

Table 1 presents a summary of the discussed works on Hadith classification, highlighting the adopted method of each study, the dataset used, and the results obtained from the conducted experiments. Based on Table 1, it is observed that all previous Hadith classification works used classifiers without an optimization algorithm for minimizing the number of features. Therefore, an OA can be used to address this issue and improve classification performance.

Table 1 Comparative summary of previous works in Hadiths classification

The reason for selecting an OA is to select the most informative features and ignore irrelevant ones. In the literature, several studies were conducted using different optimization algorithms for the FS problem, but none of them were applied to Hadith datasets for Hadith classification. Examples of algorithms utilized for the FS problem are as follows. The authors of [67] improved the GSA and applied it to the FS problem. The authors of [38] improved the PSO algorithm and applied it to the FS problem. The authors of [16] applied the Cuttlefish Algorithm (CFA) to the FS problem. Emary, Zawbaa, Grosan, & Hassenian [20] applied the GWO algorithm to the FS problem. Zawbaa, Emary, Parv, & Sharawi [69] applied MFO to the FS problem. The authors of [23] applied the competitive swarm optimizer (CSO) to the FS problem. The authors of [10] applied the GOA for FS and parameter optimization of the SVM classifier. The authors of [39] improved the WOA algorithm using crossover and mutation operators for the FS problem. The authors of [11] applied the Butterfly Optimization Algorithm (BOA) to the FS problem. Tubishat, Abushariah, Idris, & Aljarah [59] improved the WOA and applied it to the FS problem. Tubishat, Idris, Shuib, Abushariah, & Mirjalili [61] and Tubishat, Ja'afar, et al. [62] improved the Salp Swarm Algorithm (SSA) for the FS problem. The authors of [60, 62] improved BOA for the FS problem. The authors of [19, 56] improved HHO and used it for the FS problem. Hammouri, Mafarja, Al-Betar, Awadallah, & Abu-Doush [27] improved DA for the FS problem. Further, Hu, Pan, & Chu [31] improved the Gray Wolf Optimizer (GWO) for the FS problem.

Although several studies have been conducted in the literature on the FS problem, none of these algorithms can outperform all other OAs on all types of datasets, according to the NFL theorem. Therefore, based on the above-mentioned reasons, we propose to improve the SCA and apply it for FS on Hadith classification.

To show the association between these techniques, Fig. 1 depicts the techniques used by the previous studies.

Fig. 1 Hadith Classification Techniques used by previous studies

As shown in Fig. 1 and based on Table 1, these previous studies used only simple preprocessing approaches without selecting the most informative features. Therefore, no FS was conducted in these studies.

3 The proposed Improved Sine Cosine Algorithm (ISCA)

The main improvements proposed for the SCA algorithm are the use of a Singer chaotic map and the hybridization with the SA algorithm. These two improvements address the weaknesses of the original SCA algorithm. Before the details of the proposed ISCA are discussed, the standard SCA, the SA algorithm, and the chaotic map are described.

3.1 The Sine Cosine Algorithm (SCA)

The SCA is one of the most recent optimization algorithms. It is mainly based on using the cosine and sine mathematical operators to update the search agents' positions in the search space. The SCA, like other optimization algorithms, starts by creating a number of random solutions. Afterward, these solutions are evaluated by the adopted fitness function, and the best one is assigned as the destination solution. Moreover, the search agents' positions are updated to new positions based on the sine and cosine equations. Finally, the SCA optimization process stops when the maximum number of iterations is reached. The cosine and sine operators drive the search agents toward the optimal solution in the search space. The main equation used by the SCA is shown in Eq. (1) [43]:

$$X_{i}^{t + 1} = \begin{cases} X_{i}^{t} + r_{1} \times \sin \left( r_{2} \right) \times \left| r_{3} P_{i}^{t} - X_{i}^{t} \right| & \text{if } r_{4} < 0.5 \\ X_{i}^{t} + r_{1} \times \cos \left( r_{2} \right) \times \left| r_{3} P_{i}^{t} - X_{i}^{t} \right| & \text{if } r_{4} \ge 0.5 \end{cases}$$
(1)

where \(r_{1}\), \(r_{2}\), \(r_{3}\), and \(r_{4}\) are random values. The first parameter \(r_{1}\) directs the search agents' movement either inside or outside the space between the current search agent \(X\) and the destination \(P\). The second parameter \(r_{2}\) specifies the movement length, either away from or toward the destination. The third parameter \(r_{3}\) assigns a random weight to the destination. The last parameter \(r_{4}\) switches between the cosine and sine equations. Regarding the other terms, \(X_{i}^{t}\) is the search agent's current position, \(X_{i}^{t + 1}\) is the new position of the current search agent based on the sine and cosine equations, and \(P_{i}^{t}\) is the best solution obtained so far (the destination). Figure 2 displays the pseudocode of the SCA algorithm [43].

Fig. 2 SCA algorithm [43]

In the standard SCA, \(r_{1}\) changes the search direction toward either exploitation or exploration based on Eq. (2):

$$r_{1} = a - t\frac{a}{T}$$
(2)

where \(a\) is a constant, \(t\) is the current iteration, and \(T\) is the maximum number of iterations [43].
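To make the update concrete, the following is a minimal sketch of one SCA iteration step (not the authors' implementation); the value \(a = 2\) and the per-dimension random draws of \(r_{2}\), \(r_{3}\), and \(r_{4}\) are common assumptions in SCA implementations.

```python
# A minimal sketch of the SCA position update (Eqs. 1 and 2), not the
# authors' code; a = 2 and per-dimension random draws are assumptions.
import numpy as np

def sca_step(X, P, t, T, a=2.0):
    """X: (n_agents, dim) positions; P: (dim,) destination (best so far);
    t: current iteration; T: maximum number of iterations."""
    r1 = a - t * (a / T)                            # Eq. (2): linear decay
    n_agents, dim = X.shape
    r2 = 2 * np.pi * np.random.rand(n_agents, dim)  # movement length/angle
    r3 = 2 * np.random.rand(n_agents, dim)          # random destination weight
    r4 = np.random.rand(n_agents, dim)              # sine/cosine switch
    step = np.abs(r3 * P - X)
    return np.where(r4 < 0.5,
                    X + r1 * np.sin(r2) * step,     # sine branch of Eq. (1)
                    X + r1 * np.cos(r2) * step)     # cosine branch of Eq. (1)
```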

3.2 The Simulated Annealing (SA)

The SA algorithm was originally proposed by [35]; its main idea is inspired by annealing theory. The SA mimics the cooling process of molten materials. Like other optimization algorithms, the SA needs some parameter configuration. Firstly, the SA starts with an initial temperature value and an initial solution. Then, at each iteration, the SA generates a neighbor of the current solution based on the adopted neighborhood structure and computes the objective function value of the neighbor solution. In the next step, the difference between the neighbor solution's fitness and the current solution's fitness is calculated. If this difference is less than 0, the neighbor solution is accepted and, if it also improves on the best solution, it becomes the new best solution; otherwise, the neighbor is accepted as the current solution only if a random value is less than the Boltzmann probability, in which case the SA accepts the worse neighbor solution. Finally, if neither condition is met, the current solution is left unchanged. At the end of each iteration, the SA updates the temperature value. Figure 3 displays the pseudocode of the SA algorithm.

Fig. 3 The SA algorithm [35]
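The acceptance rule described above can be sketched as follows, assuming a minimization problem; the geometric cooling schedule, the parameter values, and the user-supplied neighbor operator are illustrative assumptions rather than the paper's settings.

```python
# A minimal SA sketch for a minimization problem; the cooling schedule and
# parameters are assumptions, and `neighbor` is a user-supplied operator.
import math
import random

def simulated_annealing(init, fitness, neighbor, temp=1.0, cooling=0.95, iters=100):
    best = current = init
    best_fit = current_fit = fitness(init)
    for _ in range(iters):
        cand = neighbor(current)
        cand_fit = fitness(cand)
        delta = cand_fit - current_fit
        if delta < 0:                                   # better neighbor: accept it
            current, current_fit = cand, cand_fit
            if cand_fit < best_fit:                     # and track the best so far
                best, best_fit = cand, cand_fit
        elif random.random() < math.exp(-delta / temp): # Boltzmann probability
            current, current_fit = cand, cand_fit       # accept a worse neighbor
        temp *= cooling                                 # update the temperature
    return best, best_fit
```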

3.3 Chaotic map

The convergence of optimization algorithms is highly affected by the random number generators used. However, one main problem of ordinary random generators is their inability to produce random values over a particular distribution. Recently, many studies have used chaos instead of ordinary random number generators, and it has proved its ability to improve these algorithms. Chaos has the characteristics of a nonlinear system and shows chaotic behavior, which can be defined mathematically as the generation of randomness by deterministic systems. Chaos has a special property: its ability to generate regular, ergodic, unpredictable, and non-repetitive numbers [65, 70]. Therefore, it can give better results than ordinary stochastic search techniques, which generate their randomness based on probabilities. This important feature of chaos controls the diversity of the generated solutions.

Chaos theory has been embedded in many optimization algorithms to improve their performance and to solve their problems of premature convergence. One of the most important features of chaos is its ergodicity, which can improve the diversity among search agents and prevent the algorithm from being stuck in local optima. There are several chaotic map functions; one of them is the Singer map, whose equation is given in Eq. (3) [33]:

$$y_{k + 1} = \mu \left( 7.86 y_{k} - 23.31 y_{k}^{2} + 28.75 y_{k}^{3} - 13.302875 y_{k}^{4} \right),$$
(3)

where \(\mu\) can be any value in the range \([0.9, 1.08]\), \(y_{0} \in (0,1)\), and \(y_{k} \in (0,1)\).
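A direct implementation of Eq. (3) is shown below; the initial value \(y_{0} = 0.7\) and \(\mu = 1.07\) are assumed choices within the stated ranges.

```python
# A minimal sketch of the Singer map (Eq. 3); y0 and mu are assumed values
# within the ranges stated above.
def singer_map(y0=0.7, mu=1.07, length=40):
    values, y = [], y0
    for _ in range(length):
        y = mu * (7.86 * y - 23.31 * y**2 + 28.75 * y**3 - 13.302875 * y**4)
        values.append(y)  # each value stays in (0, 1)
    return values
```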

3.4 Gini Index (GI)

The GI measures the relevance weight of each feature toward the problem class labels [49]. Based on their GI weights, the features can be ranked. Therefore, this weight can be used to select the relevant features and ignore the irrelevant ones. The equation for computing the GI of a given feature is shown in Eq. (4) [15]:

$$GI\left( f \right) = 1 - \sum_{j = 1}^{n} P_{j}^{2},$$
(4)

where \(f\) is the feature for which the GI is computed, \(n\) is the number of class labels, and \(P_{j}\) is the probability of class \(j\) for the given feature.
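As an illustration of Eq. (4), the sketch below computes the GI weight of a single feature from the class distribution of the documents containing it; the example counts are hypothetical.

```python
# A minimal sketch of Eq. (4); the class counts in the example are hypothetical.
import numpy as np

def gini_index(class_counts):
    """class_counts[j]: number of documents of class j containing the feature."""
    p = np.asarray(class_counts, dtype=float)
    p /= p.sum()                    # class probabilities P_j for this feature
    return 1.0 - np.sum(p ** 2)     # Eq. (4)

print(gini_index([18, 1, 1]))  # ~0.185: feature concentrated in one class
print(gini_index([7, 7, 6]))   # ~0.665: feature spread across classes
```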

3.5 Improved Sine Cosine Algorithm (ISCA)

This section presents the details of the proposed ISCA algorithm. Figure 4 presents the pseudocode of the proposed ISCA. In the pseudocode, the first improvement is indicated in lines 2 and 7, which cover the use of the Singer chaotic map. The second improvement is the use of the SA (line 14) to improve the SCA local search and avoid getting stuck in local optima. The proposed classification algorithm is composed of three main phases: the preprocessing phase, the filter phase, and the wrapper phase.

Fig. 4 The improved SCA algorithm

3.5.1 Preprocessing phase

The preprocessing steps are carried out on the used datasets to remove the 'Sanad' from each Hadith, so that only the 'Matn' is considered for further preprocessing. It is noteworthy that the Sanad of a Hadith is its chain of narrators, whereas the Matn is the text of the Hadith [52]. In addition, tokenization, stop word removal, stemming, and term weighting are applied to the Hadith Matn. In tokenization, each Hadith Matn from the used datasets is divided into individual tokens (words). In the stop word removal step, all words considered stop words in English or Arabic are removed. In the stemming step, the stem of each token remaining after stop word removal is determined. Finally, the words remaining in each Hadith are represented as a vector, and the TF-IDF matrix is calculated from these vectors according to the terms' weights. Therefore, the input features within the TF-IDF matrix, which are used for further processing in the experiments, are the Hadith words left from the Hadith 'Matn' after applying preprocessing steps such as stop word removal and stemming.
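A minimal sketch of this phase is given below, assuming NLTK's Arabic stop word list and ISRI stemmer as stand-ins for the (unspecified) tools used here; `hadith_matns` is a hypothetical placeholder corpus of Matn strings.

```python
# A minimal sketch of the preprocessing phase; NLTK's Arabic resources are
# assumed stand-ins (requires nltk.download('stopwords')), and the corpus
# below is a hypothetical placeholder.
from nltk.corpus import stopwords
from nltk.stem.isri import ISRIStemmer
from sklearn.feature_extraction.text import TfidfVectorizer

stemmer = ISRIStemmer()
arabic_stops = set(stopwords.words('arabic'))

def preprocess(matn):
    tokens = matn.split()                                  # tokenization
    tokens = [t for t in tokens if t not in arabic_stops]  # stop word removal
    return ' '.join(stemmer.stem(t) for t in tokens)       # stemming

hadith_matns = ["نص الحديث الأول", "نص الحديث الثاني"]      # placeholder Matns
docs = [preprocess(m) for m in hadith_matns]
tfidf_matrix = TfidfVectorizer().fit_transform(docs)       # TF-IDF term weighting
```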

3.5.2 Filter phase

In this phase, GI feature weighting is used to find the weight of each feature; in other words, each feature is given a GI weight. This step is very important to minimize the number of features explored by the ISCA. In addition, using GI filters the features by removing the irrelevant ones and keeping only the relevant features to be further used by the ISCA.
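For illustration, a minimal sketch of this filter step is given below, under the assumption that a higher GI weight means a higher rank.

```python
# A minimal sketch of the filter phase: keep the top `ratio` fraction of
# features by GI weight (assuming higher weight = higher rank).
import numpy as np

def top_gi_features(gi_weights, ratio=0.10):
    k = max(1, int(ratio * len(gi_weights)))
    return np.argsort(gi_weights)[::-1][:k]   # indices of top-ranked features
```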

3.5.3 Wrapper phase (Improved SCA based on SA and chaos)

The main improvement to the standard SCA is the use of a Singer chaotic map within the SCA. Besides that, the SA algorithm is embedded into the SCA by calling the SA at the end of each SCA iteration to search for solutions better than the current best solution. These improvements are significant in ensuring a suitable tradeoff between the exploration and exploitation of the standard SCA. As presented in Fig. 4, the first improvement to the standard SCA is the use of a chaotic Singer map instead of a randomly generated value for the \(r_{4}\) variable. In the improved ISCA algorithm, the value of \(r_{4}\) is now drawn from the chaotic Singer map to improve the diversity of the search agents' solutions. Moreover, the incorporation of the chaotic Singer map to generate the \(r_{4}\) value allows the ISCA to control the switching between the sine and cosine equations, which are used to update the search agents' positions. Thus, this improves solution diversity and prevents the ISCA algorithm from being stuck in local optima. Firstly, the ISCA generates a vector of values from the chaotic Singer map. Then, at each iteration of ISCA, the \(r_{4}\) value from the chaotic Singer map vector controls the selection of either the sine or cosine equation to update the position of the search agent.

The second improvement is the hybridization of the SA as a local search algorithm to improve the exploitation capability of the standard SCA. At the end of each iteration of the standard SCA, the SA algorithm is invoked to search for a solution better than the current best solution received from SCA. The best solution generated by SCA at the end of each iteration is taken as the initial solution of SA. The SA then searches for a possibly better solution than the current SCA best solution; if it finds one, it replaces the current best solution with the new one. Thus, the hybridization of SA with SCA improves the exploitation capabilities of the standard SCA and finds better solutions.
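The wrapper evaluation used by both SCA updates and the SA local search can be sketched as follows, under stated assumptions: a binary mask encodes a candidate feature subset, and the fitness combines the KNN cross-validation error with the selected-feature ratio; the weight alpha = 0.99 and k = 5 neighbors are illustrative choices, not the paper's settings.

```python
# A minimal sketch of a wrapper fitness for a candidate feature subset;
# alpha = 0.99 and k = 5 neighbors are assumptions, not the paper's settings.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def fitness(mask, X, y, alpha=0.99):
    """mask: boolean vector over the GI-filtered features; lower is better."""
    if not mask.any():
        return 1.0                                    # reject empty subsets
    acc = cross_val_score(KNeighborsClassifier(n_neighbors=5),
                          X[:, mask], y, cv=10).mean()
    ratio = mask.sum() / mask.size                    # fraction of features kept
    return alpha * (1.0 - acc) + (1 - alpha) * ratio  # error + size penalty
```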

The main steps of the proposed ISCA + GI are as follows, as presented in Fig. 5:

  1. Preprocessing phase: In this phase, tokenization, stemming, removal of the Sanad from each Hadith, and building of the TF-IDF matrix are carried out.

  2. Filter phase: In this phase, to reduce the search space explored by ISCA, GI ranks all features by their GI weights. The top-ranked features based on GI weights are selected as input to the ISCA. This step is crucial to prevent ISCA from searching irrelevant areas of the space.

  3. Wrapper phase: The ISCA is used to select the most informative features to train the KNN classifier. This phase is composed of the following steps, as shown in the flowchart given in Fig. 5:

    a. ISCA initialization: In this step, the ISCA takes the input features selected in the filter phase and randomly generates a number of search agents' solutions from these features. From these initial solutions, the ISCA finds the best solution, where each solution contains a number of features selected randomly from the set of input features obtained from the filter phase.

    b. SMV initialization: The SMV vector values are initialized using the Singer map equation, Eq. (3). This represents the first improvement to the algorithm: the use of a chaotic Singer map improves the diversity of the solutions in the ISCA.

    c. Initial solutions evaluation: In this step, the ISCA evaluates the fitness value of each solution and sets P as the best solution so far.

    d. Search agents' positions update: In this step, the positions of the search agents at each ISCA iteration are updated using either the sine or the cosine formula from Eq. (1). The selection of either formula is based on the value of \(r_{4}\) assigned from the chaotic Singer map using Eq. (3): if the value of \(r_{4}\) is less than 0.5, the sine equation is used to update the current search agent's position; otherwise, the cosine equation is used. Finally, at the end of the current iteration, the ISCA finds the best search agent according to its fitness and updates the current best solution. In this step, the improvement is the use of the \(r_{4}\) value from the SMV vector to select the sine or cosine equation. Thus, the use of chaotic values makes ISCA balance between exploitation and exploration; in addition, it improves solution diversity.

    e. Apply SA local search: The SA takes the best solution resulting from step d as its initial solution, then tries to find a better solution than the current best by searching its neighbors. If the SA finds a better solution, it replaces the best solution. This step represents the second improvement to the ISCA and is important to prevent ISCA from falling into local optima.

    f. ISCA termination: ISCA repeats steps d and e for t iterations, updating the best solution at the end of each iteration whenever a better solution is found.

Fig. 5 The ISCA + GI algorithm

4 Experimental results and analysis

For the evaluation of the proposed algorithm, three Hadith datasets were used in our experiments, which were implemented using MATLAB and RapidMiner. In these experiments, tenfold cross-validation was used, where each dataset is divided into 10 folds: one fold for testing and the remaining nine folds for training. This cross-validation process was repeated 10 times in each experiment. At the end of the 10 runs of each experiment, the average accuracy, number of selected features, and fitness values were reported. In the conducted experiments, the number of search agents was 10 and the number of iterations was 40 for all algorithms. In addition, in all experiments, the major evaluation metric is the classification accuracy, determined as the ratio of correctly classified Hadiths according to their actual classes. The parameter settings of all optimization algorithms are presented in Table 2.

Table 2 Parameters setting of the used optimization algorithms

4.1 Datasets

As aforementioned, three categories of datasets are used to study the effect of the proposed method on Hadith classification. The characteristics of these datasets are provided in the following subsections, and Table 3 summarizes the statistics of each dataset. For comparative evaluation, the UCI datasets are also used; their characteristics are given in Table 4.

Table 3 Statistics of D1, D2, and D3 datasets
Table 4 Statistics of the used UCI datasets

4.1.1 D1 dataset

The Sahih Al-Bukhari collection of Prophet Mohammed (PBUH) Hadiths was classified by the book's author into 97 books (classes) based on the Hadith subject. The term book in Sahih Al-Bukhari denotes the class of a given Hadith. The D1 dataset is composed of 430 Hadiths collected from the Arabic version of the Sahih Al-Bukhari book (https://sunnah.com/bukhari#) and distributed over 8 classes.

4.1.2 D2 dataset

The D2 dataset is a collection of 481 Hadiths collected from Mohammad Al Albani's Targheeb and Tarheeb book (http://islamport.com). In this book, Hadiths are classified into two classes: encouragement (Targheeb) and warning (Tarheeb). No research work in the literature has applied this type of dataset, which mimics the idea of sentiment analysis with two classes, positive and negative. In this dataset, Hadiths were originally classified based on whether they are encouragement (Targheeb) or warning (Tarheeb) Hadiths.

4.1.3 D3 dataset

The D3 dataset is a collection of Hadiths from the English translated version of Sahih Al-Bukhari (https://sunnah.com/bukhari#), which was translated from Arabic to English. For the D3 dataset, 433 Hadiths were collected, distributed over 8 classes. To our knowledge, no study in the literature has previously applied classification to the English version of Hadiths; most studies were conducted on the Arabic version.

4.2 Experiments results and evaluation

The viability and performance of the proposed method have been measured using five different experimental scenarios. These experimental scenarios are discussed in the following subsections.

4.2.1 Experiment 1

Results using KNN, Random Forest, and SVR-RBF classifiers only.

The first experiment was conducted using the KNN, Random Forest, or SVR-RBF classifier alone. Each classifier was applied to each dataset using the full feature set, without GI feature reduction or an optimization algorithm. Table 5 shows that the best Hadith classification accuracy achieved using KNN was 0.65 on the D1 dataset, 0.77 on the D2 dataset, and 0.75 on the D3 dataset. Moreover, the results achieved by KNN are superior to those of the other classifiers, as shown in Table 5. However, these results demonstrate the importance of applying FS and an optimization algorithm to improve the performance and reduce the number of features used.

Table 5 Comparison of Hadiths classification accuracy using only KNN, Random Forest, and SVR-RBF classifiers

4.2.2 Experiment 2

Hadiths classification results using KNN classifier with GI feature selection.

In this experiment, GI feature reduction was applied together with the KNN classifier. GI was used to rank all features, and the top-ranked features were selected as shown in Table 6 (the number of input features based on the GI ratio represents the number of features selected from the whole feature set shown in Table 6 based on GI ranking). In this experiment, the GI-ranked features were selected at ratios of 10%, 20%, and 30% of the top GI-weighted features. Based on the results presented in Table 6, the highest classification accuracy on the D1 dataset was 0.66, obtained when either the top 20% or 30% of the GI-weighted features were selected and used by the KNN. The accuracy of GI with KNN on D1 outperformed KNN alone without feature reduction. For the D2 dataset, the highest accuracy achieved was 0.82, obtained when the top 10% of GI-weighted features were selected. Thus, the accuracy achieved on D2 using GI with KNN is better than using the KNN classifier alone without any FS technique. Moreover, the highest accuracy achieved on the D3 dataset was 0.73, obtained when the top 10% of GI-ranked features were selected. However, the accuracy obtained on D3 using GI with KNN is lower than using KNN alone, because the GI filter FS has no direct contact with the machine learning classifier. Therefore, it is important to select an optimal feature combination from these features using an optimization algorithm to train the KNN classifier and achieve better performance.

Table 6 Hadiths classification accuracy applying GI feature reduction with KNN classifier

4.2.3 Experiment 3

ISCA comparisons with other algorithms.

In this experiment, we compared the performance of the ISCA algorithm with that of other well-known and recent optimization algorithms, namely SCA, PSO, GA, GOA, and HHO. In particular, we implemented these baseline optimization algorithms, applied them to Hadith classification, and compared them with our ISCA algorithm. These optimization algorithms were implemented for Hadith classification because they had not been applied to it before. Each optimization algorithm was applied to the full feature set without using GI filter FS. The results obtained by applying these algorithms are presented in Table 7.

Table 7 The results obtained in Experiment 3, including accuracy, selected features, and fitness values

Based on the results displayed in Table 7, the ISCA algorithm outperforms all other baseline algorithms over all used datasets in terms of accuracy and fitness values (bolded results). Furthermore, the accuracy achieved by applying ISCA to the full feature set of each dataset is superior to using KNN alone or GI with the KNN classifier, based on comparing the results in Tables 5 and 6. This outperformance stems from ISCA's ability to select the most informative features and discard irrelevant ones. In addition, it is clearly noticeable from Table 7 that ISCA outperforms SCA in terms of feature reduction across all used datasets D1, D2, and D3.

As shown in Fig. 6, ISCA outperforms all other optimization algorithms in terms of accuracy. This outperformance results from the improvements introduced into the native SCA.

Fig. 6 ISCA accuracy compared with other optimization algorithms

4.2.4 Experiment 4

Comparison of ISCA used with GI filter feature selection against other baseline algorithms used with GI.

To evaluate the performance of the proposed ISCA hybridized with GI, each of the tested algorithms was used together with GI FS. The results of applying all algorithms together with GI FS are presented in Tables 8, 9, and 10. At first, the features of each dataset were ranked by GI weight. The top GI-weighted features were then selected and used by all algorithms at ratios of 10%, 20%, and 30% of the top GI-weighted features.

Table 8 Comparison of average classification accuracy between the ISCA and other algorithms over 10 runs using GI feature reduction
Table 9 Comparison of the average number of selected features between ISCA and other algorithms over 10 runs using GI feature reduction
Table 10 Comparison of average fitness between ISCA and other algorithms over 10 runs using GI feature reduction

From Table 8, it is clearly observed that the ISCA outperforms all other algorithms in terms of classification accuracy over all three datasets, as indicated in bold. These results prove the superiority of ISCA in comparison with the other algorithms. Furthermore, the accuracy resulting from the hybridization of the optimization algorithm with GI feature selection outperforms the accuracy results presented in Tables 6 and 7, where either the optimization algorithm or GI was used alone without hybridization.

In addition, based on the results in Table 9, the superiority of ISCA over the other algorithms in feature reduction is clearly observed, as indicated in bold, although HHO outperforms ISCA in one case. Moreover, ISCA outperforms the standard SCA in feature reduction, as it selects fewer features on all datasets.

Furthermore, as shown in Table 10, the ISCA algorithm outperforms all other algorithms in terms of fitness value over all three datasets. These results are achieved because the ISCA algorithm always obtained the lowest classification error in comparison with the other algorithms under the objective function used. These fitness results were obtained due to ISCA's ability to select the best feature combinations from the feature set and ignore the irrelevant features.

As presented in Figs. 7, 8, and 9, ISCA + GI outperforms all other optimization algorithms in terms of accuracy. This outperformance results from ISCA's ability to balance exploitation and exploration, as well as its ability to avoid the local optima problem.

Fig. 7 ISCA + GI accuracy comparison with other optimization algorithms over D1 dataset

Fig. 8 ISCA + GI accuracy comparison with other optimization algorithms over D2 dataset

Fig. 9 ISCA + GI accuracy comparison with other optimization algorithms over D3 dataset

Based on the results of the conducted experiments, it is clearly observed that applying GI reduction before the optimization algorithm can improve performance by minimizing the number of features and selecting the most relevant ones. It is also found that using GI before applying ISCA reduces the number of features explored by ISCA by keeping it away from irrelevant search areas. In addition, the proposed ISCA algorithm combined with GI not only reduces the number of features but also improves the Hadith classification accuracy.

4.2.5 Experiment 5

Comparison of ISCA used with GI feature selection against baseline Hadith classification works.

The baseline works used different types of Hadith datasets in their experiments, which makes a direct comparison unreliable. In addition, they used different evaluation metrics. Therefore, to compare these baseline works with the proposed work, the accuracy measure was used, and all baseline works were re-implemented and evaluated on the three datasets D1, D2, and D3 for a fair comparison.

In this experiment, we implemented and evaluated the following baseline works, as they represent the best works on Hadith classification, as indicated in Table 1. These baseline methods were implemented and evaluated using the D1, D2, and D3 datasets. ISCA with GI was compared to Hadith classification works including [5], which used NB + TF-IDF; [29], which used ANN + SVD; [2], which used TF-IDF with Random Forest; and [8], which used an SVM classifier with a kernel. The results of the comparison are presented in Table 11.

Table 11 Comparison of accuracy between ISCA + GI and other baseline Hadith classification works

As shown in Table 11, ISCA + GI outperformed all other Hadith classification baseline works in terms of accuracy. This outperformance results from the ability of ISCA + GI to select the relevant features and discard irrelevant ones when training the KNN classifier, in addition to its increased ability to escape from local optima and improve population diversity. Figure 10 presents a comparison between ISCA + GI and the other Hadith classification baseline works. Clearly, ISCA + GI shows superiority over all other baseline works.

Fig. 10 ISCA + GI accuracy compared with other baseline Hadith classification works

4.3 Experiment 6

ISCA comparison with other optimization algorithms using UCI datasets.

In this experiment, to confirm the generality of the proposed ISCA algorithm, we applied ISCA to 14 different datasets from the University of California at Irvine (UCI) repository [37].

In this experiment, the performance of the ISCA algorithm is compared with other optimization algorithms (PSO, GA, GOA, SCA, and HHO). As shown in Table 12 and marked in bold, the ISCA algorithm is superior to all other optimization algorithms in terms of classification accuracy on the 14 UCI datasets. The outperformance of the ISCA is achieved because of its ability to improve the diversity of the solutions using the Singer chaotic map and to avoid local optima using the SA algorithm. Evaluating the accuracy results of the ISCA in Table 12 confirms its outperformance with clear significance over all used datasets. These results confirm ISCA's ability to balance exploration and exploitation while selecting the most informative features.

Table 12 Comparison of average classification accuracy between ISCA and other algorithms over 10 runs using UCI datasets

Based on the average number of selected features shown in Table 13, the ISCA is superior to the standard SCA over all datasets. In addition, the ISCA is superior to the other algorithms, including PSO, GA, GOA, and HHO, on 8 out of 14 datasets. HHO outperformed the other algorithms on 5 out of 14 datasets, whereas the outperformance of PSO, GA, and GOA occurs on only one dataset. The outperformance of the ISCA is justifiable because it always selects the most informative features, which consequently improves the classification accuracy, as shown in Table 12.

Table 13 Comparison of the average number of selected features between ISCA and other algorithms over 10 runs using UCI datasets

Furthermore, based on the fitness results in Table 14, it is clearly observed that the ISCA is also superior to all other optimization algorithms. This is justifiable because the ISCA always targets the most informative features while achieving the minimum classification error over all used datasets compared to the other optimization algorithms.

Table 14 Comparison of fitness between ISCA and other algorithms over 10 runs using UCI datasets

To further confirm the superiority of the ISCA algorithm in comparison with the other optimization algorithms, a paired T-test was computed on the accuracy results of ISCA and the other optimization algorithms presented in Table 8; the results of the statistical test are shown in Table 15.

Table 15 Comparison of classification accuracy between ISCA and other optimization algorithms by using paired T-test

As shown in Table 15, the T-test results of ISCA vs SCA, ISCA vs PSO, ISCA vs GA, ISCA vs GOA, and ISCA vs HHO reject the null hypothesis at a significance level of 0.05. Therefore, this confirms that ISCA is significantly different from the other optimization algorithms over all Hadith datasets.
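For reference, the paired T-test can be reproduced as sketched below; the accuracy vectors are hypothetical placeholders, not the values reported in Table 8.

```python
# A minimal sketch of the paired T-test; the accuracy vectors below are
# hypothetical placeholders, not the values reported in Table 8.
from scipy import stats

acc_isca = [0.90, 0.98, 0.95]  # ISCA accuracy per Hadith dataset (placeholder)
acc_sca = [0.83, 0.96, 0.90]   # SCA accuracy per Hadith dataset (placeholder)
t_stat, p_value = stats.ttest_rel(acc_isca, acc_sca)
print(p_value < 0.05)          # True -> reject the null at the 5% level
```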

Based on the results achieved by the ISCA in comparison with other well-known optimization algorithms and previous studies on Hadith classification, the ISCA algorithm achieved superior performance. As shown in Table 7, ISCA outperformed all other optimization algorithms over all Hadith datasets. To improve the performance further, we combined GI with the ISCA, as can be clearly noticed in Table 8. Based on Tables 7 and 8, the accuracy of the ISCA and of ISCA with GI is superior to the other optimization algorithms, as indicated in bold in these tables. Moreover, as shown in Table 11, ISCA + GI outperformed the other baseline works on Hadith classification over all datasets. For example, ISCA + GI outperformed [8] on D1 by 28%, D2 by 23%, and D3 by 28%; it outperformed [5] on D1 by 29%, D2 by 22%, and D3 by 21%; it outperformed [29] on D1 by 21%, D2 by 19%, and D3 by 17%; and it outperformed [2] on D1 by 30%, D2 by 28%, and D3 by 40%. Furthermore, to confirm the applicability of the ISCA algorithm to other datasets with different types and dimensionalities, we applied it to 14 datasets from the UCI repository, as shown in Table 12.

The ISCA + GI outperformance is attributed to a number of merits, including the combination of two feature selection levels: GI first selects the most informative features so that ISCA avoids exploring irrelevant ones, and ISCA then selects the most informative features from the set selected by GI. These improvements make the ISCA capable of working efficiently on datasets of different types and dimensionalities, ranging from low through mid to high. The overall performance of the ISCA algorithm over all metrics is superior to the other optimization algorithms and previous studies on Hadith classification. This outperformance is attributed to ISCA's ability to avoid being stuck in local optima (the exploitation improvement based on using the SA algorithm to refine the current best solution) and to its ability to improve the other solutions in the population by using the Singer chaotic map. These improvements enhance ISCA's ability to explore search space areas left unseen by ordinary Hadith classification methods and other optimization algorithms. In other words, ISCA has a number of merits: the ability to select the most informative features and ignore irrelevant ones, the ability to improve the best solution in comparison with other algorithms by avoiding the local optima problem, and the ability to work on datasets from different fields with different dimensionalities.

It is noteworthy that we conducted an ablation study over the different phases. First, we considered taking all the features without removing any of them; the results of applying KNN on all features without feature selection are shown in Table 5. Next, we ranked all the features using GI and removed those with the lowest weights, then applied the ISCA to the top-weighted features according to the GI ranking, whereas in another experiment we applied the ISCA without using GI. From these experiments, based on Tables 5, 7, and 8, one can clearly notice the outperformance of the models that employed FS in comparison with applying the KNN to the whole set of features. For example, the accuracy of using the KNN on the whole set of features on D1 is 65%, whereas the accuracy of using the ISCA for FS is 83%, and using ISCA + GI for FS raises it further to 90%. For D2, using the KNN on the whole feature set led to a classification accuracy of 77%, whereas using the ISCA and ISCA + GI for FS yielded 96% and 98% classification accuracy, respectively. For D3, the classification accuracy of using KNN on the whole feature set was 75%, whereas the accuracy rose to 90% and 95% with the ISCA and ISCA + GI, respectively. Clearly, these results highlight the impact of applying FS on the overall performance. In particular, it is evident that considering all the features for classification leads to poor accuracy, whereas selecting relevant features improves the classification accuracy significantly.

In conclusion, considering all the results obtained by the ISCA in contrast to the other Hadith classification baseline works and optimization algorithms, the proposed ISCA proved its stability in terms of feature reduction and outperformed the competing algorithms in terms of classification accuracy. Therefore, we can conclude that the proposed modifications to the SCA improved its ability to handle different types of datasets, as confirmed by the results, which in turn proves its generality and applicability to different domains.

5 Discussions

As discussed above, the current study has several contributions: 1) the development of a hybrid FS approach combining a filter approach with a wrapper approach; 2) the improvements introduced to the SCA algorithm, which address its weaknesses and make it fit for FS on Hadith classification; 3) the collection of three new Hadith datasets; and 4) testing the proposed algorithm on 14 datasets from the UCI repository to confirm its superiority. Furthermore, we implemented several other optimization algorithms for FS on Hadith classification and compared the ISCA results with theirs; the ISCA outperformed these well-known optimization algorithms. Also, to confirm the usefulness of the proposed approach, we compared the ISCA results with existing works on Hadith classification, as shown in Table 11, and these results confirm the outperformance of the ISCA over these works. The advantage of the proposed approach over previous Hadith works results from its ability to select the most informative features and discard irrelevant ones, whereas the previous Hadith works conducted no FS using an optimization algorithm. Accordingly, they obtained poor classification performance because they included irrelevant features in the classification process. Another advantage of the proposed approach is the reduction in feature dimensionality. Therefore, one can clearly notice how the ISCA algorithm excels at Hadith classification, which can motivate other researchers to apply it to new datasets. For future work, the proposed hybrid feature selection method can be applied to other problems such as sentiment analysis and classification tasks. Besides, the ISCA can be applied to engineering problems, global optimization problems, and others.

In a nutshell, the proposed work improves the behavior of the SCA algorithm to make it more suitable for FS problems in Hadith classification. The main contributions of this work can be summarized as follows. We first collected a number of new Hadith datasets. Thereafter, the KNN classifier was applied with and without the feature selection process, in favor of the former; thus, it is clearly noticeable that FS improves the classification performance on Hadith datasets. The research then improves SCA by utilizing chaotic theory and hybridizing it with SA as a local exploiter. These improvements help overcome the weaknesses of SCA, and thus the proposed algorithm performs better at solving the FS problem. In addition, we hybridized ISCA with GI to take advantage of both and discard their disadvantages. To check and confirm the superiority of our work, we compared the ISCA with SCA and other optimization algorithms. Moreover, we compared the results achieved by the ISCA with other Hadith baseline works. Furthermore, we applied ISCA to further benchmark datasets from the UCI repository to confirm the outperformance and superiority of the proposed algorithm. The proposed framework is able to classify the Hadith datasets with highly accurate results, and thus it can be generalized to other classification problems with more complex features.

6 Conclusion

This research work mainly focused on improving the SCA optimization algorithm and applying it to the FS problem for Hadith classification. Two main improvements were proposed to the standard SCA. The first improvement replaces the \(r_{4}\) parameter value in the standard SCA with a Singer chaotic map value to improve solution diversity. The second improvement hybridizes the SA algorithm inside the improvement loop of SCA as a local search algorithm to improve SCA exploitation. In addition, GI filter feature selection was used to keep ISCA from exploring irrelevant search space areas: GI ranks all features, and the top-ranked GI features are selected based on specified ratios. The improved ISCA algorithm was then applied to the GI-selected features in wrapper mode. ISCA selects the optimal feature subset from the whole feature set to train the KNN classifier and improve Hadith classification accuracy.

To evaluate the improved ISCA algorithm, it was compared with other well-known and recent optimization algorithms, including the standard SCA, GA, PSO, GOA, and HHO. In addition, ISCA + GI was compared with Hadith classification baseline works. Three types of Hadith datasets were used to evaluate the proposed methods, and three measurements were used to assess their viability: classification accuracy, number of selected features, and fitness function values.

The results achieved in all experiments prove that the ISCA algorithm achieves better classification accuracy with a reduced number of selected features. Specifically, the accuracy and fitness values attained by the ISCA outperformed all other algorithms in all experiments, and it also outperformed the other algorithms in the majority of the experiments in terms of feature reduction. Thus, the proposed ISCA can improve Hadith classification accuracy and achieve better results than related Hadith classification baseline works. Furthermore, to confirm the generality of the proposed ISCA algorithm, it was applied to 14 benchmark datasets from the UCI repository. The obtained results confirm that ISCA outperformed the other optimization algorithms on these UCI datasets in terms of classification accuracy. This outperformance proves the ability of the ISCA to be applied to different types of problems. Further investigation of the proposed algorithm could be made in the future by applying the ISCA to other domains, such as Malay translated versions of Hadiths or other languages, sentiment analysis, and parameter optimization.