
1 Introduction

Rapid developments in information science have resulted in a dramatic increase in dataset dimensionality over the past decade. Effective dimension reduction algorithms are needed to remove redundant or irrelevant information from these datasets, since such features can degrade the performance of learning algorithms [22].

Typically considered a preprocessing mechanism, feature selection is used to decrease the total number of input variables and to find the most relevant subset of the complete feature set. Feature selection reduces the dimensionality of data by removing noisy and irrelevant attributes. This challenge is especially important when real-time classification is needed: by finding an optimal or near-optimal subset of features, the training process can be shortened and classification accuracy can be improved. It is applied to increase the precision of the predictions produced by a machine learning model by reducing complexity and diminishing redundant and irrelevant features in the dataset, which can be crucial in critical applications such as medical diagnostics [10]. Feature subset evaluation and search strategy are the two primary stages of this preprocessing. The search strategy employs techniques for selecting candidate feature subsets, while feature subset evaluation utilizes a classifier to assess the quality of the selected subset. According to the reviewed literature, all feature selection methods are defined as either filter based or wrapper based.
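As a concrete illustration of the wrapper-based evaluation stage, the following minimal Python sketch scores a candidate feature subset (a binary mask) by training a classifier on the selected columns only; the dataset and classifier are illustrative choices, not those used in this study.

```python
# Minimal sketch of wrapper-style feature subset evaluation: a candidate
# subset (boolean mask) is scored by a classifier trained on the selected
# columns. Dataset and classifier are illustrative, not the paper's setup.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = load_breast_cancer(return_X_y=True)

def evaluate_subset(mask):
    """Return mean cross-validated accuracy of KNN on the masked features."""
    if not mask.any():                      # an empty subset is invalid
        return 0.0
    clf = KNeighborsClassifier(n_neighbors=5)
    return cross_val_score(clf, X[:, mask], y, cv=5).mean()

rng = np.random.default_rng(42)
mask = rng.random(X.shape[1]) < 0.5         # a random candidate subset
print(f"{mask.sum()} features -> accuracy {evaluate_subset(mask):.3f}")
```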

Metaheuristic algorithms are considered among the most reliable and efficient optimization techniques and show great results when applied to more challenging problems and higher-dimensional datasets. As a result, these algorithms show great promise and have been applied to many real-world problems that require optimization and performance improvements [3, 4, 25, 32, 34, 36]. Although these algorithms are often nature inspired, this is not necessarily always the case, as shown by the sine cosine algorithm (SCA) [20].

Because of the high accuracy achieved, as well as the reduced computational times when compared to traditional discrete methods, the metaheuristic approach has been employed by researchers in wrapper-based methods for solving the feature selection problem. The application of a Gaussian mutational chaotic fruit fly optimization algorithm [31] has been suggested for tackling feature selection, specifically in classification tasks. An augmented model of the dragonfly algorithm (DA), the hyper-learning binary dragonfly algorithm (HLBDA), has been implemented for feature evaluation and applied to coronavirus disease (COVID-19) datasets [28].

SCA is a population-based algorithm, named after its use of the sine and cosine functions in its formulation, originally intended for solving optimization problems [20]. The algorithm initially creates a collection of randomized solutions and then, employing a mathematical model built from the sine and cosine functions, requires them to fluctuate toward the best solution during the exploitation phase or outwards to encourage exploration.

Some deficiencies were observed in the original SCA while performing practical empirical simulations with standard unconstrained benchmarks. Because of this, we have attempted to improve the basic SCA by hybridizing it with the well-known artificial bee colony (ABC) algorithm. The resulting mSCA is benchmarked using ten datasets from the University of California, Irvine (UCI) repository and Arizona State University, as well as a single coronavirus disease (COVID-19) dataset.

The main contributions of the conducted research can be outlined as follows:

  • Proposal of an mSCA applied to the feature selection problem: elements of the ABC algorithm are integrated into the SCA to improve its exploratory behavior.

  • Testing the mSCA on ten standard benchmark datasets, with low-, medium-, and high-dimensional sets represented.

  • Comparing the mSCA to other advanced feature selection algorithms and demonstrating the improvements made.

  • Applying the proposed mSCA to a COVID-19 case study.

The remainder of this article is organized as follows: Sect. 2 gives a summary of the reviewed literature. Section 3 describes the original SCA and the proposed modification. Section 4 presents the experimental results and discusses the findings. Section 5 summarizes the findings and proposes directions for further work in this field.

2 Literature Review

When large datasets are too difficult to classify, the use of swarm intelligence-based algorithms is suggested. Every large dataset contains insignificant and irrelevant features, which can make data analysis and interpretation difficult. The purpose of a swarm intelligence algorithm here is to reduce dimensionality (feature selection) by keeping only useful features and those containing rich information. As a result of using a dimensionality reduction technique, we obtain a better understanding and interpretation of the data, as well as higher accuracy of the results. There are two main steps in the dimensionality reduction process: extracting features and selecting features. Before explaining those steps further, we give a short overview of swarm intelligence algorithms.

Swarm intelligence algorithms are part of the artificial intelligence (AI) field and belong to the so-called nature-inspired metaheuristics [29]. Many groups of animals form a collective intelligence in which every member acts independently while mutually exchanging information; that information eventually guides the group toward the optimal solution of its problem. Such animal colonies include ants, birds, hawks, fish, and more [16, 21, 29]. Nature-inspired metaheuristics are not guaranteed to find the global optimum inside the search area, but they are efficient at finding good candidate solutions, and they are especially effective inside very large search spaces. The problems they are typically applied to are NP-hard, for which finding the exact optimum takes an unreasonable amount of time [15]. Many diverse problems can be solved with swarm intelligence algorithms, such as wireless sensor network optimization [4, 32], cloud computing [6, 8, 35], optimization of neural networks [2, 5, 12, 24], machine learning, and COVID-19 prediction [33], all the way to solving complicated problems in the field of medicine [7].

Feature extraction is used to prepare raw and unprocessed data [17]. A new dataset is formed by keeping some of the core features, after which new features can be derived. The result is a dataset that is cleaner, contains only features relevant to the specific problem, and has fewer dimensions than the original dataset.

Since the most relevant and important data are obtained after feature extraction, the next step is feature selection. With feature selection, we select attributes previously defined in the original dataset. This step is extremely important, since the right combination of attributes can improve the model's performance and accuracy. A common example of feature selection, alongside feature extraction, is image processing and analysis: a large number of statistical features can be retrieved from an image, but a combination of only a few gives satisfactory results.

A side effect of feature selection is the possible loss of a certain amount of information, but the resulting simplicity of the model and the significant performance improvement make it well worth it. There are three distinct categories of feature selection techniques: wrapper, filter, and embedded [9].

Filter techniques choose the features expected to contain the most information, without taking into consideration any relationships between the features. Wrapper techniques search through feature combinations and choose the subset on which our machine learning model is most accurate. With embedded techniques, the features are chosen while the model is being constructed [9]. These techniques achieve decent performance on relatively small datasets, but their performance declines on larger datasets, where a different method, such as a swarm intelligence algorithm, should be used; such an algorithm provides satisfactory results on large datasets in a reasonable amount of computational time.
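For illustration, the sketch below contrasts the first two categories in scikit-learn: a univariate filter that scores each feature independently, and a wrapper that repeatedly refits a model to rank feature combinations. The dataset, model, and number of retained features are placeholder choices.

```python
# Filter vs. wrapper selection (illustrative only): the filter scores
# features one at a time, the wrapper (RFE) refits a model repeatedly.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)

# Filter: keep the 10 features with the highest ANOVA F-score
filt = SelectKBest(score_func=f_classif, k=10).fit(X, y)

# Wrapper: recursive feature elimination around a linear model
wrap = RFE(LogisticRegression(max_iter=5000), n_features_to_select=10).fit(X, y)

print("filter picks :", filt.get_support().nonzero()[0])
print("wrapper picks:", wrap.get_support().nonzero()[0])
```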

3 Original and Proposed Modified Sine Cosine Algorithm

The SCA, first introduced by Seyedali Mirjalili [20] and originally designed for solving optimization problems, is a relatively new population-based algorithm. The algorithm stochastically searches for the optimal solution to a given problem. It starts with a randomized set of solutions, then repeatedly evaluates this set against an objective function and follows a ruleset that forms the core of the optimization technique. As such, finding the optimal solution in the first iteration is not guaranteed; however, given enough iterations and a large enough collection of randomized solutions, the probability of finding the global optimum increases.

The process of optimization in the stochastic population-based approach, regardless of the algorithm being applied, can be split into two distinct phases: exploration and exploitation. In the exploration phase, the algorithm quickly, and in a very random manner, combines solutions from a given random set, scanning the search space for the most favorable regions. In the exploitation phase, changes are made gradually and are noticeably less severe than those in the exploration phase.

The original SCA proposes the following equations for position updating in both phases, given as Eq. (1):

$$\begin{aligned} \begin{aligned} X_i^{t+1} = X_i^t + r_1 \times \sin (r_2) \times |r_3P_i^t - X_i^t | \\ X_i^{t+1} = X_i^t + r_1 \times \cos (r_2) \times |r_3P_i^t - X_i^t | \end{aligned} \end{aligned}$$
(1)

where \(X_i^t\) represents the current solution's position in the i-th dimension at the t-th iteration, \(P_i^t\) represents the position of the destination point in the i-th dimension, \(r_1\), \(r_2\), and \(r_3\) are random numbers, and \(|\cdot |\) indicates an absolute value.

In Eq. (2), a combination of these two Eq. (1) can be seen:

$$\begin{aligned} X_i^{t+1} = {\left\{ \begin{array}{ll} X_i^t + r_1 \times \sin (r_2) \times |r_3P_i^t - X_i^t |, &{} r_4 < 0.5\\ X_i^t + r_1 \times \cos (r_2) \times |r_3P_i^t - X_i^t |, &{} r_4 \ge 0.5 \end{array}\right. } \end{aligned}$$
(2)

where \(r_4\) represents a random value in [0,1].

The four major parameters of the SCA are \(r_1\), \(r_2\), \(r_3\), and \(r_4\), as shown in the equations above. Parameter \(r_1\) determines the region of the next position, which lies either in the space between the solution and the destination or outside of it. Parameter \(r_2\) dictates how far the movement toward or away from the destination is. The role of parameter \(r_3\) is to stochastically diminish (\(r_3 < 1\)) or emphasize (\(r_3 > 1\)) the effect of the destination in defining the distance. Lastly, parameter \(r_4\) switches between the sine and cosine components in Eq. (2).
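A minimal NumPy sketch of this update rule follows; the sampling ranges for \(r_2\) and \(r_3\) (\([0, 2\pi ]\) and \([0, 2]\), respectively) follow the original SCA paper, while the vectorized form is an implementation choice.

```python
# One SCA position update per Eq. (2) for the whole population.
import numpy as np

def sca_update(X, P, r1, rng):
    """X: (n_solutions, dim) positions; P: (dim,) destination point."""
    r2 = rng.uniform(0.0, 2.0 * np.pi, size=X.shape)   # movement distance
    r3 = rng.uniform(0.0, 2.0, size=X.shape)           # destination weighting
    r4 = rng.random(X.shape)                           # sine/cosine switch
    step = np.abs(r3 * P - X)                          # |r3 * P_i - X_i|
    return X + r1 * np.where(r4 < 0.5, np.sin(r2), np.cos(r2)) * step
```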

The effects of the sine and cosine functions in Eqs. (1) and (2) are depicted in Fig. 1. As shown in the figure, these two equations dictate the search space between two solutions. The equations can also be extended to higher dimensions; however, Fig. 1 depicts the two-dimensional case.

Fig. 1 Sine and cosine effects on the upcoming position from Eq. (1)

The cyclic pattern of the sine and cosine functions allows a solution to be repositioned around another solution, which guarantees exploitation of the space defined between the two solutions. Altering the range of the sine and cosine functions enables solutions to search outside the space defined by their corresponding destinations, which ensures exploration.

Fig. 2 Sine and cosine with the range in \([-2,2]\) allow a solution to go around (inside the space between them) or beyond (outside the space between them) the destination

When changing the function range, as shown in Fig. 2, the new position of a solution must be updated taking into account the positions of the existing solutions. The updated position, which can fall either inside or outside the space between the solution and the destination, is attained by choosing a random value in the range \([0,2\pi ]\) for \(r_2\) in Eq. (2). This mechanism ensures both exploitation and exploration of the search space.

The algorithm needs to balance exploration and exploitation when searching for promising regions inside a given search space in order to eventually converge to the global optimum. The SCA does this by changing the range of the sine and cosine in Eq. (2) adaptively, according to Eq. (3):

$$\begin{aligned} r_1=a-t{\frac{a}{T}} \end{aligned}$$
(3)

where a represents a constant, T represents the maximum number of allowed iterations, and t represents the current iteration.

Through many iterations of Eq. (2), with \(r_1\) decaying according to Eq. (3), the range of the sine and cosine decreases, as shown in Fig. 3.

Fig. 3 Decreasing range of sine and cosine (\(a=3\))

By taking into consideration both Figs. 2 and 3, it can be deduced that the SCA focuses on exploitation when the given ranges are in \([-1,1]\), and on exploration when the ranges are in (1, 2] and \([-2,-1)\).

Fig. 4 General steps of the original SCA algorithm

Finally, the pseudocode of the SCA is shown in Fig. 4. As depicted, the algorithm begins the optimization process with a randomized set of solutions. Whenever the algorithm encounters a solution that is the best obtained so far, it assigns it as the destination point. The algorithm then updates the other solutions with regard to this best solution. During this process, the iteration counter is increased, and the ranges of the sine and cosine functions are updated after every iteration to emphasize exploitation of the search space. When the counter reaches the maximum allowed number of iterations, the optimization process of the original SCA stops. Other termination conditions can be implemented as well, such as a total number of function evaluations or reaching a desired accuracy of the global optimum.
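This loop can be sketched compactly in Python as follows, reusing the sca_update helper from the earlier snippet; the sphere objective, bounds, and parameter values are placeholders rather than the experimental setup of this paper.

```python
# Compact sketch of the SCA loop in Fig. 4, combining the Eq. (2) update
# (via sca_update, defined above) with the Eq. (3) decay of r1.
import numpy as np

def sca(objective, lb, ub, dim, n=30, T=500, a=2.0, seed=0):
    rng = np.random.default_rng(seed)
    X = rng.uniform(lb, ub, size=(n, dim))          # random initial solutions
    fit = np.apply_along_axis(objective, 1, X)
    best, best_fit = X[fit.argmin()].copy(), fit.min()
    for t in range(T):
        r1 = a - t * (a / T)                        # Eq. (3)
        X = np.clip(sca_update(X, best, r1, rng), lb, ub)
        fit = np.apply_along_axis(objective, 1, X)
        if fit.min() < best_fit:                    # new destination point
            best_fit, best = fit.min(), X[fit.argmin()].copy()
    return best, best_fit

best, val = sca(lambda x: float(np.sum(x ** 2)), lb=-10.0, ub=10.0, dim=20)
print(f"best sphere value: {val:.2e}")
```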

3.1 Proposed Modified SCA Approach

Notwithstanding the fact that the basic SCA metaheuristics establish excellent results on standard benchmark instances [20], additional experiments conducted with basic Congress on Evolutionary Computation (CEC) benchmark suites led to the conclusion that the basic SCA can be further improved.

Like many other swarm intelligence approaches, the original SCA may get stuck in non-optimal regions of the search domain in early iterations of its execution. In this early phase, due to a lack of exploration power, if the search process is not "lucky" and does not register the optimal domain of the search space, the algorithm may remain stuck in a sub-optimal domain for many iterations. As a consequence, worse mean values are generated, and the performance of the metaheuristic is seriously degraded.

Without adding complexity to the algorithm, the abovementioned drawback of the original SCA can be overcome by introducing a simple mechanism into the search process: after every iteration during the first 50% of iterations, the 5% worst solutions in the population are replaced with randomly generated individuals within the boundaries of the search space:

$$\begin{aligned} X_{\text {rnd}}^{j} = L^{j} + \phi \cdot (U^{j}-L^{j}), \end{aligned}$$
(4)

where \(X_{\text {rnd}}^{j}\) is the j-th component of the newly generated random solution, \(\phi \) is a value drawn from the uniform distribution, and \(U^{j}\) and \(L^{j}\) are the upper and lower boundaries of the j-th parameter, respectively.

Based on the conducted simulations, it was concluded that the described exploration mechanism should be triggered during approximately the first 50% of iterations. In later iterations, this mechanism is not needed and would only obstruct a fine-tuned search around the promising domain of the search region. The proposed method is named modified SCA (mSCA), and its pseudocode is shown in Algorithm 1.

Algorithm 1 Pseudocode of the proposed mSCA
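A minimal sketch of the replacement mechanism is given below; the 5% fraction and the first-half trigger follow the text, while the exact integration point is an assumption. In the sca() loop sketched earlier, the call would sit immediately after each fitness evaluation.

```python
# Sketch of the mSCA modification only: during the first T/2 iterations,
# the worst 5% of solutions are re-seeded with random points per Eq. (4).
import numpy as np

def replace_worst(X, fit, lb, ub, t, T, rng, frac=0.05):
    """Replace the worst `frac` of the population (minimization assumed)."""
    if t >= T // 2:                        # mechanism disabled in later iterations
        return X
    k = max(1, int(frac * X.shape[0]))     # 5% of the population, at least one
    worst = np.argsort(fit)[-k:]           # indices of the k worst solutions
    phi = rng.random((k, X.shape[1]))      # uniform phi in [0, 1]
    X[worst] = lb + phi * (ub - lb)        # Eq. (4): L + phi * (U - L)
    return X
```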

4 Experiments and Discussion

In the research presented in this manuscript, the proposed mSCA algorithm was tested on ten basic datasets and one additional COVID-19 dataset. The experimental simulations were executed over 20 independent runs, with each run consisting of 70 iterations. The population size was set to 8, and a mixed initializer was utilized that randomly selects 2/3 of the available features. The performance of the suggested improved optimization method has been tested on ten UCI datasets, very popular among researchers as benchmarks, which are listed in Table 1.
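The mixed initialization scheme is not spelled out in the paper; one plausible reading, sketched below under that assumption, has each of the eight binary solutions activate a randomly chosen two-thirds of the features.

```python
# Hypothetical reading of the mixed initializer: each binary solution
# activates a random 2/3 of the available features.
import numpy as np

def init_population(n_solutions=8, n_features=30, ratio=2 / 3, seed=0):
    rng = np.random.default_rng(seed)
    pop = np.zeros((n_solutions, n_features), dtype=bool)
    k = int(round(ratio * n_features))           # features per solution
    for i in range(n_solutions):
        idx = rng.choice(n_features, size=k, replace=False)
        pop[i, idx] = True
    return pop

print(init_population().sum(axis=1))             # ~20 selected features each
```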

The performance of mSCA was evaluated on a computer with a central processing unit (CPU) with a clock frequency of 2.90 GHz and 16 GB of random access memory (RAM), with the method implemented in Python within the Anaconda framework using machine learning libraries including NumPy, SciPy, and scikit-learn. The performance is judged on five evaluation metrics: best fitness value, average fitness value, standard deviation of the fitness value, classification accuracy, and the feature selection ratio, with each method executed and evaluated 20 times. The repetition is performed to better represent the results and to avoid bias caused by the stochastic nature of optimization algorithms. The averaged results are logged and presented after the last iteration of the 20 individual runs.
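For concreteness, the sketch below shows a wrapper fitness of the form commonly used in the feature selection literature, combining classification error with the selected-feature ratio; the weighting \(\alpha \) and the KNN classifier are assumptions rather than details taken from the paper.

```python
# A common wrapper fitness: alpha * error + (1 - alpha) * |S| / |F|.
# The alpha = 0.99 weighting and KNN classifier are assumptions.
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def fitness(mask, X, y, alpha=0.99):
    """Lower is better; combines error rate with the selected-feature ratio."""
    if not mask.any():                             # an empty subset is invalid
        return 1.0
    acc = cross_val_score(KNeighborsClassifier(), X[:, mask], y, cv=5).mean()
    return alpha * (1.0 - acc) + (1.0 - alpha) * mask.sum() / mask.size
```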

The mSCA is tested against ten standard datasets and the COVID-19 dataset, and its performance is then evaluated. The datasets are acquired from the UCI repository [11] and Arizona State University [18]. Table 2 presents the best overall fitness, while Table 3 presents the mean fitness metric. Table 4 reports the standard deviation and Table 5 the percentage of selected features for the referenced ten datasets, while the average classification accuracy is shown in Fig. 5. The best results are marked in bold in each table, except in the case of a tie, where none of the results are marked. Tests of the proposed mSCA have been conducted on datasets of different structures, so as to provide evidence of the algorithm's efficiency and performance in differing dimensions.

Table 1 List of experimental simulation datasets
Table 2 Best fitness metric over ten UCI datasets for the compared approaches
Table 3 Statistical mean fitness metric over ten datasets for the compared approaches
Table 4 Standard deviation results for ten datasets included in the comparative analysis
Table 5 Percentage of selected feature for ten datasets included in comparative analysis

The results obtained in Tables 2, 3, 4 and 5 from the conducted experiments prove the efficiency and efficacy of the proposed mSCA algorithm. Based on the empirical analysis, it can be deduced that the proposed mSCA yields higher-quality results than the algorithms it was tested against. The eight competitor algorithms included in this paper are the binary dragonfly algorithm (BDA) [19], binary artificial bee colony (BABC) [14], binary multiverse optimizer (BMVO) [1], binary particle swarm optimization (BPSO) [30], chaotic crow search algorithm (CCSA) [23], binary coyote optimization algorithm (BCOA) [27], evolution strategy with covariance matrix adaptation (CMAES) [13], and success history-based adaptive differential evolution with linear population size reduction (LSHADE) [26].

Fig. 5 Average classification accuracy over ten datasets included in the comparative analysis

Fig. 6 Accuracy and feature size of the proposed SCA and mSCA on the COVID-19 dataset

Based on the presented results, it can be concluded that the proposed mSCA metaheuristics clearly outperformed the original SCA approach for all observed metrics. In general, when compared to the other approaches included in the simulations, mSCA obtained the best performance. Based on the results from Table 2, the proposed mSCA approach obtained the best results for the best fitness metric on five out of the ten UCI datasets. When the statistical mean fitness metric is observed, from Table 3, it can be concluded that the mSCA obtained the best results on six out of ten UCI datasets. In the case of the standard deviation, Table 4 shows that the mSCA obtained the best results on four datasets and tied for the best result on the Glass dataset. In Table 5, a comparative analysis between the proposed mSCA and the other approaches in terms of selected features (expressed as ratios of the total number of features in the datasets) is presented. From the results, it can be seen that the proposed mSCA on average utilizes a smaller number of features than the other methods, meaning that it managed to substantially reduce the problem dimensions, which makes the training process of a classifier much faster (Figs. 5 and 6).

5 Conclusion

The research presented in this manuscript proposes a novel feature selection method. The implemented mSCA metaheuristics address the drawbacks of the original SCA method observed in the results of the conducted experiments. The proposed mSCA approach was then used to help find the crucial features for the classification process. The presented optimization method was validated on ten benchmark datasets, and the results are presented in comparison with other swarm intelligence metaheuristics. Finally, the mSCA method was applied to the COVID-19 dataset. The experimental results indicate that the mSCA approach outperformed the other methods included in the comparative analysis. Based on the defined research contributions, the novelty of the proposed research can be summarized as follows: a more efficient SCA metaheuristic was devised; the feature selection challenge was addressed with improved classification accuracy and a reduced number of employed features; and classification of the recent and important COVID-19 dataset was performed.

Future research in this area will focus on including additional datasets in the experimental simulations. Future work will also deal with the adaptation of other swarm intelligence metaheuristics, with the goal of further enhancing the classification accuracy.