Introduction

Coronaviruses (CoV) are a large family of viruses that cause illness ranging from common cold to more severe diseases. These are positive-stranded RNA (+ ssRNA) viruses with a crown-like appearance. They are mainly susceptible to mutation and recombination and are, therefore, highly diverse. They reside in bats and wild birds, and can spread to other animals and hence to humans. The virus which causes COVID-19 is thought to have originated in bats [1]. Outbreaks of COVID-19 have been reported in various types of workplaces and job categories. All workers should be protected from acquiring COVID-19 because of their work. The prevention of COVID-19 in work settings should be combined with measures for protecting physical and mental health, safety and wellbeing of workers from other occupational hazards in the operation, closures and reopening of workplaces [2]. COVID-19 is likely spread in the following ways; When the virus travels in respiratory droplets when an infected person coughs, sneezes, talks, sings or breathes near you (within six feet). This is thought to be the main way it is spread. Another way is through close contact (touching, shaking hands) with an infected person. And moreover, by touching surfaces that the virus has landed on, then touching our eyes, mouth, or nose before washing the hands [3]. Due to the huge growth of machine learning techniques, predicting and diagnosing diseases made it easy in current days. Machine learning can learn through machines and makes a proper prediction by self-learning. Several techniques of machine learning were utilized to identify the affected individuals with personalized characteristics. Machine learning is an exclusive technology that is used to allow a computer to train itself and make correct decisions and can be employed for predicting coupled information from previous records, geography, and demographics, climate as well as outbreak severity [4]. Machine learning (ML) is one of the most advanced concepts of artificial intelligence (AI), and provides a strategic approach to developing automated, complex and objective algorithmic techniques for multimodal and dimensional biomedical or mathematical data analysis [5]. This study presents the machine learning model as a method for predicting the transmission of COVID-19 pandemic in an easily understandable way using statistical visualization graphs e.g., normal distribution. It determines the predictive value of the technique with quality and density of collected data of WHO. The resulting predictions will reduce the population’s anxiety and prepares their mentality for accepting and dealing with the next phases of the pandemic. This prediction study proposed a model for COVID-19 prediction and it has achieved a classification accuracy of 91.62%. Even more important fact is it yields a sensitivity of around 95% for COVID-positive cases [6].

From all the researches made so far stated that, the machine learning models can be used to predict and detect this disease (Fig. 1). It will help the people to reduce their panic about the symptoms caused by normal cold or fever. This pandemic will end definitely once all of us must be vaccinated and as well as ensuring the guidelines stated by WHO while in workplace, traveling, in social gathering etc., Even though we used to research on this virus alleviation and mutiny, we need the prediction about causes of spread, symptoms, death rate, recovery rate, and remedies.

Fig. 1
figure 1

Prediction using machine learning models

The remaining paper contains the following subsections; “Related Work” for work done by other researchers after reception of this disease, “Machine Learning Methods” to describe the various machine learning models, “Case Study Table” (Table 1) to compare the methodologies used, and Conclusion.

Related Work

Due to COVID-19 pandemic, the need of disease analysis has been raised to detect whether a person is infected with virus or not. In the year 2021, Corona 2.0 obscured many people lives. To avoid this situation in future, health care providers must have some prediction techniques to save human life [1, 2]. Supervised machine learning is used to get better prediction accuracy. These are some papers given the results regarding Corona disease prediction and detection.

In recent months, different techniques using image processing are done by various researchers. In this paper, a major review on image acquisition, segmentation, diagnosis, avoidance, and management are presented. An analytical comparison of the various proposed algorithm by researchers for corona virus has been carried out. ML methods train the input data and analyze the output data statistically. The application of ML includes the detection of infected persons and the temperature of the person [7]. The machine learning models was trained to predict drug structures that could potentially treat COVID-19. The model is employed with AI neural network with multiple hidden layers and neural systems such as repetitive neural networks (RNN), convolutional neural networks (CNN), deep belief networks (DBN), and completely associated feed-forward systems that supported to investigate drug molecule design and re-positioning of drug molecules [8, 9].

While the RT-PCR is the silver bullet test for confirming the COVID-19 infection, it is limited by the lack of reagents, time-consuming, and the need for specialized labs. This study develops six predictive models for COVID-19 diagnosis using 6 different classifiers (i.e., BayesNet, Logistic, IBk, CR, PART, and J48) based on 14 clinical features. This study retrospects 114 cases from the Taizhou hospital of Zhejiang Province in China. The results showed that the CR meta-classifier is the most accurate classifier for predicting the positive and negative COVID-19 cases with an accuracy of 84.21% [10].

Generally, a huge amount of data is accumulating regarding the COVID-19 pandemic, which makes hot research topics for machine learning researchers. This study considered machine learning approaches to predict the spread of the COVID-19 in many countries. The experimental results of the proposed model showed that the overall R2 is 0.99 from the perspective of confirmed cases. A machine learning model has been developed to predict the estimation of the spread of the COVID-19 infection in many countries and the expected period after which the virus can be stopped [11, 12].

The increasing interest in developing artificial intelligence (AI) applications has addressed several medical problems. Some of the AI applications in healthcare are AI-enabled clinical support system, machine learning for drug discovery [13], diagnosing tissue samples, and AI-assisted image analysis for radiology etc. However, such applications remain insufficient given the high potential threat posed by this virus to global public health.

This systematic review addresses automated AI applications based on data mining and machine learning (ML) algorithms for detecting and diagnosing COVID-19. We aimed to obtain an overview of various deep learning techniques included 3D and 2D analysis of the chest CT images a Hybrid-Covid network, an IRRCNN model with NABLA-N network and Covid-Net and CheXNet network architecture-based detection. The accuracy of the CheXNet architecture-based detection had the highest accuracy. [14].

Sensing the threatening impacts of COVID-19, researchers of computer science have started using various techniques and approaches of machine learning and deep learning to detect the presence of the disease using X-rays and CT images. Artificial intelligence-centered tools can be designed and developed quickly for adapting the existing AI models and for leveraging the ability to modify and associating them with the preliminary clinical understanding to address the new group of COVID-19 and the novel challenges associated with it [15].

Many research done so far have emphasized that we need a more effective technique to detect and forecast infectious people based on all of these facts. The following section contains a diagram (Fig. 2) that depicts various machine learning models to enhance the research in prediction and detection of crucial COVID-19 in early days from the basic symptoms.

Machine Learning Models

Machine learning is typically a system of automated data processing algorithms that help to make decisions more natural and enhance performance based on the results. The “Learning” here means that the algorithm can garner new information and insights without being explicitly programmed. There are several models of machine learning present. From those, some primary paradigms are described as follows [16]:

Supervised machine learning: It is defined as, use of labeled datasets to train algorithms that can classify data or predict outcomes accurately. As input data are fed into the model, the weights are adjusted until the model is properly fitted, which happens during the cross-validation phase. Organizations can use supervised learning to tackle a range of real-world problems at scale, such as spam classification in a distinct folder from our email.

Unsupervised machine learning: These algorithms can be used to find patterns as the inputs are unlabeled. Unlabeled datasets are analyzed and clustered using machine learning methods. Without the need for human intervention, these algorithms uncover hidden patterns or data groupings. It is the best solution for exploratory data analysis, cross-selling techniques, consumer segmentation, and image identification because of its capacity to detect similarities and differences in information.

Reinforcement machine learning: Similar to supervised ML, but in this case, instead of a labeled output, there are rewards and the algorithm’s goal is to maximize rewards. Reinforcement learning is a machine learning training strategy that rewards desirable behaviors while penalizing undesirable ones. A reinforcement learning agent can perceive and comprehend its surroundings, act, and learn through trial and error in general.

Semi-supervised machine learning: Input data are a mixture of labeled and unlabelled, so the model must learn structures to organize as well as make predictions. Semi-supervised learning is a type of machine learning that tries to solve issues involving both labeled and unlabeled data. Semi-supervised learning makes use of mathematical concepts such as clustering and classification method features.

From these types, supervised learning is intensively helpful in finding solution for real-world computational problems. The algorithms predict the outcome results from unlabeled data. Different types of supervised learning are regression, classification, naïve Bayesian Model, decision trees, random forest model, neural networks, support vector machines.

The diagram (Fig. 2) below showed the major types of machine learning models.

Fig. 2
figure 2

Machine learning models overview

Discussion

The purpose of this study is to recognize which machine learning method works well in predicting and detecting diseases with highest accuracy. The entire world faces a big crisis due to COVID-19. During this pandemic, more researches have been made to identify the infectious persons in a proper way at right time. From those researches, this comparison table illustrates that the supervised learning plays a vital role in prediction using symptoms, clinical features and chest X-rays. Some researchers used classification model with different algorithms for example, shallow single-layer perceptron neural network, Gaussian process regression, Bayes Net, logistic, J48, multinomial Naive Bayes classifier, support vector machine etc., [17,18,19,20,21,22,23,24]

Some researchers used regression, recurrent neural network and convolutional neural network. From all the experiments and studies, it is noted that supervised learning techniques are alone used to obtain more accurate results. To develop an effective algorithm for detection of corona virus presence in human beings, the above table (Table 1) would be helpful. The results were compared and it is observed that classification model is best among all the others. It yields higher accuracy, more specificity when compared with regression and convolutional neural networks. Furthermore, this study would be useful to develop a new algorithm for detecting this deadly disease in efficient manner. The research may be fruitful by concentrating on new classification model.

Table 1 Comparison table about method, algorithmic models used to predict and detect COVID-19

Conclusion

Thus, this comparative case study ensures that the machine learning models are efficient in a way to predict and detect the infectious persons. In that, supervised learning algorithms are more powerful when compared to other algorithms. This study focused to show that this prediction will be helpful to end this epidemic in future. Our conclusion is machine learning applications in the field of disease diagnostics showed promising results with higher accuracy, and specificity. In particular, supervised machine learning takes a leading role in disease diagnosis. This machine learning technique yield accurate results when compared to other techniques. The researchers have used so many models such as, linear regression, naïve Bayes classifier, support vector machine, decision tree and ResNet50 etc., to train and test the data set for predicting and detecting Coronavirus disease in an effective manner. This study will be expanded into developing a new algorithm based on the models available in terms of supervised machine learning for earlier diagnosis of diseases especially COVID-19.