Keywords

1 Introduction

Brain is the most respective part of the human body. The Brain and the spinal cord comprise of the body’s nervous system. The Brain controls the action of the body through this nervous system and this involves receiving information from the sensory organs of the body, interprets the information, and then guides the body’s response. Any infection, damage, abnormality to the Brain can cause threat to the normal functioning of the Brain and even human life. Advances in computer-aided diagnosis have simplified the task of a radiologist for manual detection of the tumors. Advancement in technology like artificial intelligence and machine learning has made diagnosis more accurate, time efficient, and cost-effective. In this paper, we discuss some of the machine learning techniques used for Brain tumor detection.

1.1 Challenges in Machine Learning

The kind of learning where a machine improves its performance based on experience is called as inductive learning. An agent is said to learn from experience with respect to some task and is measured by performance say ‘p,’ then ‘p’ improves with experience, and this is the actual concept of machine learning (a canonical definition stated by Tom Mitchell in 1997) [1]. There are several challenges to build a machine learning solution. These are: How good the model is? How one does choose a model? Does one have enough data? Is the data of sufficient quality? Is there any error in the data? Is one describing the data correctly? How confident can one be the result? The above questions have to be solved correctly in order to build a machine learning solution using some of the important machine learning assumptions/rules.

Supervised Learning: In the supervised learning, there are labeled data and the goal here is to come up with a function where input is given to get the output. The process for the supervised learning algorithm is shown in Fig. 1.

Fig. 1
figure 1

Block diagram for process of supervised learning

In the supervised learning, the training algorithm gets the input and produces the output and compares the output with the target, and if there is difference between the target and the output, error occurs which is sent to the agent for updating the weights [2]. Another problem in supervised learning is prediction or regression where the output predicted is not a discrete value and is a continuous value where most of the error occurs and the solution to fit the noise/error in the data is to over fit the data; here, care must be taken to avoid the over fit of the training data and so we can go for linear regression to minimize the sum of square errors where the aim is to minimize the prediction error by taking the square of the errors.

Unsupervised Learning: Here, there is an unlabeled training data set where clustering is done with groups of coherent or cohesive data points in the input space. The analysis is based on how spread out the points is that belong to the single group. In image processing, it can be with respect to image pixels depending on the similar pixels. The association rules are mining or learning frequent patterns or rules which are conditional dependency and these patterns are with respect to sequences, transactions, and graphs [2] (Fig. 2).

Fig. 2
figure 2

Block diagram for process of unsupervised learning [1]

Reinforcement Learning: It is the learning of a machine to control a system through trial and error and minimal feedback (Fig. 3).

Fig. 3
figure 3

Block diagram for process of reinforcement learning [1]

1.2 A Broad View on Different Techniques Used for Brain Tumor Detection

Brain tumor is detected from images acquired from Brain Magnetic Resonance Imaging (MRI). The image thus captured is preprocessed and then the region of interest is extracted; this process is called as segmentation of the image. The region of interest (ROI), i.e., the tumor region, is obtained by segmenting the input image which is a machine learning process.

The approach of the segmentation depends upon classifying the similar category of intensity values in an image and to form group of such similar intensity values (subjects). Here, the approach is based on two standards; i.e., similarity of intensity values is the dissimilarity between them and the task is either to find the similarity measure between the subjects and group them, and these subjects will have most similar index grouped into clusters or the dissimilarity among the objects separate to give the most dissimilar subjects in the space [3]. Figure 4 shows the different methods used for segmentation [4].

Fig. 4
figure 4

Block diagram to represent a broad view on the techniques present for detection of Brain tumor

2 Related Work

In this paper, the segmented tumor is obtained by machine learning algorithm. The flow for the algorithm is as shown in the shaded region of Fig. 4. The Brain MRI images obtained are corrupt with noise so the initial step is to preprocess the image to reduce the unwanted noise and the high-frequency components. For a machine learning algorithm, the data is given as some kind of a probability distribution, where the input is drawn from a P-dimensional real space and the output is also a real space which can be given as

$$ {\text{Input}}\;X = R^{P} $$
(1)
$$ {\text{Output}}\;Y = R $$
(2)

The probability distribution is obtained from a set of samples that are drawn from {(x1, y1) (x2, y2), (x3, y3), …, (xn, yn)}, and these can also be called as training data set.

Learn a function

$$ f(x)R^{P}\,=\,R $$
(3)

Then, the function produces a predicted output which can be given as

$$ \widehat{\text{Y}} = {\text{f(x)}} = \beta_{ 0} + \beta_{1} x_{1} + \beta_{2} x_{2}+\, \cdots +\,\beta_{p} x_{p} $$
(4)
$$ f(x) = \beta_{0} + \sum\limits_{j = 1}^{p} {(\beta_{j} ,x_{j} )} $$
(5)

Set Xo = 1 then

$$ f(x) = \sum\limits_{j = 1}^{p} {(x_{j} ,\beta_{j} )} $$
(6)

Equation (6) is of linear regression.

For the nearest neighbor classifier, we get

$$ \widehat{Y}(x) = \frac{1}{{k_{0} }}\sum\limits_{xi = Nx}^{\infty } {(Y)} $$
(7)

The performance measure is to compare the true output with the predicted output where the loss function is given by

$$ L = (Y,f(x)) $$
(8)

Find f(x) to minimize the loss function where the loss function is the expected predicted error. Minimal error is obtained on conditioning a specific value of x

$$ f(x) = \arg {\mkern 1mu} {\text{min}}{\mkern 1mu} E_{Y} \left| {x([y - c])^{2} } \right|x$$
(9)

The value of C is the error and should be as small as possible, and this is called as the conditioning on a point.

$$ \therefore \;(y - x)^{2} = EPE(x) $$
(10)

One of the multi-clustering algorithms that are used for extracting tumor in MR images is the k-means clustering. This is a region-based segmentation which divides the region into number of clusters. The required number of clusters is represented by ‘K’ for particular application. Here, the technique is to find the nearest distance of every pixel to each cluster. Compared with the traditional algorithms like hierarchical algorithms, one of the advantages of using k-means algorithm is that for huge sample space on smaller value of ‘k’ the algorithm runs much faster. The other limitations that are associated with the algorithm are: 1. For a certain application, it is pretty difficult to guess the ideal value of ‘k’; 2. Bounding of the initial partitioning varies these results in different clusters; 3. One of the main factors is the density of the clusters which affects the performance of the algorithm as the clusters with different density algorithm does not work well enough [5].

Similarly K-Nearest Neighbor (K-NN) is a clustering algorithm, which is proficient in integration the nearby pixels that have similar intensity values by measuring the Euclidean distance among the classes of K-samples of the Brain MRI. The algorithm is shown in Fig. 5.

Fig. 5
figure 5

Block diagram to represent of the K-NN algorithm the flow shows the bottom-up approach

The MRI images are first preprocessed using some of the image enhancement techniques like filtering, mostly the apostrophic filtering or the median filtering techniques are used. The main component as an input to the K-NN algorithm is the feature values of various intensity levels obtained by the histogram of the input. The next step is re-sampling where the proper geometrical representation of the image is obtained [6]. The K-NN is a non-parametric algorithm which is trained with data, as the number of trained samples increases the output is more efficient or accurate. If S is the number of samples and C is the number of classes where S > C, the distance between the two pixels is found out by the Euclidean distance measure which is given as [7]. Let X = (x1, x2, x3, …, xn) and U = (u1, u2, u3, …, un) be the two points then Y, the distance of order is defined as

$$ D(x,u) = \sum\limits_{i = 1}^{n} {\left( {\left| {x_{i} - u_{i} } \right|^{y} } \right)^{{\frac{1}{y}}} } $$
(11)
  • For y = 1: Manhattan Distance

  • y = 2: Euclidean Distance

  • y = ∞: Infinity Distance

Thus, the distance metrics are obtained and the results for some of the input images are as given in Table 2.

3 Experimental Results

Table 1 shows the results of the segmented tumor, and the corrupted input MRI images are preprocessed to initially enhance and eliminate the unwanted noise. High-frequency components are extracted from the enhanced image and the segmented output is obtained by clustering using the k-means algorithm. Manhattan distance metric is used for distance measurement. The segmented output is as shown in the last column of Table 1.

Table 1 Results for different input images using k-means algorithm

One of the supervised learning algorithms is the k-nearest neighbor. Researcher (Sudharani) [7] worked on a set of images to train as well as test the K-NN algorithm. The results obtained are shown in Tables 2 and 3 where Table 2 gives the distance measure of the data of tumors and Table 3 gives classification and identification score.

Table 2 Results for distance metrics
Table 3 Results for different input images using K-NN algorithm. The classification and the identification score

4 Conclusion

This research is piloted to identify Brain tumor using medical imaging techniques. Here, more emphasis is given on the machine learning techniques and the results obtained are better than the results got form morphological operations. We observe that machine learning techniques have the learning capability which adapts itself for new data sets, which helps in minimizing errors by updating weight. Taking into account these powerful techniques of machine learning we are looking forward to find the volume of the tumor which is even more challenging which may help the radiologist for planning the therapy of diagnosis.