Abstract
Crop production affects the economy of a specific region or country. To maintain the economic development of any territory crops disease detection is a leading factor in agriculture. There exists various techniques to detect the crops disease, but they offers lilimted accuracy. The proposed mulistage technique utilizes leaf region separation from background prior to the feature extraction for classification of images as healthy and diseased. Initially color transformation-based foreground separation is performed followed by the features extraction. The novelty of the method is attruibuts that are computed though adaptive analytic wavelet transform (AAWT). The AAWT decomposes the preprocessed images into various sub-band images as features. The attributes utilized for categorization are a combination of bag of visual word, Fisher vectors, and AAWT based-features extracted from leaf images. Performance of the proposal is analyzed through PlantVillage dataset. The simulation outcomes validate the superiority of the proferred classification system as compared with the existing techniques of the field. The proposed leaf classification method provides an average accuracy of 94.07% with the area under the characteristic curve 0.961.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
The agricultural landmass is more than just being feeding fount in current world. The economic growth of developing nations is very much depending on agricultural productivity. The crops’ leaf-based disease identification in the agriculture, at the starting phase, [8] performs an dramatic role to sustain their economic growth. The leaf image analysis of plants provides symptoms of various types of diseases [52]. The image processing applied to explore the various regions in the images of plant leaves. There are different types and levels of image analysis techniques. The initial level of image covers the specifice points and region localization. The point or corner and contour-based image analysis [25] provide lots of useful information to mark the diseases like early and late blight, target spot, etc. Neto et al. [45] worked on leave extraction using clustering and genetic algorithm. It is connected components based fuzzy clustering using genetic optimization.
The second level of image analysis covers the specific features based prediction. The contour detection-based image segmentation to determine the region of interest in a hierarchical manner is presented by Arbelaez et al. [5] through the tree formation. Rumpf et al. [50] presented hyperspectral data analysis to differentiate diseased from healthy sugar beet leaves. Different spectral vegetation indices, containing the physiological information were used as attributes for automatic classification. The differentiation of inoculated plants with particular diseases was analyzed using a support vector machine (SVM) classifier. Leaves recovery in addition to image segmentation and classification is performed by Teng et al. [60] with arbitrary image conditions. Xu et al. presented a color and texture features based leaves extraction that utilized the intensity histogram, derived histogram termed as the differential histogram, the Fourier transform, and the wavelet packets [66]. The third level of image analysis increases the depth of features study and compute the most significant attributes through different feature optimization approaches. The feature selection is performed using a genetic algorithm to obtain the suitable details for the disease diagnosis. Wang et al. [61] presented an adaptive thresholding technique to segment the single leaves from the leaf images.
Soares et al. [58] studied the leaf images segmentation in semi-controlled conditions (LSSCC) [56, 57]. Biswas et al. de-correlated the stretching to enhance the image colors and employed segmentation through the fuzzy C-mean clustering (SFCMC) [10]. Aparajita et al. [4] worked on automated potato disease identification as late blight by employing the statistical features-based adaptive thresholding for segmentation (SFATS). Wu et al. presented a multi-feature fusion recognition model (MFFRM) [64] for the classification of tomato images.
A plant disease detection and classification (PDDC) was performed by Barbedo et al. [6]. Yanikoglu et al. [68] worked on a plant recognition system for automatically identifying the plant in a given image. An image recognition based plant disease detection (IRPDD) was performed in [48]. A clustering based binary classification for the lesion pixels or healthy pixels separation to obtain the segmented image. After segmentation the texture, color, and shape features were extracted from the lesion images. Sabrol et al. [51] presented classification by classifying tree [53] using color, texture and shape features of localized healthy and disease affected tomato leafs’ images. Islam et al. studied a plane disease diagnosis system through machine learning (PDDML) [27]. The PDDML performed L*a*b* color based segmentation and the extraction of GLCM features with statistical traits for the classification of plant diseases. This automated method classifies diseases (or absence thereof) on potato plants. The disease prediction using SVM was analyzed through the segmented images. Patil et al. [46] worked on potato leaf images to develop an automated disease management techniques (ADMT) [18, 54]. The ADMT performed image segmentation using blob analysis and morphological filters.
Grand-Brochier et al. [22] studied the user and input stroke interaction application, along with the utilization of color distance maps. A deep convolutional neural network for plant disease identification was presented by Wang et al. [62]. Kaur et al. [28] presented a k-means clustering-based semi-automatic method (KMCSAM) for plant disease classification. A genetic approach-based attribute selection for apple disease identification and recognition (GAFSADR) was presented by Khan et al. [30]. The GAFSADR [28] system utilized hybrid filtering throuhg the Gaussian, box, and median filter, followed by correlation based segmentation and finally, the color, histogram, and Local Binary Pattern (LBP) features are fused by comparison based parallel fusion. The KMCSAM worked on L*a*b* color based segmentation and color and texture attributes for each cluster are extracted to train the classifier. The features are clustered first before training the classifier for the specific class. Mu et al. [44] presented geometric features and Haar wavelet-based feature extraction method that easily provides the tree leaves features. Kurmi et al. [37, 38] presented a leaf image-based crop diseases classification (LICDC) that extracted Fisher vector (FV) features for the detection of disease classes. All these methods compute the image level features based on the texture, color and shape, those are not capable to differentiate the different types of patterns present in the leaf region. These features can be extracted out through the adaptive analytic wavelet transform (AAWT) and FV that are capable to extract the specific pattern related features.
The existing work faces the impediments of different types for multi-class categorization of the dataset. However, the implementation work still paucity the segmentation outcome accuracy. The disease affected leaves segmentation is crucial to determine the disease type that needs a priori information and covers very few diseases. There is a requirement of the work to be extended and integrated that cover various crops and their diseases. The methodology upgrading, as well as database enlargement and refinement, is needed in order to achieve better accuracy.
The contribution of paper comes in the third level of image analysis, that overcomes the above discussed limitations. Initially, the regions of interest are extracted by color space transformations. The L*a*b color space utilization [12, 31, 63], to map the RGB (red, green, blue) in a high-dimensional color space. From which the intended region can be easily extracted in terms of the saliency map to determine an optimal color coefficients. It provides the leaf region separate from the background. Followed by the leaf county extraction different types of features using adaptive analytic wavelet transform (AAWT), BoW, and Fisher vector for the classification of images. The AAWT decomposes the preprocessed image into different sub-band image. Afore, Relief F and a box counting algorithm is employed to extricate the different entropy and fractal dimension (FD) attribute, individually.
It recommends the feature fusion of spatial domain with the frequency domain to extract the best possible discriminative information of the selected domain. Further the significance of extracted features is investigated using LDA and less significant features are removed from the feature set used for the classification purpose in order to reduce the complexity of classification. The features used for classification are an ensemble of bag of visual word features, Fisher vectors, extracted from the preprocessed image and 13 handcrafted features, extracted from the internal parts of the leaf. The extracted feature sets and their combinations are utilized logistic regression, multilayer perceptron model, and SVM classifiers. The proposal is offering improved classification accuracy of various plant diseases.
The remaining organization of the work goes further as: Section 2 explicates the materials and method. In Section 3, we have discussed the result computed from different models and compared the proposed system. Finally, the Section 4 provides conclusion of the work and gives future research directions.
2 Materials and method
The leaf image-based disease analysis needs a set of images to understand and a technique that is able to predict the disease by utilizing its dataset understanding. Here, we first discuss the dataset that is used to develop the efficient predictive methodology followed by leaf region localization, feature extrication, and classification.
2.1 Dataset
The dataset used in our study is taken from the PlantVillage that is freely available for research. It contains leaf image 14 plants with a good amount of images for each category. Here, we are utilizing only three plants bell pepper, potato, and tomato are members of the nightshade family (Solanaceae) [24]. The diseases effect on the nightshade family is different for different locations where crops grows and also varies with time at which the crop can be grown. To manage the crop production it is very essential to identify and control these diseases. The details of these datasets are given below:
2.1.1 Dataset 1: Bell pepper
The database of plant leaf images for the image-based disease study is accessible as PlantVillage data. It comprises 14 various crops data that contains 54,309 labeled images. The bell pepper dataset has two classes as bacterial spot and healthy. The sample images from the database are depicted in Fig. 1. The image count for each class includes 997 bacterial spot samples and 1478 healthy sample images as shown in Table 1. Latin name of pathogens [15, 41] causing bacterial spot diseases is Xanthomonas vesicatoria.
2.1.2 Dataset 2: Potato
The sample potato leaf images from the dataset are shown in Fig. 2. It comprises three different categories of potato leaf images as healthy, early blight, and late blight. Latin names of pathogens [15, 41] causing plant diseases are mentioned in braces with the plant disease. The image sets of early blight (Alternaria solani) and late blight (Phytophthora infestans) containing 1000 images each on the other hand healthy leaves category has 152 images as given in Table 2.
2.1.3 Dtaset 3: Tomato
A set of images of tomato leaves having various diseases is taken from the PlantVillage dataset. There are ten different categories that have nine disease classes and one healthy class. The sample images are shown in Fig. 3.
The tomato database consists 1404 images of target spot class, 373 images for mosaic virus, 3209 samples of yellow leaf curl virus affected images, 2127 leaf images with bacterial spot. It has 1000 images of early blight category, 1591 healthy images, and 1909 late blight class. A set of 952 images of leaf mold, 1771 of septoria leaf spot, and tomato leaf affected by spider mites are 1676 as shown in Table 3. Latin names of the pathogens [15, 41] that causes the tomato disease are also included in Table 3.
2.2 Proposed method
The images persist some artifacts (noise) in some of the leaves. So, we applied some sort of pre-processing mechanism to encounter this problem. The intensity histogram equalization. In this section, a novel methodology for the automated classification of crop disease stages based on AAWT is developed. The block diagram of the proposed system is demonstrated in Fig. 4. In preprocessing, the input images are resized using bi-cubic interpolation technique into 360 × 480 pixels for the same resolution and to decrease the computational time.
The proposed segmentation methodology explicates a three-stage system. The first step separates the foreground i.e. extrication of leaf region from the background. The next step performs initial leaf identification purporting to the AAWT technique. The result of AAWT decomposition has been utilized for features extraction alongwith BoW, FV for the classification of leaf images.
2.2.1 Leaf foreground extraction
The seed point initialization requires prior information of the considered object in terms of leaves. The plant leaf images easily offer a color based seed point marking for the leaf region segmentation. The color space transformation is utilized for this process. The conversion of an image from RGB color space to L*a*b* color space provides an extension of spectral range. The RGB color space analysis is explained with visual illustration in Fig. 5.
The light effect is reduced by normalization of the RGB color space and denoted as (Rnorm; Gnorm; Bnorm). The perceptual color spaces are CIELab and CIELuv. The color conversion from RGB to the perceptual color space, initially, the color space RGB is converted to the CIE XYZ color space that provides the basis for conversion to perceptual color spaces. The constituent L is similar for CIELab and CIELuv color spaces. The value of L indicates the lightness that is not dependent on the other two constituents. The formulation of the conversion process is given as follows:
where Xn,Yn, and Zn are base tristimulus values explained in the CIE chromaticity representation [21] and
Global thresholding method has been employed for the segregation of the intensity extent of leaves i.e. the image pixels encompassing the leaves from the background pixels. Initially, the primary seed points have been chosen using this color based properties of the processed image. In this work, green channel (G) is extracted from red (R) and blue (B) channels, which contains more significant details, it goes through with contrast limited adaptive histogram equalizations process (CLAHE) for enhancing the pixel contrast. The result of color based leaf foreground region is illustrated in Fig. 6.
The preprocessed images decompose into different subband images using AAWT image decomposition. Furthermore, Relief F and the box-counting algorithm is employed to extract the significant entropy features: Kapoor entropy (KE), Yager entropy (YE), Renyi entropy (RE), and FD features, independently [65]. Then, the extracted features value was ranked using the Fishers’ LDA dimensionally reduction strategy. Finally, the SVM classifier is used for classification.
2.3 Attribute extrication
Feature extrication is used to obtained relevant data from the decomposed images. To classify the crop diseases, texture attributes are applicable to evaluate the coarseness, smoothness, and pixel-regularities.
2.3.1 Adaptive analytic wavelet transform
The AAWT is a developed version of the discrete wavelet transform (DWT) decomposition technique which is a useful tool for medical imaging. The time-frequency covering is the most significant feature of AAWT [7]. Figure 7 depicts a different aspect of time-frequency awning of the wavelet framework. It offers various kind of attractive characteristics such as high recurrence frequency resolution and essential command on the dilation factor measures, the number of decomposition levels, Q-factor (QF) and redundancy (r). Additionally, it is appropriate for 2D signal processing because it comprises Hilbert transform companion of atoms. It can also make a narrow chirplet framwork for discrete signals that is applicable for time-frequency analysis [55]. Iterated filter banks (FBs) are used to obtain the transform, which provides fast processing for 2D signals. AAWT transform with iterated FBs shown in Fig. 8. It contains two high-pass and one low-pass channels. One high-pass channel utilized for negative frequency inspection and another for positive frequency investigation. The transition bands for the filters in Fig. 7, are depicted in Fig. 9.
Where, ω0 represents central frequency and Δ denotes bandwidth, Control parameter QF. The time locator of wavelet function is handled by redundancy [70]. AAWT defines the QF, redundancy and dilation factor using parameters: e, f, g, h, and β, where e and f for up and downsampling of high pass channel, g and h are employed for up and downsampling of low pass channel, respectively, β is a positive coefficient, which is defined with QF and it can be represented as:
In AAWT decomposition, the parameters e, f, g, h, and β are used as a controlling parameter in the wavelet. AAWT decomposition is done using iterative FBs consisting of high-pass and low-pass channels at each level of iteration [23]. Negative and positive frequencies are differentiated by the low-pass and high-pass channels of the FBs, respectively. The frequency response curve proportional to the high-pass filter is represented as:
The low-pass filter frequency response is represented as:
Where dilation factor d, Q-factor Δf/f and shift parameter Δt; \(\omega _{p} = \frac {(1-\beta )\pi +\varepsilon }{e}\); \(\omega _{s} = \frac {\pi }{f}\); \(\omega _{0}=\frac {(1-\beta )\pi +\varepsilon }{g}\); \(\omega _{1}=\frac {e\pi }{fg}\), \(\omega _{2}=\frac {\pi -\varepsilon }{g}\) \(\omega _{3}=\frac {\pi +\varepsilon }{g}\); \(\varepsilon \le \frac {e-f+\beta f}{e+f}\pi \).
The 𝜃(ω) can be given by
The constraints for the selection of QF parameter is represented as:
In this work, AAWT decomposition is applied iteratively using the iterated filter bank on the preprocessed green channel. AAWT decomposed components are suitable for extracting significant detail for the classification of crop disease stages as shown in Fig. 10. The AAWT components are band-limited, pixel variation is captured with the high value from the earlier sub-band image for further decomposition [3]. The iterative approach is effectively used in extracting the finer detail from previous decomposed sub-band images. The AAWT is successfully applied for discriminating normal, crop disease stages. The implementation of AAWT for Analytic wavelet-based features (AWF) extraction can be accessed from http://web.itu.edu.tr/ibayram/AnDWT/
Here, the entropy and FD attributes are extracted [26]. The entropy traits such as KE, RE, YE have been employed to quantify the uncertainty.
Non-Shannon entropies is used because of the higher dynamic range [9]. The px can be represented as \(p_{x} =\frac {y}{r\times c} \). Where px denotes probability of pixel x appeared y times and r × c represents the image size. The KE, RE and YE can be defined as [9]:
Fractal dimension (FD) features: The irregularity measure of the self-similarity with roughness has been provided by FD for the texture evaluation [2]. The crop leaf images having texture, which can be captured using Fractal. If an image surface S, scaled through f, is defined as self-similar surface solely if S signifies the union of none overlying (Sf) of itself [16].
Calculated as
where \(f= \frac {scale~ value}{original~ scale} \) modified sequential box-counting algorithm is very useful to calculate fractal features [1]. In this approach sequential box-counting (SBC) algorithm is used, initially grid size is fix to the power of 2 and the final size is set to (r×c).
2.3.2 Bag of visual words
The BoW extrication is an automated system performing the rotation and scale invariant geometry-based attributes extraction [43]. The shape-based scale-invariant feature transform (SIFT) features are extracted using the BoW system from original images. The SIFT features of patches are utilized to construct the coding table of visual words. The coding table containing a definite number of codewords (or visual words) are contrived with descriptors as shown in Fig. 11.
2.3.3 Fisher vectors
Fisher vectors (FV) can be understand as a high-dimensional illustration of dense vectors [40]. The dense vector depiction in terms of the Gaussian Mixture Model (GMM) fitted to the attributes by encoding the derivatives of the log-likelihood of the model with respect to parameters [34]. The considered GMM system has been trained through diagonal covariance as well as the derivatives with respect to variances and mean, which additionally averages the first and second-order differences between the GMM centers and their dense features.
Here {Wk,μk,σk}k are the GMM internal mesures termed as weights, means values, and diagonal covariances, consecutively. αp(k) indicates the soft weight assignment of the pth attribute xp to the kth Gaussian. An FV Φ can be computed by stockpile the differences: \({\Phi }=[{\Phi }_{1}^{(1)},{\Phi }_{1}^{(2)} ,...,{\Phi }_{K}^{(1)} ,{\Phi }_{K}^{(2)} ]\). The encoding explores the attributes distribution of a given image distance from the dispersal fitted to the feature of all the training images. The diagonal covariance of GMM based FV is amenable to be initially decorrelated by PCA. The PCA is applied to the SIFT features along with dimensionality reduction up to half from 128 to 64. The dimensionality of the Fisher vector is 2Kd, with K Gaussians in GMM and the patch feature vector dimensionality is d. The dimensionality of FV is high for d = 64 and K = 512 it is 65536, which still lower as compared with the dimensionality of stacked dense features (1:5M in this case). By following [40], the FV performance can be improved by applying the L2 normalization.
2.4 Feature selection
To design a strong learning system, the robust attributes selection helps to enhance the classification performance. Before selection feature normalization is an important step to bring all features in a common range for analysis. The attribute normalization boosts the system performance. The skewed details may offer the wrong caution and restrain the performance of the system. Generally, the information is standardized by scaling the values in a range of 0 to 1. In the proposed approach, the attributes are normalized through a z −score normalization function having unit standard deviation with zero mean [39]. Let A is data, and sdt signifies standard deviation subsequently the normalized data \( \tilde {A} \) by employing the z-score normalization function is defined as:
As we know all the features are not contributing equally in classification problem. Similarly, the selection of the ranked features among the available attributes set is important to acquire the better classification performance. The significance of the selected features is considered [42]. The features with a higher rank are more valuable for the classification of crop disease stages than lower-ranked features. LDA is an appropriate approach in machine learning employed for classification and provides the highest possible discrimination to classify the stages of crop disease using Fisher’s discriminant index (FDI) [20]. LDA offers maximum category adaptability by extending the fraction amidst different classes [1]. Maximum discriminations of classes can be obtained with Fisher’s LDA. In this work, LDA reduced the dimensionality of twenty-four selected features to thirteen robust features without affecting the variance. Results of dimensionality reduced ranked features are described in Table 4. The LDA using Fisher’s discrimination index provides thirteen high ranked robust features among the entire features set [67].
2.5 Classification
Basically, the classification of particulars follows the features extraction and classifying process. Applying a similar technique we extracted the handcrafted feature and machine learning-based features. Now, we discuss the applied classification models. With all the classifier models 10-fold cross-validation has been utilized in the experiments.
2.5.1 Logistic regression
The logistic regression (LR) [69] used to predict the disease based on the available features set. It is a type of probabilistic, statistical classification model to measures the relationship of a categorical dependent variable to one or more independent variables through the probability scores of the dependent variable. The LR generated the coefficients of features that show significance level of features and formulated to predict a logit transform of the probability of the features. The LR model formulated as:
where PLLF represents the probability of the presence of leaf localized features (LLF) and the logit transformation in terms of logged odds is given as:
and
The concept used in LR is the parameters optimization by maximizing the likelihood of observing samples instead of error minimization as in ordinary linear regression. The implementation through the L2-regularization has been utilized the LIBLINEAR [17].
2.5.2 Multilayer perceptron model
A multilayer perceptron model (MLP) [49] employed for the multi-class classification with nonlinearity is given as:
where x represents an input vector that is treated as the “zeroth layer output”. The notation \( \hat {o}^{l-1} \) is used to show an operation where a number 1 shows a prepend vector, in this way, the bias terms of considered layer l to represent the first column of the matrix Wl. The activation function (sigmoid) is represented by notation Fl applied to all vector components. The softmax function is used as the output layer activation function of the four-layered MLP model. It has one input layer, two hidden layers, and the output layer. The two hidden layers utilized in the MLP model have a different number of neurons as 7 and 8 respectively. The fully connected feed-forward MLP model is implemented with sigmoid activation. The learning rate utilized is η = 0.15 during backpropagation to train the model by considering the input features as the training parameters while the image labels as the target variable. The stochastic gradient descent algorithm based gradient is computed as weight error and the parameters are adjusted as to move the MLP one step closer to minimize the error. The 10-fold cross-validation is used in the experiments.
2.5.3 Support vector machine
It is used for the binary and multi-class classification of the dataset. The binary classification through SVM provides high flexibility for separation of classes [13, 35]. An adjustable linear decision surface called hyperplane is used that is capable to take the decision and splits the space into the number of classes. The concept of nonlinear hyperplane is based on the soft marginal SVM. Kernel-based SVM is used to discriminate the different diseases and their stages. A kernel-based multiclass SVM is used with multiple features. The kernel function helps in mapping the data into a next feature space for straightly divisible of features [14].
subject to \( y_{i} (\bar {w}.\bar {x}+b) \ge 1-\zeta _{i} \) for i = 1,... ,N.
The value of C is used to provide an adjustable margin in SVM. The feature mapping based nonlinearity was also utilized by transforming the Hilbert space as it provides the facility of transformation from one spatial domain to another. The Gaussian kernel application offers the nonlinear projection that creates additional separation between the data points in mapped high-dimensional space. The formulation of Gaussian kernel is given as
A k-fold cross validation based strategy is used in SVM model training and testing, where k = 10 with C = 1 and γ= 1 using the hyperparameter tuning [32]. The mean of the optimal hyperparameters is determined to find a robust estimate. The parameters C, γ, and kernel are optimized for SVM and the activation, solver, and learning rate are optimized for MLP classifier.
3 Result and discussion
Performance of the proferred leaf image segmentation technique is done using the PlantVillage dataset.
3.1 Experimental setup
The proposed segmentation approach was implemented on Python using the Keras API [29], Scikit-learn [47] library, and Tensorflow back-end [59]. NVDIA Quadro K5200 graphic card was used for the image data processing on a computer (Intel®;Xeon®; Processor E5-2650 v3 @2.30 GHz, with 8 GB RAM). The proposed technique classifies the dataset into multiple classes as mentioned in dataset description Section 2.1. The performance comparison shown here includes the average performance for all the categories and one versus all concept is utilized to compare the multi-class classification. The 70% and 30% images are used for the training and validation of the results. Inside the training dataset the 10 fold cross validation is performed for the training and testing experimentation.
3.2 Performance parameters
The performance measures of the proferred system are defined in terms of the classification accuracy (Ac) [11, 33, 36] and area under the curve (AUC) through the receiver operating characteristic (ROC) curve [19]. For better performance the AUC should be with high numeric value.
3.2.1 Accuracy
The accuracy measure is given as follows:
3.2.2 Area under the curve (AUC)
The classification performance is evaluated continuously through the ROC curve. It basically provides a trade-offs between sensitivity (true positive outcome rate) and false-positive outcome rate (i. e. 1-specificity).
3.3 Performance evaluation
The classification performance is evaluated for the bell pepper, potato, and tomato datasets using three classifiers LR, MLP, and SVM. The classification is performed by extracting a different set of features along with their combinations. The features in terms of BoW, FV, and AWF, along with their combinations as BoW+AWF, FV+AWF, and combination of all three BoW+FV+AWF as proposed features set. The performance is evaluated in terms of accuracy (%) and AUC using three classifiers as LR, SVM, and MLP. The classification results in terms of Acc. and AUC using LR, MLP, and SVM for bell pepper, potaton and tomato datasets are given in Table 5.
The first classification experiment is performed by taking an original image set of the bell pepper dataset, as mentioned in Table 1. The SVM classifier gives performance using BoW+AWF and FV+AWF with values 88.33% and 89.23% accuracy with AUC > 0.88 as shown in Table 5. The MLP classifier with FV+AWF gives an average accuracy of 88.36% and AUC = 0.851. The tabular analysis shows that the two-class classification accuracy using three classifiers LR, SVM, and MLP the combinations of the features as BoW+ AWF, FV+AWF, and BoW+FV+AWF perform better as compared with individual types of features. A different set up for classification as Experiment 2 using the potato dataset as mentioned in Table 2. The LR gives 86.57% classification accuracy using FV+AWF features with 0.842 AUC as shown in Table 5. The MLP classifiers give performance using BoW+AWF and FV+AWF with values 88.69% and 88.27% accuracy with AUC > 0.86. The SVM classifiers give performance using BoW+AWF and FV+AWF with values 88.84% and 89.21% accuracy with AUC > 0.87. The combination of BoW+FV+AWF offers 94.13% accuracy with 0.966 AUC.
Experiment 3 performed on the tomato dataset using LR gives 86.99% classification accuracy with FV+AWF features and 0.864 AUC as shown in Table 5. The MLP classifier gives performance using BoW+AWF and FV+AWF with values 83.732% and 88.32% accuracy with AUC > 0.86. The SVM classifiers give performance using BoW+AWF and FV+AWF with values 86.85% and 87.28% accuracy with AUC > 0.86. The proposed combination of BoW+FV+AWF offers 91.89% accuracy with 0.939 AUC.
The average AUC and Ac. comparison of the proferred method with the state of the art methods is shown in Fig. 12.
The GAFSADR [30] shows the AUC performance with 0.914 and the LICDC [38] method shows AUC 0.950. The proposed method shows 0.961 AUC that is 5% better than the GAFSADR [30] method.
The detailed comparison of the proposed method with the state of the art methods is given in Table 6. The proposed techniques classifies the different leave images in the multiple classes of healthy and unhealthy categories. Referred methods are used in the manuscript for the performance analysis purpose to rate the classification accuracy with the simulation codes of these methods has been run to solve the considered dataset classification problem. The results of proposed method have been compared with various state of the art methods for the validation. The classification accuracy for PDDC [6] method is 0.860. The IRPDD [48] method utilized the texture, color, and shape features that missed the internal patterns as well as the frequency based features in the leaf region that limits the classification accuracy that is 0.888. The PDDML [27] method only the GLCM and statistical features of segmented region which may be efficient for a particular type of dataset with accuracy of 0.868. The KMCSAM [28] performs the clustering of features before trainig the classifier which is not very much supportive to improve the results and offers accuracy of 0.864. The GAFSADR [30] method gave an accuracy of 0.874, 0.818, and 0.815, for pepper, potato, and tomato leaf image datasets, respectively. A set of different features using FV and AWF along with BoW has been introduced in the proposed model that improves the classification performance. The average accuracy of the proposed method (BoW+FV+AWF using SVM classifier) is 0.941%. The average AUC has reported 0.961 which is 8% and 12% better as compared with to and 0.836 and 0.864 provided by GAFSADR [30] and KMCSAM [28], respectively as shown in Table 6.
3.4 Complexity analysis
The computational complexity of PDDC [6], IRPDD [48], PDDML [27], and LICDC [38] is O(N3). GAFSADR [30] offers is O(N2 × n), for n images of training set. The prediction system during testing possesses O(N2) computations. The computing complexity offered by the ADMT [54] method is O(N2) and nuclei refinement is \( O(N^{2}L_{B} +N_{P} {L_{B}^{2}}) \) and O(N2LB), respectively. A computational complexity of the KMCSAM [28] and the proposed method is O(N3). The classification performance of the proposed method has improved to a good extent without increasing the computational complexity.
4 Conclusion
Detection of diseases in crops is based on the visual observation of structural deformations in leaf images. The paper recommends the feature fusion of spatial domain with the frequency domain to extract the best possible discriminative information of the selected domain. Further the significance of the extracted features is investigated using LDA and less significant traits are removed from the feature set used for the classification purpose in order to reduce the complexity of classification. In this method, AAWT decomposes the preprocessed images into various sub-band images. Afore, Relief F and a box-counting algorithm has been employed to extricate the different entropy and fractal dimension attributes, respectively. The extracted features value were optimized using LDA strategy. The attributes utilized to classify images are a combination of 120 BoW, 120 FVs, extracted from the preprocessed image and 13 AAWT features of leaf region. The performance of the proposal is analyzed through PlantVillage datasets of bell pepper, potato, and tomato images using LR, MLP, and SVM classifiers. The proposal is offering improved classification accuracy of various plant diseases. The simulation outcomes validate the performance superiority of the proferred classification approach than the existing methods in the field. The presented classification technique offered an average accuracy of 94% on the considered PlantVillage database. The future work can be the deep learning-based feature extraction and classification.
References
Acharya UR, Bhat S, Koh JE, Bhandary SV, Adeli H (2017) A novel algorithm to detect glaucoma risk using texton and local configuration pattern features extracted from fundus images. Comput Biol Med 88:72–83
Acharya UR, Dua S, Du X, Sree VS, Chua CK (2011) Automated diagnosis of glaucoma using texture and higher order spectra features. IEEE Trans Inf Technol Biomed 15(3):449–455
Acharya UR, Mookiah MRK, Koh JE, Tan JH, Noronha K, Bhandary SV, Rao AK, Hagiwara Y, Chua CK, Laude A (2016) Novel risk index for the identification of age-related macular degeneration using radon transform and dwt features. Comput Biol Med 73:131–140
Aparajita R, Sharma A, Singh M, Dutta K, Riha K, Kriz P (2017) Image processing based automated identification of late blight disease from leaf images of potato crops. In: 2017 40th international conference on telecommunications and signal processing (TSP), pp 758–762
Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Patt Anal Mach Intell 33 (5):898–916
Barbedo J (2013) Digital image processing techniques for detecting, quantifying and classifying plant diseases. Springer Plus 2:660, 12
Bayram I (2013) An analytic wavelet transform with a flexible time-frequency covering. IEEE Trans Signal Process 61(5):1131–1142
Beucher S, Meyer F (1993) The morphological approach to segmentation. The Watershed Transformation 34:433–481
Biswas MK, Ghose T, Guha S, Biswas PK (1998) Fractal dimension estimation for texture images: a parallel approach. Pattern Recogn Lett 19(3):309–313
Biswas S, Jagyasi B, Singh BP, Lal M (2014) Severity identification of potato late blight disease from crop images captured under uncontrolled environment. In: 2014 IEEE Canada international humanitarian technology conference - (IHTC), pp 1–5
Chaurasia V, Chaurasia V (2016) Statistical feature extraction based technique for fast fractal image compression. J Vis Commun Image Represent 41:87–95
Chen Y, Ma Y, Kim DH, Park S (2015) Region-based object recognition by color segmentation using a simplified pcnn. IEEE Trans Neur Netw Learn Syst 26(8):1682–1697
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Cuingnet R, Glaunès J. A., Chupin M, Benali H, Colliot O (2013) Spatial and anatomical regularization of svm: a general framework for neuroimaging data. IEEE Trans Pattern Anal Mach Intell 35(3):682–696
Dean R, Van Kan JAL, Pretorius ZA, Hammond-Kosack KE, Di Pietro A, Spanu PD, Rudd JJ, Dickman M, Kahmann R, Ellis J, Foster GD (2012) The top 10 fungal pathogens in molecular plant pathology. Mol Plant Pathol 13(4):414–430
Dua S, Acharya UR, Chowriappa P, Sree SV (2012) Wavelet-based energy features for glaucomatous image classification. IEEE Trans Inf Technol Biomed 16(1):80–87
Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871–1874
Fang Y, Ramasamy R (2015) Current and prospective methods for plant disease detection. Biosensors 5:537–61, 08
Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874. ROC Analysis in Pattern Recognition
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Annals of Eugenics 7(2):179–188. [Online]. Available: https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
Gonzalez REW, Rafael C, Eddins SL (2004) Digital Image Processing Using MATLAB, ser. International series of monographs on physics. Upper Saddle River, Prentice Hall
Grand-Brochier M, Vacavant A, Cerutti G, Kurtz C, Weber J, Tougne L (2015) Tree leaves extraction in natural images: comparative study of preprocessing tools and segmentation methods. IEEE Trans Image Process 24(5):1549–1560
Gupta V, Chopda MD, Pachori RB (2019) Cross-subject emotion recognition using flexible analytic wavelet transform from eeg signals. IEEE Sensors J 19(6):2266–2274
Hančinský R, Mihálik D, Mrkvová M, Candresse T, Glasa M (2020) Plant viruses infecting solanaceae family members in the cultivated and wild environments: a review, Plants. Basel Switzerland 9(5):1–17
Hojjatoleslami SA, Kittler J (1998) Region growing: a new approach. IEEE Trans Image Process 7(7):1079–1084
Iftekharuddin K, Jia W, Marsh R (2003) Fractal analysis of tumor in brain mr images. Mach Vis Appl 13:352–362, 03
Islam M, Dinh Anh, Wahid K, Bhowmik P (2017) Detection of potato diseases using image segmentation and multiclass support vector machine. In: 2017 IEEE 30th canadian conference on electrical and computer engineering (CCECE), pp 1–4
Kaur S, Pandey S, Goel S (2018) Semi-automatic leaf disease detection and classification system for soybean culture. IET Image Process 12 (6):1038–1048
Keras (2018) Keras Documentation, https://keras.io. Accessed 2 Feb 2018
Khan MA, Lali MIU, Sharif M, Javed K, Aurangzeb K, Haider SI, Altamrah AS, Akram T (2019) An optimized method for segmentation and classification of apple diseases based on strong correlation and genetic algorithm based feature selection. IEEE Access 7:46261–46277
Kim J, Han D, Tai Y, Kim J (2016) Salient region detection via high-dimensional color transform and local spatial support. IEEE Trans Image Process 25(1):9–23
Klein A, Falkner S, Bartels S, Hennig P, Hutter F (2017) Fast Bayesian optimization of machine learning hyperparameters on large datasets. In: Proceedings of the 20th international conference on artificial intelligence and statistics (AISTATS 2017), ser. Proceedings of Machine Learning Research, vol 54. PMLR, pp 528–536
Kurmi Y, Chaurasia V (2018) Multifeature-based medical image segmentation. IET Image Process 12(8):1491–1498
Kurmi Y, Chaurasia V (2020) Classification of magnetic resonance images for brain tumor detection. IET Image Process 1–13
Kurmi Y, Chaurasia V, Ganesh N (2019) Tumor malignancy detection using histopathology imaging. Journal of Medical Imaging and Radiation Sciences
Kurmi Y, Chaurasia V, Ganesh N, Kesharwani A (2020) Microscopic images classification for cancer diagnosis. Signal Image and Video Processing 14 (4):665–673
Kurmi Y, Gangwar S, Agrawal D, Kumar S, Saxena D, Saxena M, Shrivastava H (2020) An algorithm for various crop diseases detection and classification using leaves images. In: 2nd international conference on data engineering and applications (IDEA), pp 1–5
Kurmi Y, Gangwar S, Agrawal D, Kumar S, Srivastava HS (2020) Leaf image analysis-based crop diseases classification. Signal Image and Video Processing 1–9
Li M, Yuan B (2005) 2D-lda: a statistical linear discriminant analysis for image matrix. Pattern Recogn Lett 26(5):527–532
Liu L, Wang P, Shen C, Wang L, Hengel AVD, Wang C, Shen HT (2017) Compositional model based fisher vector coding for image classification. IEEE Trans Pattern Anal Mach Intell 39(12):2335–2348
Mansfield J, Genin S, Magori S, Citovsky V, Sriariyanum M, Ronald P, Dow M, Verdier V, Beer SV, Machado MA, Toth I, Salmond G, Foster GD (2012) Top 10 plant pathogenic bacteria in molecular plant pathology. Mol Plant Pathol 13(6):614–629
Martis RJ, Acharya UR, Min LC (2013) Ecg beat classification using pca, lda, ica and discrete wavelet transform. Biomed Signal Process Cont 8 (5):437–448
Mercan E, Aksoy S, Shapiro LG, Weaver DL, Brunye T, Elmore JG (2014) Localization of diagnostically relevant regions of interest in whole slide images. In: 2014 22nd international conference on pattern recognition, pp 1179–1184
Mu H, Ni H, Zhang M, Yang Y, Qi D (2019) Tree leaf feature extraction and recognition based on geometric features and haar wavelet theory. Engineering in Agriculture, Environment and Food
Neto JC, Meyer GE, Jones DD (2006) Individual leaf extractions from young canopy images using gustafson–kessel clustering and a genetic algorithm. Comput Electron Agric 51(1):66–85
Patil P, Yaligar N, M. S. M (2017) Comparision of performance of classifiers - svm, rf and ann in potato blight disease detection using leaf images. In: 2017 IEEE international conference on computational intelligence and computing research (ICCIC), pp 1–5
Pedregosa F, Varoquaux G, Gramfort A. e. a. (2011) Scikit-learn: machine learning in python. J Mach Learn Res 384(12):2825–2830
Qin F, Liu D, Sun B, Ruan L, Ma Z, Wang H (2016) Identification of alfalfa leaf diseases using image recognition technology. PLOS ONE 11 (12):1–26, 12
Raji CG, Vinod Chandra SS (2017) Long-term forecasting the survival in liver transplantation using multilayer perceptron networks. IEEE Trans Syst Man Cybern Syst 47(8):2318–2329
Rumpf T, Mahlein A-K, Steiner U, Oerke E-C, Dehne H-W, Plümer L (2010) Early detection and classification of plant diseases with support vector machines based on hyperspectral reflectance. Comput Electron Agric 74(1):91–99
Sabrol H, Satish K (2016) Tomato plant disease classification in digital images using classification tree. In: 2016 international conference on communication and signal processing (ICCSP), pp 1242–1246
Sankaran S, Mishra A, Ehsani R, Davis C (2010) A review of advanced techniques for detecting plant diseases. Comput Electron Agric 72(1):1–13
Scharr H, Minervini M, French AP, Klukas C, Kramer DM, Liu X, Luengo I, Pape J-M, Polder G, Vukadinovic D, Yin X, Tsaftaris SA (2016) Leaf segmentation in plant phenotyping: a collation study. Mach Vis Appl 27(4):585–606
Schor N, Bechar A, Ignat T, Dombrovsky A, Elad Y, Berman S (2016) Robotic disease detection in greenhouses: Combined detection of powdery mildew and tomato spotted wilt virus. IEEE Robot Autom Lett 1(1):354–360
Sharma M, Pachori RB, Acharya UR (2017) A new approach to characterize epileptic seizures using analytic time-frequency flexible wavelet transform and fractal dimension. Pattern Recogn Lett 94:172–179
Silva L, Koga M, Cugnasca C, Costa A (2013) Comparative assessment of feature selection and classification techniques for visual inspection of pot plant seedlings. Comput Electron Agric 97:47–55
Singh V, Misra A (2017) Detection of plant leaf diseases using image segmentation and soft computing techniques. Inform Process Agricult 4(1):41–49
Soares JAV, Jacobs DW (2013) Efficient segmentation of leaves in semi-controlled conditions. Mach Vision Appl 24(8):1623–1643. [Online]. Available: https://doi.org/10.1007/s00138-013-0530-0
Team GB (2018) TensorFlow, https://www.tensorflow.org/. Accessed 2 Feb 2018
Teng C-H, Kuo Y-T, Chen Y-S (2011) Leaf segmentation, classification, and three-dimensional recovery from a few images with close viewpoints. Opt Eng 50(3):1–13
Wang J, He J, Han Y, Ouyang C, Li D (2013) An adaptive thresholding algorithm of field leaf image. Comput Electron Agric 96:23–39
Wang G, Sun Y, Wang J (2017) Automatic image-based plant disease severity estimation using deep learning. Computational Intelligence and Neuroscience 1–8, 07 2017
Wang S, Yan W, Li X, Zhao G, Zhou C, Fu X, Yang M, Tao J (2015) Micro-expression recognition using color spaces. IEEE Trans Image Process 24(12):6034–6047
Wu J, Zhang B, Zhou J, Xiong Y, Gu B, Yang X (2019) Automatic recognition of ripening tomatoes by combining multi-feature fusion with a bi-layer classification strategy for harvesting robots in. Sensors
Xu D, Erdogmuns D (2010) Renyi’s entropy, divergence and their nonparametric estimators. Springer, New York, pp 47–102
Xu G, Zhang F, Shah SG, Ye Y, Mao H (2011) Use of leaf color images to identify nitrogen and potassium deficient tomatoes. Pattern Recogn Lett 32(11):1584–1590
Yan S, Xu D, Zhang B, Zhang H, Yang Q, Lin S (2007) Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell 29(1):40–51
Yanikoglu B, Aptoula E, Tirkaz C (2014) Automatic plant identification from photographs. Mach Vision Appl 25(6):1369–1383
Zhang X, Hu B, Ma X, Xu L (2015) Resting-state whole-brain functional connectivity networks for MCI classification using l2-regularized logistic regression. IEEE Transactions on NanoBioscience 14(2):237–247
Zhang C, Li B, Chen B, Cao H, Zi Y, He Z (2015) Weak fault signature extraction of rotating machinery using flexible analytic wavelet transform. Mech Syst Signal Process 64-65:162–187
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kurmi, Y., Gangwar, S., Chaurasia, V. et al. Leaf images classification for the crops diseases detection. Multimed Tools Appl 81, 8155–8178 (2022). https://doi.org/10.1007/s11042-022-11910-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-11910-7