Keywords

1 Introduction

The main aim of a development project is to change the current situation into a better one over time. Though industry has been playing an important role in Indian economic scenario, the contribution of agriculture still cannot be denied [1]. Agriculture is one of the biggest shares of economy in India, in terms of generating employment as well as provision of food for the ever-increasing population. The recognition and classification of crop diseases are of major economical and technical importance in agricultural industry. Maize popularly known as “corn” is one of the most versatile and major cash crops having wider adaptability under varied climatic conditions. Maize is one of the major cereal crops and is the third most contributing major crop in India after rice and wheat. Estimated losses due to major diseases of maize in India are about 13% of which fungal diseases cause major yield losses [2]. As per survey, bacterial blight, anthracnose, leaf spot, and rust are major commonly occurring diseases affecting maize crops in northern parts of Karnataka. They affect the photosynthesis with adverse reduction in the yield to an extent of 2891%. These diseases affect the quality as well as quantity of the agricultural products and lead economic losses. The leaves are the first one to be affected in case of any disease, generally before or after blooming which effects the growth plants. Our main objective is to concentrate on maize leaf disease detection based on the texture of the leaf.

2 Related Work

See Table 1.

Table 1 Literature review

3 Methodology

Image processing methods involve versatility, repeatability, and the preservation of original data precision. The various image processing techniques are:

  1. 1.

    Image acquisition

  2. 2.

    Image preprocessing

  3. 3.

    Image segmentation

  4. 4.

    Feature extraction

  5. 5.

    Image classification

  1. (i)

    Image Acquisition

    The images of the maize leaves are captured using a digital camera in JPEG format. The size of each image is 1500 × 1500. The samples are collected from a field located in the University of Agricultural Sciences, Dharwad. Three hundred images of each category: healthy, northern blight, southern blight, and rust, are collected by keeping constant distance of 1 foot. One hundred and fifty images of each category are used for training, and remaining 150 images are used for testing. Thus, we have 600 images for training and 600 images for testing.

  2. (ii)

    Image preprocessing

  3. 1.

    The images are of varying contrast. Preprocessing is necessary in order to correct nonuniform illumination. First, RGB image is converted into L*a*b [1,2,3] color space and the luminance component (L) is extracted. Then, contrast limited adaptive histogram equalization (CLAHE) is applied to L component and concatenated with ‘a’ and ‘b’ components, and finally, the image is converted from L*a*b* color space to RGB space (Fig. 1).

    Fig. 1
    figure 1

    Steps in leaf segmentation

  4. (iii)

    Image segmentation

    Image segmentation is the way toward partitioning a propelled image into various bits (sets of pixels, generally called super-pixels). The goal of division is to contemplate the picture in detail and also change the features of an image into something that is more vital and less complex to analyze. Image segmentation is consistently used to discover objects and edges (lines, curves, etc.) in images. More definitively, image segmentation is the path toward consigning a label to every pixel in an image to such a degree, to the point that pixels with a comparative feature share certain characteristics. The yield of this division is a course of action of portions that cover the entire image. Each pixel in a fragmented zone is similar with respect to some basic element, for instance, color, texture, intensity, and contrast. Neighboring locales are diverse relating to similar characteristics.

  5. (iv)

    Feature Extraction using Haar wavelet and GLCM

    Texture analysis plays an important role in image analysis, such as disease detection, medical imaging, machine vision, and content indexing of image databases. Texture analysis can be done by wavelet transform. The Haar transform has been used as a necessary tool in the wavelet transform for feature extraction. The method is based on the application of Haar wavelet on RGB image of the preprocessed image to obtain horizontal, vertical, and diagonal coefficients. Then, gray-level co-occurrence matrix [1, 3, 6, 7] (GLCM) is constructed into two directions (0° and 90°) for each of the coefficients. GLCM matrix produces four statistical features like contrast, correlation, energy, and homogeneity.

  6. (v)

    Classification using SVM classifiers

    The concept of support vector machine (SVM) was presented by Vapnik and collaborators. It gets predominance as it offers alluring highlights and competent equipment to handle the issue of order. The SVM depends on measurable learning hypothesis. SVM’s better speculation execution depends on the standard of structural risk minimization (SRM). The idea of SRM is to expand the edge of class partition. The SVM was characterized for two-class issue and it searched for ideal hyperplane, which amplified the separation, the edge, between the closest cases of the two classes, named SVM. At exhibit, SVM is mainstream grouping instrument utilized for design acknowledgment and other characterization purposes. The standard SVM classifier takes the arrangement of information and predicts to characterize them in one of the main two unmistakable classes. For multiclass order issue, we decay multiclass issue into various paired class issues, and we plan reasonable joined numerous parallel SVM classifiers.

4 Data set

For the purpose of this project, focus is laid mainly on maize leaves. The samples are gathered from a field situated in University of Agricultural Sciences, Dharwad. A total of 1200 deals were gathered with 600 images utilized for preparing and 600 for testing. In any case, non-dominant part of this maize is exhausted clearly by individuals. A bit of the maize production is used for corn ethanol, animal support, and other maize substances, for instance, corn starch and corn syrup. Some of the normal ailments found in maize leaves are northern blight, southern blight, rust, anthracnose, etc. These sicknesses are essentially found in Southern Indian districts.

5 Software Requirements

  • The system will identify whether the leaf image is healthy or diseased.

  • The system will detect different types of diseases that commonly occur in maize.

  • The system is able to detect the percentage of the disease-affected area.

  • The system is able to suggest the right treatment plan for the identified disease in maize leaf.

6 Nonfunctional Requirements

  1. 1.

    Performance: Fast and accurate detection of disease. (3–5 s).

  2. 2.

    Usability: The system will be useful for farmers and agricultural analysts.

  3. 3.

    Reliability: The system will be able to detect the three commonly occurring diseases in maize leaves and suggest the remedy.

    1. a.

      SVM: 93–95% reliable with training data set of 400 images.

    2. b.

      KNN: 86–90% reliable with training data set of 400 images.

  4. 4.

    Availability: The proposed system is readily available on any platform and provides on-demand service.

  5. 5.

    Simplicity: The proposed system is user friendly.

7 Algorithms Used

  1. 1.

    Histogram equalization

Histogram shows the frequency of pixel intensity values. It is used for analysis of image and thresholding. Histogram equalization is used for contrast adjustment.

  1. 2.

    K-Means Clustering

The K-means clustering is utilized for division and characterization of leaves in view of an arrangement of features, into K number of groups. The classification is done by minimizing clusters.

The algorithm for K-means clustering:

  • Choose the center of K cluster, randomly or based on some logic.

  • Relocate each pixel in the leaf image to the cluster that minimizes the distance between the pixels and the cluster center.

  • Recompute the cluster centers by averaging all of the pixels in the cluster.

  • Repeat steps 2 and 3 until effective clustering is achieved.

  1. 3.

    Haar Wavelet

The Haar wavelet is used for texture analysis. In this method, the preprocessed image is used to obtain the horizontal, vertical, and diagonal coefficients. The obtained coefficients are used for texture analysis.

  1. 4.

    Gray-Level Co-occurrence Matrix (GLCM)

It is a statistical method of examining texture that considers the spatial relationship of pixels. GLCM is used to calculate how often a pixel with gray-level value ‘i’ occurs either horizontally, vertically, or diagonally to an adjacent pixel with gray-level value ‘j’.

Statistics derived from the GLCM include contrast, correlation, energy, homogeneity, etc., of the image.

  1. 5.

    Support Vector Machine (SVM)

Linear support vector machine is used for classification of leaf diseases. SVM is a binary classifier which uses a hyperplane called the decision boundary between two classes. It maximizes the margin around the separating hyperplane.

  1. 1.

    Mean: It measures the mean value of all pixels in the relationships that contributed to the GLCM. It can be measured using the following formula:

    where

    P ij :

    Element i, j of the image.

    N :

    Number of gray levels

    $$ \mu = \mathop \sum \limits_{i,j = 0}^{N - 1} i\left( {P_{ij} } \right) $$
    (1)
  2. 2.

    Variance: The variance of the intensities of all the reference pixels is calculated using the following formula:

    $$ \sigma^{2} = \mathop \sum \limits_{i,j = 0}^{N - 1} P_{ij} \left( {i - \mu } \right)^{2 } $$
    (2)
  3. 3.

    Contrast: Contrast measures the quantity of local change in an image. It reflects the sensitivity of the textures in relation to the changes in the intensity. It returns the measure of intensity contrast between a pixel and its neighborhood. Contrast is 0 for a constant image. It can be measured using the following formula:

    $$ \mathop \sum \limits_{i,j = 0}^{N - 1} P_{ij} \left( {i - j} \right)^{2} $$
    (3)
  4. 4.

    Energy: Energy also means uniformity. It returns the sum of squared elements in the GLCM. The more homogenous the image is, the larger the value. When energy equals to 1, the image is believed to be a constant image. It can be measured using the following formula:

    $$ \mathop \sum \limits_{i,j = 0}^{N - 1} \left( {P_{ij} } \right)^{2} $$
    (4)
  5. 5.

    Correlation: This feature is used to measure how correlated a pixel is to its neighborhood. Correlation is 1 or −1 for a perfectly positive or negative correlated image. It can be measured using the following formula:

    $$ \mathop \sum \limits_{i,j = 0}^{N - 1} P_{ij} \frac{{\left( {i - \mu } \right)\left( {j - \mu } \right)}}{{\sigma^{2} }} $$
    (5)
  6. 6.

    Homogeneity: Homogeneity measures the similarity of pixels. A diagonal gray-level co-occurrence matrix gives homogeneity of 1. It becomes large if local textures only have minimum changes. It can be measured using the following formula:

    $$ \mathop \sum \limits_{i,j = 0}^{N - 1} \frac{{P_{ij} }}{{1 + \left( {i - j} \right)^{2 } }} $$
    (6)

8 Result

Sample 1 (Fig. 2).

Fig. 2
figure 2

a Input leaf and b Segmented leaf

Sample 2 (Fig. 3).

Fig. 3
figure 3

a Input leaf and b Segmented leaf

Sample 3 (Fig. 4).

Fig. 4
figure 4

a Input leaf and b Segmented leaf

Sample 4 (Fig. 5).

Fig. 5
figure 5

a Input leaf and b Segmented leaf

Sample 5 (Fig. 6).

Fig. 6
figure 6

a Input leaf and b Segmented leaf

Sample 6 (Fig. 7).

Fig. 7
figure 7

a Input leaf and b Segmented leaf

Sample 7 (Fig. 8).

Fig. 8
figure 8

a Input leaf and b Segmented leaf

Sample 8 (Fig. 9 and Table 2).

Fig. 9
figure 9

a Input leaf and b Segmented leaf

Table 2 GLCM values of different samples

9 Conclusion

This work deals with crop yield detection based on the healthiness of leaves using image texture features. The texture features are Harlick features in addition to Haar wavelet features. The classification algorithm is a combinational one, SVM and K-means classification. The experiment is carried out on maize leaves database which is collected from Dharwad University. The results are found satisfactory. However, the accuracy may be improved by novel algorithms for texture features.