Keywords

1 Introduction

The agricultural sector holds significant importance in the economy, serving as the primary means of sustenance for a significant portion of the population and playing a vital role in livelihoods worldwide. Here’s a brief overview of agriculture in India and the world.

Agriculture in the World: Over 26% of the world's workforce is employed in the agriculture sector, which is an important one. The agricultural sector only made up 4.00% of the global GDP, while in low-income nations it accounts for an average of 30.00% of the GDP. International trade in agriculture contributes to meeting the various demands of nations and the availability of food [1].

Agriculture in India: In India, agriculture employs about 50% of the population and generates close to 17.5% of the GDP. Figure 1 shows the contribution of different sectors to the GDP of India over the years. It plays a crucial role in the country's development, alleviation of poverty, and food security. India is a major producer of several goods, including rice, wheat, pulses, oil seeds, fruits, and vegetables [2].

Fig. 1
A grouped bar graph of contribution to G D P versus years from 1950 to 2016. Agriculture contributes 55% from 1950 to 1951. Manufacturing contributes 15 to 30% from 1950 to 2016. Services contribute a maximum of 58% from 2010 to 2011. Values are estimated.

Sector-wise contribution to GDP of India [2]

We can see agriculture holds great significance on a global scale and any disruption in agriculture can have significant impacts on various aspects, including food security, economy, environment, and social well-being. Agriculture faces various challenges, including fragmented landholdings, dependence on monsoons, crop damage due to plant diseases, water scarcity, inadequate infrastructure, post-harvest losses, farmer indebtedness, and market volatility. One such biggest challenge faced in agriculture is plant diseases. Plant diseases can lead to significant economic losses by reducing crop yields, quality, and market value. They can affect both food crops and cash crops, impacting farmers’ livelihoods and global food supplies.

The five main crops grown across the world include wheat, rice, maize, soybeans, and potatoes—contributing roughly 18.3, 18.9, 5.4, 3.3, and 2.2 percent of the calories consumed worldwide, respectively. Each of these crops has an estimated 21.5, 30.0, 22.6, 21.4%, and 17.2% loss worldwide as a result of illnesses and pests infecting these plants [3]. Figure 2 shows the percentage of crop loss caused by different plant pathogens.

Fig. 2
A donut chart provides the distribution of crop loss by plant pathogens. The percentage values are fungi, 42, bacteria, 27, viruses, 18, and nematodes, 13.

Crop loss by plants pathogens [4]

At the current time deep learning [5] has spread across different domains and areas with advancements in hardware and technologies and the same can be employed in the field of agriculture. In this paper, we try to review different plant disease detection techniques based on deep learning. This study's key contribution can be summed up as follows:

  • To provide a comprehensive review of the current research work by utilizing deep learning for plant disease detection.

  • Study and discuss the pros and cons of different models and architectures proposed for plant disease detection.

  • Identify the challenges and limitations for future research direction.

The paper is organized with Sect. 2 giving background about plant diseases and deep learning and its components. Section 3 provides information on different plant disease datasets available and the latest research work done utilizing deep learning for plant disease detection. Section 4 describes the various limitations and research gaps that need to be addressed in the future. Section 5 gives the conclusion of this study work.

2 Background

2.1 Plant Diseases

Plant diseases refer to the abnormal conditions or disorders that affect plants, leading to a decline in their health, growth, and productivity. These diseases can be caused by various factors, including pathogens, parasitic plants, and abiotic factors (such as nutrient deficiencies, extreme temperatures, and pollution). Here, we discuss different types of plant diseases [6]:

  • Fungal Diseases: Fungi can impact various parts of plants including leaves, stems, fruits, and roots. Some common types of plant fungal diseases include powdery mildew, rust, downy mildew, smuts, leaf spots, and root rots.

  • Bacterial Diseases: Bacteria can infect plants and cause various diseases like bacterial blight, bacterial canker, crown gall, and bacterial wilt. Wilting, leaf spots, cankers, or galls are some examples of the symptoms of bacterial infections.

  • Viral Diseases: Viruses, such as mosaic viruses, leaf curl viruses, and necrotic ring spot viruses, are contagious agents that can harm plants. Symptoms of viral infections frequently include stunted or deformed development, yellowing, mosaic patterns, and leaf mottling.

  • Nematode Diseases: Microscopic worms that can parasitize plant roots are the cause of nematode infections, which include cyst nematodes and root-knot nematodes. Plants that are impacted could exhibit signs of growth stunting, root galling, or nutritional shortages.

  • Parasitic Plant Diseases: Diseases caused by parasitic plants include dodder and witchweed, which physically attach to their host plants and siphon off nutrients and water.

  • Abiotic Diseases: Various abiotic factors like extreme temperatures, nutrient imbalances or deficiencies, water stress, chemical toxicity, or air pollution can cause plant illnesses. These illnesses can cause tissue death (necrosis), chlorosis (yellowing), or an overall decline in plant health.

It's important to note that the specific types and names of plant diseases can vary across different plant species, regions, and environmental conditions. Figure 3 shows sample images of some of the plant diseases. Proper diagnosis and identification of plant diseases are crucial for implementing effective disease management strategies.

Fig. 3
A collage of 8 photographs of different leaves with different diseases. The labels under the photos are tomato late blight, tomato leaf mold, tomato curl virus, tomato mosaic virus, tomato bacterial spot, bell pepper bacterial spot, potato early blight, and potato late blight.

Sample images of plant diseases

2.2 Deep Learning

Deep learning is a subfield of machine learning that focuses on building artificial neural networks that can learn from complicated data and make predictions based on that data. These networks are modeled after the structure and operation of the neural network in the human brain. The key components of a deep learning architecture are as follows:

  • Input Layer: It is the first layer in the model that receives the input data or features that are fed into the deep learning model. A feature or property of the input data is represented by each neuron in the input layer. The input layer is connected to hidden layers.

  • Hidden Layers: Deep learning models have one or more hidden layers associated with them. The hidden layer is composed of multiple neurons that are connected to one another. The weighted sum of the inputs received from the last layer is performed for each neuron in the hidden layer, which is followed by applying the activation function to finally produce an output. The hidden layer helps the network learn complex information and extract hierarchical features from the input data.

  • Activation Functions: Activation functions are used to add nonlinearity to the network. Some of the commonly used activation functions are sigmoid, hyperbolic tanh, Rectified Linear Unit, etc. This nonlinearity helps the network learn the complex patterns and relationships from the data.

  • Weight Parameters: The connections between neurons in different layers are associated with weight parameters. These weights are adjusted to minimize the loss function during the training of the model, using techniques like gradient descent and backpropagation.

  • Output Layer: It is the final layer of the model which produces the final output or predictions based upon learned knowledge from the previous layers. The activation function and neurons in the final layer depend upon the nature of the task being performed.

  • Loss Function: The difference between the actual output and the expected output is measured by the loss function. During model training, the weights are adjusted using the gradients of the loss function to enhance the model's ability to predict outcomes.

  • Optimization Algorithm: Optimization algorithms are used to adjust the weights and bias associated with the neurons using the gradients of the loss function. Some commonly used optimization algorithms are stochastic gradient descent, Adam, AdaGrad, etc. These algorithms decide how much weight should be updated and in what direction to iteratively boost the model's performance.

Deep learning architectures can vary depending on the specific task or application. Well-known architectures encompass convolutional neural networks (CNNs) [7] that excel in image and video processing, recurrent neural networks (RNNs) [7] which are ideal for handling sequential data, and transformer models that are widely used in natural language processing tasks. These architectures incorporate specialized layers and components to handle the unique characteristics and challenges of different data types and tasks. Figure 4 shows a general deep learning framework for plant disease detection.

Fig. 4
A workflow. Sample images enter training, validation, and test sets. The flow from the training set includes training, evaluation, and prediction. From validation, it is fine-tuned and trained. From the test, it directly enters the prediction stage which classifies healthy and diseases 1 to N.

General deep learning framework for plant disease detection

3 Related Work

3.1 Literature Survey

In recent years, extensive work has been done using deep learning toward plant disease detection. This literature review seeks to give an overview of the major advancements and developments in deep learning-based plant disease detection. The literature survey highlights the rapid advancements and significant contributions of deep learning techniques in plant disease detection. Deep learning-based models tend to show better accuracy and performance in comparison with other methods provided quality data is available to train and evaluate the model. Some of the recent works are discussed below:

Gehlot et al. [8] proposed an architecture named EffiNet-TS, which consists of two classifiers and one decoder. The proposed model is based on state-of-the-art Teacher/Student architecture built around EfficientNetV2. The suggested model highlights the important feature for classifying plant disease, which improves classification and offers a clearer visual representation of specific plant disease symptoms. Many authors have used state-of-the-art object detection algorithms with some internal modifications to improve the performance. Li et al. [9] proposed an integrated model that combines single-stage and two-stage target detection networks. The single-stage network is built upon the YOLO with internal structure optimization. The two-stage network, on the other hand, is based on the Faster R-CNN (Region Convolutional Neural Network). Initially, the target frames are clustered using clustering techniques followed by the integration of two models to perform the disease detection task. Another author Mahum et al. [10] proposed a model based on DenseNet-201 for classifying potato leaves into five different categories.

Some of the authors have proposed a novel deep learning model for plant disease detection like Yu et al. [11] and Ramamoorthy et al. [12] both the authors proposed a novel deep learning-based model with the former proposing a model based on inception convolution and vision transformer and the latter based on MobileNet V1 architecture. Wang et al. [13] proposed a lightweight model which is based on state-of-the-start YOLOV5 architecture for plant disease detection which has better accuracy and performance in comparison with other state-of-the-art techniques. Another author Elaraby et al. [14] proposed a model based on AlexNet for classifying the diseases in plants and use of Particle Swarm Optimization (PSO) for feature selection which helped in optimizing the performance of the overall model.

Saleem et al. [15] proposed a model named region-based fully convolution network (RFCN). The author also studied the use of different data augmentation techniques and the effect of hyperparameter tuning on the performance of the model. Shah et al. [16] proposed a model named ResTS (Residual Teacher/ Student) which is based on CNN architecture. It consists of two classifiers and a decoder. The proposed model is capable of finer visualization for disease detection. Panchal et al. [17] performed a comparative study on four models, namely Inception-v3, ResNet50, VGG16, and VGG19. The author also performed parameter tuning on these models to obtain better results and gave insights for each of these models.

The studies discussed demonstrate the effectiveness of deep learning models, such as CNNs, in accurately identifying and classifying plant diseases across various crops and imaging modalities. In Table 1, we highlight the key findings of our literature review and perform a comparative study of the different models proposed for plant disease detection. The various deep learning approaches that have been studied can prove to be highly beneficial to the farmers in monitoring and identifying the different types of diseases and taking appropriate action.

Table 1 Literature survey of recent deep learning-based plant disease detection

3.2 Datasets

Datasets play a crucial role in deep learning-based models, and they tend to show better accuracy and performance in comparison with other methods provided quality data is available to train and evaluate the model. Some of the commonly used datasets are discussed in brief and mentioned in Table 2.

Table 2 Datasets used in plant disease detection

The Plant Pathology 2021—FGVC8 [18] is a dataset for apple foliar disease detection in apple plants released in the year 2021. Another dataset released in the year 2021 is the DiaMOS [19] plant dataset belonging to pear fruit which consists of 3505 images of leaves belonging to four different classes of disease.

The PlantDoc [20] dataset consists of 2598 images of 13 plant species belonging to 17 different classes of disease released in the year 2019. RoCoLe [21] dataset was also released in the year 2019 focusing on the Robusta Coffee Leaf image divided into 6 classes. BRACOL [22] dataset was also released in the year 2019 consisting of 1747 images focusing on Arabica coffee leaves affected by different diseases belonging to five different classes.

PlantVillage [23] is one of the most extensively used datasets in different studies. The PlantVillage dataset consists of high-quality images of diseased and healthy plant leaves captured under controlled conditions. It consists of a total of 54,303 images spanning across 14 crop species. The dataset contains images depicting a wide range of plant diseases caused by fungi, bacteria, viruses, and other pathogens.

4 Research Gap

From the above literature review in Table 1, we identify various limitations that can be addressed in the future. Following are the research gaps we could identify from this study:

  • Generalization: Most of the research work has been performed using the PlantVillage dataset but it may not represent a practical real-world scenario as it was developed under a controlled environment. The generality of the algorithms is impacted by this issue, making them unsuitable for real-world deployment.

  • Limited scope: Most of the study focus only on plant leaf disease detection but diseases may also be associated with other organs of plant like stem, fruits, etc. Hence there is a need for comprehensive plant disease detection beyond the leaf organ of plants.

  • Recognition efficiency at the cost of inference efficiency: The vast majority of current studies concentrate on recognition efficiency while ignoring inference efficiency, which restricts their practical real-world application.

  • Small datasets: Dataset is very crucial for deep learning-based models for better performance and accuracy. There are limited and small datasets for plant disease detection which needs to be addressed.

  • Multiple disease detection: Furthermore, it hasn't been done to simultaneously detect several illnesses in a single plant organ. Also, the effectiveness of the same optimized/modified model has not been studied in complicated horticulture settings consisting of different crops.

5 Conclusion

This paper aims to provide a comprehensive review of the latest research work by utilizing deep learning for plant disease detection. Large datasets and advances in deep learning architectures have allowed researchers to detect plant diseases with high accuracy rates, outperforming earlier techniques and enabling quick and accurate diagnosis. Deep learning is advantageous for detecting plant diseases because it has the potential for real-time monitoring and is scalable for use in large-scale agricultural applications. Deep learning algorithms can be used to develop intelligent systems that can identify diseases early on, assisting with timely disease management decisions. The various deep learning approaches that have been studied can prove to be highly beneficial to the farmers in monitoring and identifying the different types of diseases and taking appropriate action, which would ultimately prevent crop wastage and financial loss to the farmer. After a thorough analysis and study, we could identify some limitations and research gaps that could be addressed in the future work. Future research directions might also involve combining deep learning with cutting-edge technologies like drones and the Internet of Things (IoT).

In conclusion, deep learning-based plant disease detection has enormous potential to transform crop protection and disease management techniques. Deep learning techniques will continue to evolve, along with the incorporation of complementing technology, opening the door for more precise, effective, and sustainable agricultural practices that will ultimately improve crop health and contribute to global food security.