Automatic Classification of Medicinal Plants of Leaf Images Based on Convolutional Neural Network

Berihu, Mengisti; Fang, Juan; Lu, Shuaibing

doi:10.1007/978-981-16-9709-8_8

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1496)

Included in the following conference series:

CCF Conference on Big Data

Abstract

Plants are the basis of all living things on earth: they supply us with oxygen, food, shelter, and medicine, and they protect the planet from damage caused by climate change. With respect to their medicinal uses, limited access to proper medical centers in many rural areas and developing countries has made traditional medicine the preferred choice of the community. Their milder side effects and affordability also play a big role. In Ethiopia, more than half of the population uses medicinal plants, directly or indirectly, for personal use and for animals. However, accurate identification of medicinal plants has always been a challenge for both manual identification and automatic recognition systems, mainly because of the large gap in knowledge transfer between the knowledge holders (traditional physicians, the elderly) and modern science. Several studies have addressed automatic plant recognition using different feature extraction methods and classification algorithms. In this paper, a novel dataset of Ethiopian medicinal plants whose leaves are used as medicine is used to classify the plants automatically from their leaf images. Leaf images of medicinal plants in Ethiopia were collected, split into training and test sets, and classified using the convolutional neural network models GoogLeNet and AlexNet. The networks were fine-tuned by adjusting hyper-parameters such as the learning rate, the number of epochs, and the optimizer. Image augmentation was also applied to enlarge the dataset. Experiments with the augmented dataset and more training epochs gave better classification performance and accuracy.
Of the two selected convolutional neural network models, the best model was determined based on accuracy and loss; in the experiments conducted, the best model was GoogLeNet, with an accuracy of 96.7%, and it was chosen to develop a web-based automatic medicinal plant classification system.

S. Lu—Contributed equally to this work.

1 Introduction

Plants are the basis of all living things on earth, providing oxygen, food, shelter, and medicine, and protecting the earth from climate change and natural disasters. It is therefore becoming increasingly important to identify plants and preserve them [1]. One of the benefits of plants is their medicinal value; humans have been using medicinal plants for centuries. Medicinal plants have been used in both curative and preventive therapeutic preparations for humans and animals, and also for the extraction of important bioactive compounds [2, 3]. It is estimated that nearly 80% of the world’s population regularly uses traditional medicine and its products for healthcare needs, especially in third world countries [4]. People in developing countries combine conventional medicine (exercise, meditation, lifestyle) with traditional medicines to treat themselves. Medicinal plants are preferred mostly because of their milder side effects and affordability. Moreover, in rural areas, where proper healthcare is hard to access because healthcare facilities are concentrated in cities, people tend to depend on traditional medicinal plants and on traditional physicians, who they believe understand their culture, their symptoms, and their environment. Ethiopia is one of the countries in which medicinal plants play an important role in primary health care. About 95% of traditional medicines in Ethiopia are of plant origin [5]. Scientific investigation of millennia-old community knowledge on plant use is essential to define the cultural identity of a particular community and to understand links to its history, land and plant use practices, and traditional environmental philosophy.
The knowledge of traditional medicinal plants of Ethiopia, which has existed for centuries, is now facing extinction, because it has mainly been stored in the memories of elderly people and handed down verbally for generations [6]. Although medicinal plants play a significant role in supporting primary healthcare in Ethiopia, few attempts have been made to scientifically identify, document, and promote the widely used medicinal plants and the associated knowledge dynamics in the country. Hence, appropriate investigation, identification, documentation, and use of this knowledge, linked with modern approaches to medicinal plants, is needed. Such work also provides the opportunity for recognition, promotion, management, and protection of a community’s indigenous knowledge of medicinal plants as a vital part of a nation’s heritage, besides calling on policymakers, natural resource managers, stakeholders, and cultural practitioners to take conservation action [6]. This study aims to classify eight medicinal plants based on their leaf shape, texture, and color. Our contributions are as follows: (1) We develop a novel dataset based on Ethiopian medicinal plants. (2) We fine-tune the architectures by changing the output classification layer, choosing between optimizers such as SGD and Adam, and adjusting the learning rate. We also apply data pre-processing techniques such as image augmentation and image enhancement, and treat the number of training epochs as a variable, to get better results. (3) We use two convolutional neural network models to train and classify the dataset, compare them on accuracy, and choose the better one to build a web-based medicinal plant identification system.

2 Related Work

This section reviews techniques used by researchers for identifying and classifying plants using machine learning methods. Machine learning techniques have been used for some time in the classification, recognition, and disease detection of plant species. Du et al. (2007) used leaf images of 20 plant species for plant identification based on digital morphological features; K-nearest neighbor was used for classification and achieved 93% accuracy [8]. Herdiyeni and Wahyuni (2012) used a fusion of fuzzy local binary pattern and fuzzy color histogram features, with a probabilistic neural network (PNN) for classification, on a dataset containing 2448 leaf images of medicinal plants from Indonesian forests, achieving an accuracy of 74.5% [7]. In the work of ArunPriya C. and Balasaravanan, 12 features are orthogonalized using principal component analysis and fed into a support vector machine (SVM) classifier, achieving 96.8%, far better than K-NN, which achieved 81.3% [8]. Jiachun Liu et al. (2018) proposed a novel classification method based on a ten-layer convolutional neural network and applied it to the Flavia dataset, which has 4800 images of 32 different kinds of plant leaves, achieving 87.92% accuracy [9]. Sue Han Lee et al. [10] used deep learning in a bottom-up and top-down manner to classify 44 different plant species: convolutional neural networks are used to learn the features and classify them, and a deconvolutional network is used to visualize the learned features. This work attained 99.5% accuracy in identifying the plant species. Ferreira, Alessandro Dos Santos, et al. [11] used a convolutional neural network to identify weeds in soybean crops. More than 15,000 images of soybean crops, grass weeds, and broadleaf weeds were used to train the chosen classification architecture, CaffeNet.
Baizel Kurian Varghese et al. proposed a convolutional neural network pre-trained on the ILSVRC 2012 dataset, in which the final fully connected layer of 1000 neurons is replaced with one of 44 neurons. A total of 34,672 leaf images were used for training and more than 8800 for testing, achieving an accuracy of 99.5%. Using a MobileNet architecture with the same dataset, they obtained correct predictions with an accuracy of 99% [12, 13]. This paper compares two convolutional neural network models, AlexNet and GoogLeNet, on a novel image dataset of Ethiopian medicinal plants; variables such as image pre-processing techniques (for example, image augmentation) and the number of training epochs are taken into consideration in choosing the better of the two models.

3 The Proposed System

In this part, we present the system architecture and the deep learning techniques. Figure 1 shows the main architecture of the proposed system. The proposed architecture involves two phases: the training phase and the identification phase. In the training phase, images are acquired by collecting leaf images of medicinal plants, and techniques such as image enhancement, image resizing, and image segmentation are used to preprocess them. Feature extraction is done by the selected CNN models, AlexNet and GoogLeNet. The identification phase involves the classification and labeling of medicinal plants based on the trained CNN models. Each CNN model is trained on the prepared dataset, from which features are extracted and the images are classified by plant name.

3.1 Dataset Preparation

Based on the expertise of lecturers and taxonomists from the Department of Plant Biology and Biodiversity Management at Addis Ababa University, a list of plants that are used as medicine was prepared, with a possible location for each. From a list of more than 80 medicinal plants, those whose leaves are used as medicine were selected and located. Eight medicinal plants were chosen for the research, and the research method used is experimental. More than 2100 leaf images of the selected medicinal plants were collected from different parts of Addis Ababa, Ethiopia. The leaves were photographed with a Redmi mobile phone with a 64 MP (megapixel) camera. The dataset was then divided into two parts: 80% is used for training, while the remaining 20% is used as the test and validation set. The total number of medicinal plant leaf images is shown in Table 1.
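The 80/20 division described above can be sketched as a per-class (stratified) split. This is only an illustration: the paper does not say whether the split was stratified, and the function name `stratified_split`, the seed, and the file names below are hypothetical.

```python
import random
from collections import defaultdict


def stratified_split(samples, train_fraction=0.8, seed=0):
    """Split (path, label) pairs into train and held-out lists, class by class."""
    by_class = defaultdict(list)
    for path, label in samples:
        by_class[label].append(path)

    rng = random.Random(seed)  # fixed seed so the split is reproducible
    train, held_out = [], []
    for label in sorted(by_class):
        paths = sorted(by_class[label])
        rng.shuffle(paths)
        cut = int(round(train_fraction * len(paths)))
        train += [(p, label) for p in paths[:cut]]
        held_out += [(p, label) for p in paths[cut:]]
    return train, held_out


# Hypothetical file names: 8 classes with 10 images each.
samples = [(f"class{c}/leaf_{i:03d}.jpg", c) for c in range(1, 9) for i in range(10)]
train, held_out = stratified_split(samples)
print(len(train), len(held_out))  # 64 16
```

Splitting class by class keeps the class proportions the same in both parts, which matters when some plants have fewer images than others.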

3.2 Image Pre-processing

The collected images are first cleaned in a way that does not affect the labeled images. Since deep learning performance improves with the amount of data the algorithm is fed, the total number of images was increased using image augmentation. Augmentation techniques such as horizontal flip, vertical flip, and rotation are used. In addition, the images are prepared to be suitable input for each model, including resizing. All images are resized to the input sizes expected by AlexNet and GoogLeNet, which are 227 × 227 and 224 × 224 respectively. Table 2 shows the number of images in each class after image augmentation.
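As a rough illustration of the flip and rotation operations named above, the following sketch treats an image as a nested list of pixel values. It is not the authors' pipeline: a 90° rotation stands in for the rotation step (arbitrary angles need interpolation), and all function names are hypothetical.

```python
def horizontal_flip(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def vertical_flip(image):
    """Reverse the order of the rows (top to bottom)."""
    return image[::-1]


def rotate_90_clockwise(image):
    """Rotate by 90 degrees clockwise: reversed rows become columns."""
    return [list(col) for col in zip(*image[::-1])]


def augment(image):
    """Return the original image plus three transformed copies."""
    return [image, horizontal_flip(image), vertical_flip(image),
            rotate_90_clockwise(image)]


# Input sizes quoted in the text for the two networks.
INPUT_SIZE = {"AlexNet": (227, 227), "GoogLeNet": (224, 224)}

tiny = [[1, 2],
        [3, 4]]
for variant in augment(tiny):
    print(variant)
```

Each original image yields several training samples with the same label, which is how augmentation enlarges the dataset without new photographs.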

Table 1. Total number of images before data augmentation

Table 2. The total number of images after augmentation

3.3 Experiment Parameters

Two CNN architectures, AlexNet and GoogLeNet, were fine-tuned and used. The batch size was 64 for AlexNet and 16 for GoogLeNet. Adam and SGD are used as optimization algorithms; Adam computes an adaptive learning rate for each parameter, which helps improve the accuracy of the model. For both models, the output classification layer is adjusted to the number of classes, which is eight. A dropout layer with a probability of 40% is used to increase the accuracy of the model. After comparing the accuracy of the two models, the one with better performance is chosen to be trained on the augmented data. To increase the size of the training dataset and to avoid overfitting, data augmentation is used with the following settings: horizontal flip, zoom range and shift range of 0.2 (width and height shifting), and rotation range of 30°. The number of training epochs varied from 50 to 200 for both architectures.
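To make the difference between the two optimizers concrete, here is a minimal sketch of a single SGD step and a single Adam step on two parameters. The learning rates and the gradient values are illustrative assumptions (the paper does not report them); the Adam constants are the commonly used defaults.

```python
import math


def sgd_step(params, grads, lr=0.01):
    """Plain SGD: every parameter moves with the same learning rate."""
    return [p - lr * g for p, g in zip(params, grads)]


def adam_step(params, grads, state, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update; `state` holds the step count and moment estimates."""
    state["t"] += 1
    t = state["t"]
    new_params = []
    for i, (p, g) in enumerate(zip(params, grads)):
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g
        m_hat = state["m"][i] / (1 - beta1 ** t)  # bias-corrected first moment
        v_hat = state["v"][i] / (1 - beta2 ** t)  # bias-corrected second moment
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
    return new_params


params = [1.0, 1.0]
grads = [0.001, 10.0]  # illustrative gradients of very different magnitude
state = {"t": 0, "m": [0.0, 0.0], "v": [0.0, 0.0]}

sgd_moves = [p - q for p, q in zip(params, sgd_step(params, grads))]
adam_moves = [p - q for p, q in zip(params, adam_step(params, grads, state))]
print(sgd_moves)
print(adam_moves)
```

With SGD the step is proportional to the raw gradient, so the two parameters move by very different amounts. Adam divides by a running estimate of the gradient magnitude, so on this first step both parameters move by roughly the learning rate.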

3.4 Training and Classification of the Models

In this study, two convolutional neural network architectures are used to train on and classify the leaf images of medicinal plants that were collected and preprocessed for identification. The two networks, AlexNet and GoogLeNet, differ in general architecture. GoogLeNet has Inception modules, which apply several different convolutions in parallel and concatenate the resulting filter outputs for the next layer. In AlexNet, on the other hand, each layer takes its input from the single previous layer, with no filter concatenation. In this paper, both models are trained on the collected images to determine which performs better in terms of accuracy, loss, and validation. Because GoogLeNet gave better accuracy, it is chosen to train on and classify the medicinal plants and to serve as the model behind the web-based classification system.
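The structural difference described above, a plain layer stack versus parallel branches whose outputs are concatenated, can be sketched with toy layers. Nothing here performs real convolution: `make_layer` is a hypothetical stand-in that only produces the right number of output channels. The branch widths 64, 128, 32, and 32 are those of the "inception (3a)" module in the original GoogLeNet design and are used purely for illustration.

```python
def sequential(x, layers):
    """AlexNet-style: each layer consumes only the previous layer's output."""
    for layer in layers:
        x = layer(x)
    return x


def inception_module(x, branches):
    """Inception-style: every branch sees the same input; the branch outputs
    are concatenated along the channel axis."""
    out = []
    for branch in branches:
        out.extend(branch(x))
    return out


def make_layer(n_filters):
    """Toy stand-in for a convolution: returns n_filters output channels,
    each just the sum of the input channels (no real filtering)."""
    return lambda channels: [sum(channels)] * n_filters


x = [1.0, 2.0, 3.0]  # three input channels, one value per channel
layers = [make_layer(64), make_layer(128), make_layer(32), make_layer(32)]

# Used as parallel branches, the channel counts add up: 64 + 128 + 32 + 32.
print(len(inception_module(x, layers)))  # 256

# Used as a plain stack, only the last layer's width survives.
print(len(sequential(x, layers)))  # 32
```

The same four layers give 256 output channels when concatenated and 32 when chained, which is the sense in which the two architectures pass information to the next layer differently.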

4 Experiment and Results

In our experiment, 1740 leaf images of the original dataset are used for training and 432 leaf images are used for testing the performance of the system. For AlexNet, 42 samples are misclassified: 5 in class 1, 3 in class 2, 8 in class 3, 9 in class 4, 6 in class 5, 2 in class 6, 4 in class 7, and 5 in class 8, as shown in Table 3. The classification accuracy for each class is reported as 90.7%, 89.4%, 85.1%, 83.3%, 88.8%, 96.2%, 96.2%, 92.5%, 90.7% from class 1 to class 8 respectively, which gives an overall average accuracy of 90.2%. For GoogLeNet, 30 samples in total are misclassified: 4 in class 1, 4 in class 2, 5 in class 3, 6 in class 4, 2 in class 5, 3 in class 6, 5 in class 7, and 3 in class 8, as shown in Table 3. The classification accuracy for each class is 94.4%, 92.4%, 90.7%, 88.8%, 96.2%, 94.2%, 90.0%, 94.4% from class 1 to class 8 respectively, which gives an overall average accuracy of 92.8%. The classification results for both AlexNet and GoogLeNet before data augmentation are shown in Table 3 below.

Table 3. Classification results of the proposed architectures.

Based on these results, GoogLeNet is chosen as the preferred CNN architecture to train on and classify the medicinal plants with the augmented dataset and fine-tuned algorithm. The numbers of training epochs were 100, 150, and 200, for which the accuracies were 89.8%, 94.3%, and 96.7% respectively.
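As a small worked example of how an overall accuracy follows from misclassification counts, the sketch below uses the AlexNet counts and the test-set size given earlier in this section. The function name `overall_accuracy` is hypothetical; the per-class sizes are in Table 3 and are not assumed here.

```python
def overall_accuracy(n_test, n_misclassified):
    """Fraction of test samples that were classified correctly."""
    return (n_test - n_misclassified) / n_test


# Misclassified AlexNet samples for classes 1..8, as listed in the text.
alexnet_errors = [5, 3, 8, 9, 6, 2, 4, 5]
n_test = 432  # size of the test set stated in the text

total_errors = sum(alexnet_errors)
accuracy = overall_accuracy(n_test, total_errors)
print(total_errors, f"{accuracy:.4f}")  # 42 0.9028
```

The counts sum to the 42 errors stated for AlexNet, and 390 correct out of 432 gives about 0.9028, in line with the reported overall accuracy of 90.2%.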

5 Conclusion

In this paper, an automatic classification of Ethiopian medicinal plants based on convolutional neural networks is proposed. We first pre-process the images by resizing, enhancement, data augmentation, and normalization, and we use fine-tuning to improve the accuracy of the model. Two convolutional neural network architectures, AlexNet and GoogLeNet, are used to train on and classify the medicinal plants, and the results are compared in terms of accuracy and validation loss. Our experimental results showed that GoogLeNet classified the targets more accurately, and that its performance improved as the dataset and the number of epochs grew. GoogLeNet reached 96.7% training accuracy with 200 epochs and is used in a web-based application that enables users to upload a leaf image of a medicinal plant and identify the plant, along with a description of its use.

References

1. Cope, J.S., Corney, D., Clark, J.Y., Remagnino, P., Wilkin, P.: Plant species identification using digital morphometrics: a review. Expert Syst. Appl. 39(8), 7562–7573 (2012)
2. Lambert, J., Srivastava, J., Vietmeyer, N.: Medicinal Plants: Rescuing a Global Heritage, vol. 355. World Bank Publications (1997)
3. Thirumalai, T., Kelumalai, E., Senthilkumar, B., David, E.: Ethnobotanical study of medicinal plants used by the local people in Vellore District, Tamilnadu, India. Ethnobotanical Leaflets 13(10), 1302–1311 (2009)
4. Musila, W., Kisangau, D., Muema, J.: Conservation status and use of medicinal plants by traditional medical practitioners in Machakos District, Kenya. Nat. Mus. Kenya 22, 12–18 (2002)
5. Demisew, S., Dagne, E.: Basic and applied research on medicinal plants of Ethiopia. In: Proceedings of the National Workshop on Biodiversity Conservation and Sustainable Use of Medicinal Plants in Ethiopia, Addis Ababa (2001)
6. Giday, M., Teklehaymanot, T.: Ethnobotanical study of plants used in management of livestock health problems by Afar people of Ada’ar District, Afar Regional State, Ethiopia. J. Ethnobiol. Ethnomed. 9(1), 1–10 (2013)
7. Lulekal, E., Asfaw, Z., Kelbessa, E., Van Damme, P.: Ethnomedicinal study of plants used for human ailments in Ankober District, North Shewa Zone, Amhara region, Ethiopia. J. Ethnobiol. Ethnomed. 9(1), 1–13 (2013)
8. Du, J.X., Wang, X.F., Zhang, G.J.: Leaf shape based plant species recognition. Appl. Math. Comput. 185(2), 883–893 (2007)
9. Herdiyeni, Y., Wahyuni, N.K.S.: Mobile application for Indonesian medicinal plants identification using fuzzy local binary pattern and fuzzy color histogram. In: 2012 International Conference on Advanced Computer Science and Information Systems (ICACSIS), pp. 301–306 (2012)
10. Priya, C.A., Balasaravanan, T., Thanamani, A.S.: An efficient leaf recognition algorithm for plant classification using support vector machine. In: International Conference on Pattern Recognition, Informatics and Medical Engineering, pp. 428–432, March 2012
11. Liu, J., Yang, S., Cheng, Y., Song, Z.: Plant leaf classification based on deep learning. In: 2018 Chinese Automation Congress, pp. 3165–3169 (2018)
12. Lee, S.H., Chan, C.S., Wilkin, P., Remagnino, P.: Deep-plant: plant identification with convolutional neural networks. In: 2015 IEEE International Conference on Image Processing, pp. 452–456, September 2015
13. dos Santos Ferreira, A., Freitas, D.M., da Silva, G.G., Pistori, H., Folhes, M.T.: Weed detection in soybean crops using ConvNets. Comput. Electron. Agric. 143, 314–324 (2014)
14. Varghese, B.K., Augustine, A., Babu, J.M., Sunny, D., Cherian, S.: Plant recognition using convolutional neural networks. In: Proceedings of the Fourth International Conference on Computing Methodologies and Communication (2020)

Author information

Authors and Affiliations

Faculty of Information Technology, Beijing University of Technology, No. 100 Pingleyuan Street, Beijing, 100124, China
Mengisti Berihu, Juan Fang & Shuaibing Lu

Corresponding author

Correspondence to Juan Fang.

Editor information

Editors and Affiliations

National University of Defense Technology, Changsha, China
Xiangke Liao
Shenzhen University of Technology, Chinese Academy of Sciences, Shenzhen, China
Wei Zhao
University of Science and Technology of China, Hefei, China
Enhong Chen
Sun Yat-sen University, Guangzhou, China
Nong Xiao
Taiyuan University of Technology, Taiyuan, China
Li Wang
Nanjing University, Nanjing, China
Yang Gao
Nanjing University, Nanjing, China
Yinghuan Shi
Sun Yat-sen University, Guangzhou, China
Changdong Wang
Sun Yat-sen University, Guangzhou, China
Dan Huang

About this paper

Cite this paper

Berihu, M., Fang, J., Lu, S. (2022). Automatic Classification of Medicinal Plants of Leaf Images Based on Convolutional Neural Network. In: Liao, X., et al. Big Data. BigData 2021. Communications in Computer and Information Science, vol 1496. Springer, Singapore. https://doi.org/10.1007/978-981-16-9709-8_8

DOI: https://doi.org/10.1007/978-981-16-9709-8_8
Published: 15 January 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-9708-1
Online ISBN: 978-981-16-9709-8
eBook Packages: Computer Science, Computer Science (R0)
