Abstract
Hyperspectral imaging systems make it possible to detect the initial stages of fungal decay in citrus fruit automatically, rather than manually under dangerous ultraviolet illumination, thus preventing the infestation of other sound fruit and the large economic losses that follow. However, these systems have the disadvantage of generating a huge amount of data, from which the relevant information must be selected to obtain results that are useful for the sector. There are numerous feature selection methods for reducing the dimensionality of hyperspectral images. This work compares a feature selection method based on the area under the receiver operating characteristic (ROC) curve with other common feature selection techniques, in order to select an optimal set of wavelengths that is effective in detecting decay in citrus fruit using hyperspectral images. The comparative study uses images of mandarins with the pixels labelled in five different classes (two types of healthy skin, two types of decay, and scars), and shows that the ROC technique generally provides better results than the other methods.
Introduction
Decay caused by fungi is among the main defects affecting the post-harvest and marketing processes of citrus fruit. Infected fruit can be neither stored for a long time nor transported over long distances for export, since a small number of decayed fruit can infect a whole consignment. Thus, fungal infections cause great economic losses to the citrus industry if damaged fruit are not detected early, with Penicillium sp. being the fungi responsible for most post-harvest losses in citrus packinghouses (Eckert and Eaks 1989). In current packing lines, decayed fruit are detected visually by trained operators who examine the fruit as it passes under ultraviolet (UV) light. Nevertheless, this method is subjective and potentially dangerous for human skin. Automatic machine vision systems are a possible way of avoiding these drawbacks.
Technology based on colour cameras has spread rapidly for the detection of skin damage in fruit and vegetables (Zude 2008; Cubero et al. 2011), and is a common technique for the inspection of citrus fruit. For instance, Kondo et al. (2000) studied the possibility of detecting the sugar and acid content of ‘Iyokan’ oranges using a machine vision system and neural networks. Slaughter et al. (2008) developed a non-contact method of detecting freeze-damaged oranges based on UV fluorescence, and López-García et al. (2010) used multivariate image analysis to detect peel diseases in citrus fruit. Nevertheless, decay lesions are difficult to detect using standard artificial vision systems since they are hardly visible to the human eye and, therefore, to standard colour cameras (Fig. 1). Blasco et al. (2007) used visible computer vision to detect different types of damage in citrus fruit, including decay by green mould. While the success rate for other defects was high, the detection of decay was lower than 60 % because the damage caused by this disease to the citrus skin is not clearly visible before sporulation. On the other hand, following the fluorescence technique used in the industry for the detection of decay by human operators, Kurita et al. (2009) tried to detect decay in citrus using two lighting systems (visible and UV), alternating between them while the fruit was in view of the camera.
Hyperspectral sensors have been used successfully as an alternative for detecting non-visible damage on fruit (Lorente et al. 2012). In the particular case of citrus fruit, different works have been carried out to detect decay lesions (Qin et al. 2009, 2012; Gómez-Sanchis et al. 2012). A hyperspectral image consists of a large number of consecutive monochromatic images of the same scene, one per wavelength. It is therefore very important to select only those bands that carry the most relevant information and to discard those that do not contribute significantly to the results because they contain redundant information or are highly correlated with other bands. There are numerous feature selection methods that reduce dimensionality while retaining most of the original information in fewer bands.
For example, Gómez-Sanchis et al. (2008) evaluated four feature selection methods with the aim of selecting an optimal set of wavelengths in the range 460–1,020 nm for detecting decay in citrus fruit. Xing et al. (2005) used principal component analysis (PCA) to reduce data from a hyperspectral imaging system (400–1,000 nm) for detecting bruises on ‘Golden Delicious’ apples. PCA was also used by Liu et al. (2005) to obtain spectral features for the detection of chilling injury in cucumbers imaged using a hyperspectral system (447–951 nm). More recently, Li et al. (2011) used PCA to select the most discriminant wavelengths in the range 400–1,000 nm for detecting various common skin defects on oranges. Partial least squares (PLS) and artificial neural networks (ANN) are other techniques commonly used for feature selection purposes. ElMasry et al. (2008) determined some important wavelengths for detecting bruises in ‘McIntosh’ apples using PLS on hyperspectral images in the range 400–1,000 nm, and ElMasry et al. (2009) used ANN to classify apples into injured and normal classes and to detect changes in firmness due to chilling injury by selecting optimal wavelengths.
Objective
The method used by Lorente et al. (2011) to select the most relevant spectral features for detecting decay in citrus fruit was based on the area under the receiver operating characteristic (ROC) curve, which is a promising measure of the quality of a binary classifier. A novel approach was presented to extend its use to multiclass problems such as the automatic discrimination of decay lesions in citrus fruit. This problem is still under research and is very important from the agricultural point of view, since the damage caused by fungi is hardly visible to the naked eye or to standard vision systems and can spread quickly to other sound fruit during storage. This work aims to compare our novel approach to ROC feature selection with other common feature selection techniques for agricultural multiclass classification problems. We use the detection of decay in citrus fruit using hyperspectral imaging as a benchmark problem, selecting an optimal set of wavelengths that is effective in discriminating between common defects and decay lesions. The purpose of the comparison is to determine whether the ROC method is a promising technique for multiclass classification problems relative to other commonly used methods in terms of classification accuracy.
Material and Methods
Image Acquisition
The hyperspectral imaging system used was based on liquid crystal tunable filters (LCTF; e.g. Lorente et al. 2011). The system consists of a monochrome camera (CoolSNAP ES, Photometrics, Tucson, USA), a lens providing a uniform focus in the working range (Xenoplan 1.4/17MM, Jos. Schneider Optische Werke GmbH, Bad Kreuznach, Germany), and two LCTFs (CRI Varispec VIS07 and NIR07, UK) sensitive to the visible (400–720 nm) and NIR (650–1,100 nm) ranges, respectively. The scene was illuminated by halogen lamps placed inside a hemispherical aluminium dome.
For the hyperspectral images, a total of 240 ‘Clemenules’ mandarins (Citrus clementina Hort. ex Tanaka) collected from a local producer were used, including 60 without visible damage, 60 presenting external scars, 60 inoculated with spores of Penicillium digitatum and 60 inoculated with spores of Penicillium italicum. The inoculation was performed using a suspension of spores with a concentration of \(10^{6}\) spores/ml for both fungi, which is sufficient to cause infestation in laboratory conditions (Palou et al. 2001). The images were acquired by manually presenting the damaged area of the fruit to the camera. A total of 240 hyperspectral images were taken in the range of 460–1,020 nm, with a 10-nm spectral resolution. Each sample pattern in the labelled set consisted of 74 spectral features associated with each pixel (the reflectance level for each acquired band, that is, the grey level in each monochromatic image, plus several spectral indexes) and a class label assigned manually by a human expert. Five different classes were considered in this work: green sound skin (GS), orange sound skin (OS), defective skin by scars (SC), decay caused by P. digitatum (PD) and decay caused by P. italicum (PI).
Feature Selection Methods
The performance of the method based on the area under the ROC curve is compared with that of other common feature selection methods. The methods included in this comparative study are: correlation analysis (Rodgers and Nicewander 1988), mutual information (Bonnlander and Weigend 1994), Fisher’s discriminant analysis (Venables and Ripley 2002), t test (Li et al. 2006), Wilks’ lambda (Ouardighi et al. 2007), Bhattacharyya distance (Choi and Lee 2003), minimum redundancy maximum relevance difference criterion (MRMRd) (Ponsa and López 2007), minimum redundancy maximum relevance quotient criterion (MRMRq) (Peng et al. 2005) and Kullback–Leibler divergence (Kullback 1987; Abe et al. 2000). These feature selection techniques were chosen because they are commonly applied to the analysis of hyperspectral images in the fields of pattern recognition and remote sensing, although they have not been used before for automatic fruit or vegetable inspection using computer vision. Therefore, this work also studies whether they are suitable and accurate methods for this kind of problem.
To obtain a feature selection for each method, two steps were followed: (1) obtaining a ranking of features ordered by their discriminant relevance and (2) selecting an optimal number of features from that ranking. The feature selection methods and the classification procedure used in this work were implemented using Matlab 7.9 (The Mathworks, Inc., Natick, USA).
Step I: Obtainment of a feature ranking
Obtaining a feature ranking for each class is the initial step. The feature selection techniques studied are intended for binary classification problems, but this work deals with problems with more than two classes. Therefore, the one vs. all approach (Rifkin and Klautau 2004) is employed to obtain a partial feature ranking for each class, which maximises the separation between that class and the others. Next, a single global feature ranking is obtained for each method from the relevance values of the partial rankings for each class. These relevance values are weighted in proportion to the relative importance of the class in the problem and combined using Eq. 1, where \( \overline{r}_j \) is the global relevance of feature \( x_j \), \( N \) is the number of different classes, \( r_{jk} \) is the relevance value of feature \( x_j \) from the partial ranking for the kth class, and \( w_k \) is the weight for the kth class.
After obtaining the global relevance of each feature, each input feature is ranked.
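The weighted combination of partial rankings can be sketched in Python as follows. Eq. 1 is not reproduced in this text, so the normalised weighted sum used here, \( \overline{r}_j = \sum_{k=1}^{N} w_k r_{jk} / \sum_{k=1}^{N} w_k \), is an assumption that is merely consistent with the symbol definitions; dividing by the total weight does not change the resulting order of the features. The function names and the toy relevance values are hypothetical, and the work itself was implemented in Matlab.

```python
import numpy as np

def global_relevance(partial_relevance, class_weights):
    """Combine per-class relevance values into one score per feature.

    partial_relevance: array of shape (n_features, n_classes); entry [j, k]
        is the relevance r_jk of feature j in the one-vs-all problem for class k.
    class_weights: array of shape (n_classes,), the weights w_k.
    """
    r = np.asarray(partial_relevance, dtype=float)
    w = np.asarray(class_weights, dtype=float)
    # Assumed form of Eq. 1: weighted sum normalised by the total weight.
    return r @ w / w.sum()

def rank_features(relevance):
    """Return feature indices ordered from most to least relevant."""
    # Negating gives descending order; the stable sort keeps ties in index order.
    return np.argsort(-np.asarray(relevance, dtype=float), kind="stable")

# Toy example with 3 features and 2 classes (made-up relevance values).
r_jk = np.array([[0.9, 0.2],
                 [0.1, 0.8],
                 [0.5, 0.5]])
equal = rank_features(global_relevance(r_jk, [1, 1]))
weighted = rank_features(global_relevance(r_jk, [1, 15]))
```

With equal weights the first feature leads the ranking, whereas giving the second class a much larger weight moves the second feature to the top, which is the effect the class weights are meant to have.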
Step II: Selection of an optimal number of features
Once the global feature ranking has been obtained, a minimum number of features leading to a saturation trend in the success rate of classification is chosen for each method. The success rate is calculated using the first features in the ranking, then successive features are added in an iterative process until the increment of the success rate is lower than a certain threshold (1 %). The n features that satisfy this condition are then selected.
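The stopping rule described above can be sketched as a short loop. This is only an illustration under stated assumptions: `success_rate` is a hypothetical callback that trains and evaluates a classifier on the given features and returns a rate between 0 and 1, the 1 % threshold is read as an absolute increment of 0.01, and the feature whose addition fails to reach the threshold is taken to be excluded from the selection.

```python
from typing import Callable, List, Sequence

def select_until_saturation(ranking: Sequence[int],
                            success_rate: Callable[[List[int]], float],
                            threshold: float = 0.01) -> List[int]:
    """Add ranked features one by one until the success rate saturates."""
    if not ranking:
        return []
    selected = [ranking[0]]
    best = success_rate(selected)
    for feature in ranking[1:]:
        candidate = selected + [feature]
        rate = success_rate(candidate)
        # Stop as soon as the gain from one more feature is below the threshold.
        if rate - best < threshold:
            break
        selected, best = candidate, rate
    return selected

# Toy success rates indexed by the number of features used (made-up values).
toy_rates = {1: 0.70, 2: 0.85, 3: 0.90, 4: 0.905, 5: 0.95}
chosen = select_until_saturation([7, 3, 9, 1, 4],
                                 lambda feats: toy_rates[len(feats)])
```

In the toy run the fourth feature improves the rate by only 0.005, so the loop stops and the first three ranked features are kept, even though a later feature would have produced a larger gain. This greedy behaviour follows directly from the saturation criterion.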
Area Under ROC Curve
The ROC curve is a graphical plot of the true-positive rate vs. the false-positive rate of a binary classifier as its discrimination threshold is varied, the threshold being the value above which a positive class prediction is made (Fawcett 2006). The area under the ROC curve (AUC) is used as a global measure of classifier performance that is invariant to the discrimination threshold and to the class distribution (Bradley 1997). Maximum classification accuracy corresponds to an AUC value of 1, while random guessing corresponds to an AUC value of 0.5. Basically, the ROC feature selection method for binary classification problems consists in calculating a z statistic from the discriminant relevance of each feature \( x_j \), defined as the difference between the AUC of a classifier using all the features (\( \mathrm{AUC}_0 \)) and the AUC of a classifier that does not take into account the effect of feature \( x_j \) (\( \mathrm{AUC}_j \)) (Serrano et al. 2010).
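As a rough illustration of the quantities involved, the following Python sketch computes the AUC from classifier scores using its standard interpretation as the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one (ties count one half). The relevance of a feature is then the difference \( \mathrm{AUC}_0 - \mathrm{AUC}_j \). The function names and the toy scores are hypothetical, and the z statistic of Serrano et al. (2010) is not reproduced here.

```python
import numpy as np

def auc(scores, labels):
    """Area under the ROC curve from scores and binary labels (1 = positive)."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    pos = scores[labels]
    neg = scores[~labels]
    # Compare every positive score with every negative score.
    diff = pos[:, None] - neg[None, :]
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return wins / (pos.size * neg.size)

def auc_relevance(auc_all, auc_without_feature):
    """Relevance of a feature as the AUC lost when that feature is left out."""
    return auc_all - auc_without_feature

# Toy scores: the first classifier separates the classes perfectly,
# the second one ranks one positive below one negative.
perfect = auc([0.9, 0.8, 0.3, 0.2], [1, 1, 0, 0])
imperfect = auc([0.9, 0.4, 0.6, 0.2], [1, 1, 0, 0])
```

A feature whose removal leaves the AUC unchanged has a relevance close to zero under this definition, while a feature whose removal degrades the classifier has a clearly positive relevance.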
Classifier
The classifier used in this comparative study is a multilayer perceptron (MLP) with a single hidden layer, which is a type of ANN (Plaza et al. 2009). An MLP can use a wide range of learning techniques to determine the network parameters, the most commonly used being backpropagation. In these classical learning methods, the parameters of the ANN are usually tuned iteratively, which entails several disadvantages, such as high computational complexity and convergence to local minima (Shih 2010). To avoid these problems, the MLP used in this work was trained using the extreme learning machine (ELM) (Huang et al. 2006), as in Lorente et al. (2011). ELM is a learning algorithm that determines the MLP parameters analytically instead of tuning them iteratively, providing good generalisation performance at an extremely fast learning speed.
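The following Python sketch illustrates the general idea behind extreme learning machine training (Huang et al. 2006); it is not the Matlab implementation used in this work. The hidden-layer weights are drawn at random and kept fixed, and the output weights are obtained in a single step with a pseudoinverse. The class name, the sigmoid activation, the uniform initialisation range, the number of hidden units and the toy data are all assumptions made for illustration.

```python
import numpy as np

class ExtremeLearningMachine:
    """Single-hidden-layer network with random hidden weights (sketch)."""

    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activations of the fixed random hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # One-hot targets, one column per class.
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        self.W = self.rng.uniform(-1.0, 1.0, size=(X.shape[1], self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, size=self.n_hidden)
        H = self._hidden(X)
        # Output weights solved analytically in the least-squares sense.
        self.beta = np.linalg.pinv(H) @ T
        return self

    def predict(self, X):
        H = self._hidden(np.asarray(X, dtype=float))
        return self.classes_[np.argmax(H @ self.beta, axis=1)]

# Toy data: two well-separated two-dimensional clusters (made up).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(-2.0, 0.5, size=(50, 2)),
               rng.normal(2.0, 0.5, size=(50, 2))])
y = np.array([0] * 50 + [1] * 50)
model = ExtremeLearningMachine(n_hidden=20, seed=0).fit(X, y)
accuracy = float(np.mean(model.predict(X) == y))
```

Because the only fitted parameters come from one least-squares solve, training involves no iterative weight updates, which is the property that makes this kind of learning fast compared with backpropagation.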
Approaches to the Problem of Decay Detection
In this work, three different approaches to the problem of decay detection in mandarins are considered, depending on the number of classes involved and the importance of each class (Lorente et al. 2011). Approach I involves the five classes described in the labelled set, all of them having equal importance. Therefore, the weights of all the classes were considered to be equal when obtaining the global relevance.
It is, however, realistic to assume that the classes belonging to decaying skin should be more important for decay detection. Hence, approach II gives more importance to the decay classes (\( w_{PD} = w_{PI} = 15 \)), medium importance to the scar class (\( w_{SC} = 5 \)) and less importance to the sound classes (\( w_{GS} = w_{OS} = 1 \)). Furthermore, since the actual objective of a potential inspection system would be to detect decay, it is also important to study the detection of infected fruit alone, leading to a binary problem: the separation between infected and non-infected fruit (approach III).
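The three approaches can be written down as a small configuration, sketched here in Python. The weights for approach II are those given above; using the value 1 for every class in approach I is an assumption (any common value expresses equal importance), as is the reading that approach III groups PD and PI as infected and the remaining three classes as not infected. All names are hypothetical.

```python
CLASSES = ["GS", "OS", "SC", "PD", "PI"]

# Class weights used when combining the partial rankings.
APPROACH_WEIGHTS = {
    "I": {name: 1 for name in CLASSES},                     # equal importance
    "II": {"GS": 1, "OS": 1, "SC": 5, "PD": 15, "PI": 15},  # decay favoured
}

# Approach III: binary problem, decay classes against everything else.
DECAY_CLASSES = {"PD", "PI"}

def to_binary(labels):
    """Map the five pixel classes onto infected / not infected."""
    return ["infected" if label in DECAY_CLASSES else "not infected"
            for label in labels]

binary_labels = to_binary(["GS", "PD", "SC", "PI", "OS"])
```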
Methodology of Comparison
Two different tests were carried out in order to compare the different selection techniques with the ROC feature selection method. In both tests, the comparison is based on evaluating the performance of the classifier using the different sets of features provided by the methods. The first test (test I) consists in selecting an optimum number of features for each method and for each approach. Therefore, a different number of features that maximises the classification success rate is obtained for each method. The second test (test II) makes the comparison using a fixed number of features for all methods. For this test, we chose the number of features obtained with the ROC method for each approach.
Results and Discussion
The classification obtained using the ROC method is in general better than that obtained with the other methods in all cases except for MRMRd and MRMRq in the third approach. This exception could be expected, since the MRMR criterion is recognised as one of the most powerful techniques for feature selection (Peng et al. 2005; Ponsa and López 2007). The success rate of the ROC approach is similar to that of the other methods tested: the differences are not significant, and therefore we cannot say that our approach is better than the others in terms of decay detection accuracy. It is, however, important to highlight that the best results in both tests are achieved using the ROC method for approaches I and II. This result should be taken into account, because it is probably due to the fact that this method not only evaluates the selected features but also optimises the performance of the classifier. Therefore, although the results are similar, the ROC method can achieve slightly better scores.
Table 1 shows the results of the classifier performance evaluation for test I, using the different sets of features provided by the feature selection methods described above. The accuracy achieved with the ROC method is higher than that obtained with the other methods, except for MRMR in approach III. Minimal redundancy methods try to extract the features with a high degree of relevance while avoiding features that carry redundant information. The ROC method, on the other hand, provides the bands that fit a classifier most robustly in terms of accuracy and significance of the model.
In general, the rest of the methods reach the saturation criterion with fewer bands than those selected by the ROC method. In theory, this means that these methods would need more bands to approach the results of the ROC method. Therefore, test II was used to check the performance of all the methods using the same number of bands as the ROC method: six for the first approach, seven for the second approach and four for the third one. As shown in Table 2, the ROC method provides higher scores than most of the feature selection methods used in this study. As in test I, the only two methods that surpass the ROC method are MRMRd and MRMRq in the third approach. This shows that, even in the most pessimistic scenario for the ROC method (allowing the rest of the methods to use more features), it obtains better results than the others except for the MRMR methods in approach III. Although the differences with the other methods are small, since all of them are good feature selection methods, in approach II, which is probably the most realistic scenario in practice, the ROC method is clearly the one that obtains the best accuracy.
Conclusions
In the first test, the average classification success rate obtained using the ROC method is greater than that obtained with the other methods in almost every case, the exceptions being MRMRd and MRMRq in the third approach. When the same number of features is used for all the methods, the ROC method generally provides better results than most of the feature selection methods used in this comparative study: its average success rate is almost always greater than that of the other methods, and it is surpassed only by the MRMR methods in the third approach.
Therefore, the ROC feature selection method is a suitable technique that can be applied successfully to multiclass classification problems with a huge number of features, such as the segmentation of hyperspectral images to detect decay in citrus fruit. Its results are at least similar to those of other recognised feature selection methods, with the advantage that, by its nature, it optimises the performance of the classifier.
References
Abe, N., Kudo, M., Toyama, J., & Shimbo, M. (2000). A divergence criterion for classifier independent feature selection. Lecture notes in computer science. Advances in Pattern Recognition, 1876, 668–676.
Blasco, J., Aleixos, N., Gómez, J., & Moltó, E. (2007). Citrus sorting by identification of the most common defects using multispectral computer vision. Journal of Food Engineering, 83(3), 384–393.
Bonnlander, B.V., & Weigend, A.S. (1994). Selecting input variables using mutual information and nonparametric density estimation. In: Proceedings of the 1994 International Symposium on Artificial Neural Networks (ISANN’94), Tainan, Taiwan, pp. 42–50.
Bradley, A. P. (1997). The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7), 1145–1159.
Choi, E., & Lee, C. (2003). Feature extraction based on the Bhattacharyya distance. Pattern Recognition, 36(8), 1703–1709.
Cubero, S., Aleixos, N., Moltó, E., Gómez-Sanchis, J., & Blasco, J. (2011). Advances in machine vision applications for automatic inspection and quality evaluation of fruits and vegetables. Food and Bioprocess Technology, 4(4), 487–504.
Eckert, J., & Eaks, I. (1989). Postharvest disorders and diseases of citrus. In W. Reuther, E. C. Calavan, & G. E. Carman (Eds.), The citrus industry. Berkeley: University of California Press.
ElMasry, G., Wang, N., Vigneault, C., Qiao, J., & ElSayed, A. (2008). Early detection of apple bruises on different background colors using hyperspectral imaging. LWT, 41, 337–345.
ElMasry, G., Wang, N., & Vigneault, C. (2009). Detecting chilling injury in Red Delicious apple using hyperspectral imaging and neural networks. Postharvest Biology and Technology, 52, 1–8.
Fawcett, T. (2006). An introduction to ROC analysis. Pattern Recognition Letters, 27(8), 861–874.
Gómez-Sanchis, J., Gómez-Chova, L., Aleixos, N., Camps-Valls, G., Montesinos-Herrero, C., Moltó, E., et al. (2008). Hyperspectral system for early detection of rottenness caused by Penicillium digitatum in mandarins. Journal of Food Engineering, 89(1), 80–86.
Gómez-Sanchis, J., Martín-Guerrero, J. D., Soria-Olivas, E., Martínez-Sober, M., Magdalena-Benedito, R., & Blasco, J. (2012). Detecting rottenness caused by Penicillium in citrus fruits using machine learning techniques. Expert Systems with Applications, 39(1), 780–785.
Huang, G. B., Zhu, Q. Y., & Siew, C. K. (2006). Extreme learning machine: theory and applications. Neurocomputing, 70, 489–501.
Kondo, N., Ahmad, U., Monta, M., & Murase, H. (2000). Machine vision based quality evaluation of Iyokan orange fruit using neural networks. Computers and Electronics in Agriculture, 29, 135–147.
Kullback, S. (1987). The Kullback–Leibler distance. The American Statistician, 41, 340–341.
Kurita, M., Kondo, N., Shimizu, H., Ling, P., Falzea, P. D., Shiigi, T., et al. (2009). A double image acquisition system with visible and UV LEDs for citrus fruit. Journal of Robotics and Mechatronics, 21(4), 533–540.
Li, S., Liao, C., & Kwok, J. (2006). Gene feature extraction using T-test statistics and Kernel partial least squares. Lecture notes in computer science. Neural Information and Processing, 4234, 11–20.
Li, J., Rao, X., & Ying, Y. (2011). Detection of common defects on oranges using hyperspectral reflectance imaging. Computers and Electronics in Agriculture, 78(1), 38–48.
Liu, Y., Chen, Y. R., Wang, C. Y., Chan, D. E., & Kim, M. S. (2005). Development of a simple algorithm for the detection of chilling injury in cucumbers from visible/near-infrared hyperspectral imaging. Applied Spectroscopy, 59(1), 78–85.
López-García, F., Andreu-García, A., Blasco, J., Aleixos, N., & Valiente, J. M. (2010). Automatic detection of skin defects in citrus fruits using a multivariate image analysis approach. Computers and Electronics in Agriculture, 71, 189–197.
Lorente, D., Aleixos, N., Gómez-Sanchis, J., Cubero, S., & Blasco, J. (2011). Selection of optimal wavelength features for decay detection in citrus fruit using the ROC curve and neural networks. Food and Bioprocess Technology. doi:10.1007/s11947-011-0737-x.
Lorente, D., Aleixos, N., Gómez-Sanchis, J., Cubero, S., García-Navarrete, O. L., & Blasco, J. (2012). Recent advances and applications of hyperspectral imaging for fruit and vegetable quality assessment. Food and Bioprocess Technology, 5(4), 1121–1142.
Ouardighi, A., Akadi, A., & Aboutajdine, D. (2007). Feature selection on supervised classification using Wilks lambda statistic. In: International Symposium on Computational Intelligence and Intelligent Informatics ISCIII07, 1, pp. 51–55.
Palou, L., Smilanick, J., Usall, J., & Viñas, I. (2001). Control of postharvest blue and green molds of oranges by hot water, sodium carbonate, and sodium bicarbonate. Plant Disease, 85, 371–376.
Peng, H., Long, F., & Ding, C. (2005). Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(8), 1226–1238.
Plaza, A., Benediktsson, J. A., Boardman, J. W., Brazile, J., Bruzzone, L., Camps-Valls, G., et al. (2009). Recent advances in techniques for hyperspectral image processing. Remote Sensing of Environment, 113(1), S110–S122.
Ponsa, D., & López, A. (2007). Feature selection based on a new formulation of the minimal-redundancy-maximal-relevance criterion. Lecture notes in computer science. Pattern Recognition and Image Analysis, 4477, 47–54.
Qin, J., Burks, T. F., Ritenour, M. A., & Bonn, W. G. (2009). Detection of citrus canker using hyperspectral reflectance imaging with spectral information divergence. Journal of Food Engineering, 93, 183–191.
Qin, J., Burks, T. F., Zhao, X., Niphadkar, N., & Ritenour, M. A. (2012). Development of a two-band spectral imaging system for real-time citrus canker detection. Journal of Food Engineering, 108(1), 87–93.
Rifkin, R., & Klautau, A. (2004). In defense of one-vs-all classification. Journal of Machine Learning Research, 5, 101–141.
Rodgers, J. L., & Nicewander, A. W. (1988). Thirteen ways to look at the correlation coefficient. The American Statistician, 42(1), 59–66.
Serrano, A. J., Soria, E., Martín, J. D., Magdalena, R., & Gómez, J. (2010). Feature selection using ROC curves on classification problems. In: International Joint Conference on Neural Networks, IJCNN 2010. 28th–30th July 2010. Barcelona, Spain. Proceedings, pp. 1980–1985.
Shih, F. Y. (2010). Image processing and pattern recognition: fundamentals and techniques. New York: Wiley-IEEE.
Slaughter, D. C., Obenland, D. M., Thompson, J. F., Arpaia, M. L., & Margosan, D. A. (2008). Non-destructive freeze damage detection in oranges using machine vision and ultraviolet fluorescence. Postharvest Biology and Technology, 48, 341–346.
Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S (4th ed.). New York: Springer.
Xing, J., Bravo, C., Jancsók, P. T., Ramon, H., & De Baerdemaeker, J. (2005). Detecting bruises on ‘Golden Delicious’ apples using hyperspectral imaging with multiple wavebands. Biosystems Engineering, 90(1), 27–36.
Zude, M. (Ed.). (2008). Optical monitoring of fresh and processed agricultural crops. Boca Raton: CRC.
Acknowledgments
This work has been partially funded by the Universitat de València through project UV-INV-AE11-41271, by the Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria de España (INIA) through research project RTA2009-00118-C02-01 and by the Ministerio de Ciencia e Innovación de España (MICINN) through research project DPI2010-19457, both projects with the support of European FEDER funds.
Cite this article
Lorente, D., Blasco, J., Serrano, A.J. et al. Comparison of ROC Feature Selection Method for the Detection of Decay in Citrus Fruit Using Hyperspectral Images. Food Bioprocess Technol 6, 3613–3619 (2013). https://doi.org/10.1007/s11947-012-0951-1
DOI: https://doi.org/10.1007/s11947-012-0951-1