Keywords

1 Introduction

The classification of pollen particles has been widely applied for allergic pollen index forecast, drug research, paleoclimatic reconstruction, criminal investigation, oil exploration and some other fields [1]. The traditional identification of pollen grains is mainly done by artificial inspection under microscopy, which requires the operator to have a rich knowledge of pollen morphology and needs a high level of training to get accurate recognition results. The commonly used discriminate criteria is the visual biological pollen grain morphological appearance, such as shape, polarity, aperture, size, exine stratification and thickness, and so on [2]. It takes operator much of time and effort to observe the appearance of pollen grains, and often causes misrecognition.

With the development of image processing and pattern recognition [3,4,5], using computer to extract and classify pollen features has become an effective way for pollen recognition. The early pollen recognition algorithms focused on extracting shape features, in which the contour shape is a prominent feature for some pollen grains with slender oval shape or rounded triangular shape. However, most pollen grains always have similar contour shapes, so it is difficult to identify different categories of pollen images only by shape features. Considering that pollen images from different categories have large differences in texture, more and more texture based feature extraction methods have been proposed for automatic classification of pollen images. For example, Punyasena et al. [6] extracted the texture and shape features of pollen images using dictionary learning and sparse coding (DLSC), which obtained a recognition rate of 86.13%, however, the recognition performance largely depends on the selection and quantity of sample blocks. Daood et al. [7] decomposed the pollen image into multiple feature layers using clustering, then the texture and geometric features (TGF) of each layer were extracted using LBP and fractal dimension respectively. Finally, the SVM classifier was used to classify pollen images and a recognition rate of 86.94% was obtained. Whereas, the method has little robustness to the rotation of pollen grains, and the decomposition of pollen images increases the dimension of features. Boochs et al. [8] proposed a pollen recognition method combining shape, texture and aperture features (STAF), which extracted 18 shape features, 5 texture features (Gabor Filters, Fast Fourier Transform, Local Binary Pattern, Histogram of Oriented Gradients, and Haralick features) and a surface aperture features of pollen images. The method used a random forest classifier to identify pollen images, and obtained nearly 87% recognition rate. Guru et al. [9] proposed a pollen classification model based on surface texture, which combined local binary pattern (LBP), Gabor wavelet, gray-level difference matrix (GLDM), and gray-level co-occurrence matrix (GLCM) for pollen recognition (LGGG), and obtained 91.66% recognition rate. However, the computation cost of these two methods is large due to high dimension of the combined features, which makes them unpractical for real application. Marcos et al. [10] extracted texture features using Log-Gabor filter (LGF), discrete Tchebichef moments (DTM), local binary patterns (LBP) and gray-level co-occurrence matrix (GLCM), which obtained a recognition rate of 94.83%, whereas, the fused texture feature (LDLG) contains large amounts of redundant information and the computational process is complex.

Local binary pattern is an effective method for representing texture feature, which has been widely used in face recognition and texture classification [11]. The traditional local binary pattern and its variants usually use wide quantization intervals to quantize the neighboring pixels, which enhances the descriptor’s robustness to the illumination changes of images, but also loses some detailed textural information at the same time. Unlike the general texture images, the textural variation range of pollen images is relatively small, so it’s difficult to capture the subtle textural differences of pollen images from different categories in wide quantization intervals. In order to solve the problem, the local decimal pattern (LDP) was proposed. The advantages of our method are as follows: Quantizing using the gradient magnitude of pixel blocks instead of single pixel value to eliminate the effects of image noise. Encoding by referring to the number of pixel blocks in each quantization interval making the descriptor invariant to the rotation of pollen grains. The combination of LDP features in multiple directions increases the descriptor’s discrimination. Experimental results on Pollenmonitor dataset show that the recognition rate and computation speed of our method is higher than that of most pollen recognition methods.

2 Local Decimal Pattern (LDP)

Most of the current methods for extracting pollen features are those combining different single features: LBP and fractal dimension as in [7], and LBP, GLCM, LGF and DTM as in [10], and LBP, Gabor, etc. as in [8, 9]. All of these take advantages of different features to construct the optimal representation of pollen images, but the use of multiple features leads to a higher computational costs.

In order to build pollen feature descriptor with high computational efficiency, and high robustness to rotation and noise, we proposed Local Decimal Pattern (LDP). Figure 1 shows the implementation of LDP feature for representing pollen images, and Fig. 2 presents the step of the algorithm based on LDP for pollen recognition. The specific calculation process of LDP is as follows:

Fig. 1.
figure 1

Implementation of LDP feature for representing pollen.

Fig. 2.
figure 2

The step of the algorithm based on LDP for pollen recognition.

First, we calculate the image gradient, the gradient information of each pixel includes gradient magnitude and gradient angle. The gradient angle range from \( - \pi \) to \( \pi \), and we divide \( \left[ { - \pi ,\pi } \right) \) into 8 equal-sized direction intervals. Then, a histogram of gradient is calculated by weighting all pixels’ gradient magnitude into corresponding gradient directions, and the directions with maximum, minimum and median gradient are marked as \( D_{1} \), \( D_{2} \) and \( D_{3} \) respectively (as shown in Fig. 3). The gradient magnitude of pixel blocks under different gradient directions is calculated as follows:

Fig. 3.
figure 3

Calculation of gradient histogram of an image block. The lengths and directions of arrows represent the gradient magnitude and gradient direction of pixels respectively.

$$ B_{m}^{D} = \sum\limits_{k = 1}^{{m^{2} }} {\left( {m_{{P_{K} }} \times E^{D} \left( {\theta_{{P_{k} }} } \right)} \right)} $$
(1)
$$ E^{D} \left( {\theta_{{P_{k} }} } \right) = \left\{ {\begin{array}{*{20}c} 1 & {\theta_{{P_{k} }} \in D} \\ 0 & {\theta_{{P_{k} }} \notin D} \\ \end{array} } \right. $$
(2)

Where: \( D \) is the gradient direction; m is the pixel block size; \( m_{{P_{K} }} \) and \( \theta_{{P_{K} }} \) are the gradient magnitude and gradient angle of the pixel \( P_{K} \).

Second, the number of pixel blocks in \( i \)th quantization interval under gradient direction \( D \) is counted as follows:

$$ N_{i}^{D} = \sum\limits_{j = 1}^{n} {S_{i} \left( {B_{r,n,m,j}^{D} - B_{m,c}^{D} } \right)} $$
(3)
$$ S_{i} (x) = \left\{ {\begin{array}{*{20}c} 1 & {\left| x \right| \in Q_{i} } \\ 0 & {\left| x \right| \notin Q_{i} } \\ \end{array} } \right. $$
(4)
$$ Q_{i} = \left[ {l_{i} ,l_{i + 1} } \right) $$
(5)

Where: \( n \) is the number of neighboring pixel blocks; \( B_{r,n,m,j}^{D} \) is the gradient magnitude of the \( m \times m \) pixel block in the square neighborhood with sampling radius \( r \); \( j \) is the serial number of pixel blocks; \( B_{m,c}^{D} \) is the gradient magnitude of the central pixel block under gradient direction \( D \); \( Q_{i} \) is the \( i \)th quantization interval; \( l_{i} \) is the threshold of \( Q_{i} \).

After counting the number of neighboring pixel blocks located at different quantization intervals, we can define the Local Decimal Pattern (LDP) as follows:

$$ LDP^{D} = \sum\limits_{i = 1}^{L} {N_{i}^{D} \times 10^{i - 1} } $$
(6)

Where \( L \) is the total number of quantization intervals.

At last, we calculate the LDP feature histograms under three gradient directions, and the final representation of pollen images is the concatenation of these LDP histograms:

$$ LDPH = \left\{ {LDPH^{{D_{1} }} ,LDPH^{{D_{2} }} ,LDPH^{{D_{3} }} } \right\} $$
(7)

Figure 4 shows the calculation process of LDP of an image block under direction \( D_{1} \), the color of the square in figure represents the gradient magnitude difference between the neighboring pixel blocks and the central pixel block under the gradient direction \( D_{1} \), and the same color indicates that the difference of gradient magnitude belongs to the same quantization interval. In Fig. 4, the gradient magnitude of pixel blocks under gradient direction \( D_{1} \) are quantized into 4 intervals, and the number of pixel blocks under 4 quantization intervals is counted as 4, 2, 1 and 1, respectively. So we can get a local decimal pattern 1124.

Fig. 4.
figure 4

Calculation of LDP of an image block under direction \( D_{1} \).

3 Pollen Recognition Experiments

To evaluate our method, we performed experiments on pollenmonitor dataset with a computer of Intel(R) Core(TM) i5-3210 M @ 2.50 GHz processor and 6 GB memory, and the software we used is MATLAB R2014a. We randomly selected 60% of the pollen images of each category on pollenmonitor dataset as training images and the rest were used as test images. A SVM classifier [12, 13] was used for the classification and recognition of pollen images, and the correct recognition rate (CRR), recall rate (RR), F1-measure and recognition time (RT) were used to measure the experimental performance, where, F1-measure is the harmonic average of CRR and RR.

3.1 Parameter Selection

  1. (1)

    Neighbor number, sampling radius and block size:

We use a sampling strategy with fixed number of neighboring pixel blocks (\( n = 8 \)), and different block size and sampling radius \( \text{(}m = \left\{ {2, \, 3, \, 4, \, 5, \, 6, \, 7, \, 8, \, 9, \, 10} \right\}, \) \( r = \left\{ {2, \, 3, \, 4, \, 5, \, 6, \, 7, \, 8, \, 9, \, 10} \right\}\text{)} \).

  1. (2)

    Quantization interval number:

We performed experiments with different number of quantization intervals and find that 2 quantization intervals is not enough to represent pollen texture feature, but too many (more than 4) leads to a higher dimension of LDP histogram. When the number of quantization intervals is 3, 4, the corresponding dimensions of LDP histogram are \( 8 \times 10^{2} \) and \( 8 \times 10^{3} \), respectively. In fact, many decimal patterns do not exist, resulting in large columns of LDP histogram are empty. That’s because the total number of quantized pixel blocks in the neighborhood is fixed (\( n = 8 \)). Take 3 quantization intervals for instance, if the number of pixel blocks located at first quantization interval is 7, the decimal pattern only can be 107 or 017, and other patterns such as 117, 127, etc. can never appear. So, we delete the nonexistent decimal patterns from the LDP histogram, and the dimension of LDP histogram is 45, 165 when the quantization interval is 3, 4, respectively.

  1. (3)

    Quantization thresholds:

The quantization thresholds with \( L = 3, \, 4 \), are presented in Table 1, which depends on pixel block size (m).

Table 1. The quantization thresholds of different quantization levels

3.2 Experimental Results on Pollenmonitor Dataset

The Pollenmonitor dataset comprises air pollen samples from 33 different taxa collected in Freiburg and Zurich in 2006. The number of pollen images in this dataset is about 22700. Affected by the micro-sensors and irregular collection methods, some pollen images have some degrees of deformation and contamination, and the image quality is generally not high.

By varying the pixel block size and sampling radius from 2 to 10, we get the correct recognition rates as presented in Fig. 5. Obviously, 4 quantization intervals (\( L = 4 \)) performs better, and the best recognition rate was obtained with the block size 5.

Fig. 5.
figure 5

Recognition results (%) on Pollenmonitor dataset with different block size and quantization intervals.

Figure 6 presents the partial recognition instances of 6 representative pollen categories on Pollenmonitor dataset. It can be seen that most pollen images with clear texture and have not been contaminated and deformed can be correctly identified. The specific recognition results are shown in Table 2, we can find that the correct recognition rates of most pollen categories are more than 90%, and the recall rates of all categories are more than 73%. For Corylus category with varying degrees of rotation, our method achieved 94.02% correct recognition rate. For Fagus category with severe noise, our method can also obtained 83.18% correct recognition rate.

Fig. 6.
figure 6

Recognition instances of 6 classic pollen taxa from the Pollenmonitor dataset.

Table 2. Recognition results of 6 classic pollen taxa in Pollenmonitor dataset

3.3 Experimental Comparison and Analysis

We compared the best recognition rates achieved by our method using different block size with state-of-the-art pollen recognition methods, the experimental results on Pollenmonitor datasets are listed in Table 3. The average correct recognition rate of our method on Pollenmonitor datasets is 90.95%, which is on average 6.81 percentage points higher than that of compared pollen recognition methods. The experimental results show that our proposed method has a better recognition performance and the computational efficiency is higher than most of the compared methods.

Table 3. Comparison of the average recognition results of our method and 5 pollen recognition methods on Pollenmonitor dataset

4 Conclusions

In this paper, we presented a LDP descriptor for pollen image recognition. Unlike most pollen recognition methods fusing different kinds of features in recent years, our method extracts single texture feature in three directions, which decreases the dimensionality of pollen features and increases the discrimination at the same time. Experimental results show that our method outperforms 5 compared pollen recognition methods in extracting pollen texture feature, and has robustness to the noise and rotation of pollen images.