Abstract
The most common asphalt pavement surface distress is cracking, manifested in various forms, such as transverse, longitudinal, and reflective cracks, governed by different initiation and propagation mechanisms. Being able to timely detect and classify different types of cracks provides critical information for properly maintaining and managing our invaluable road assets. In this paper, a series of classic image processing techniques were applied to pavement surface images to delineate crack patterns. The processed images were projected to a low-dimensional feature space through principal component analysis (PCA). A K-means algorithm is then applied to cluster images in the low-dimensional feature space. The results revealed a meaningful correlation between the crack patterns and the clusters derived.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Accurate pavement condition assessment of a road network is vital for managing maintenance and rehabilitation (M&R) efforts to ensure its continued functionality. Early detection and repair of pavement defects can prevent further degradation and reduce the overall road M&R expenditure. As such, efficient and timely road inspection is one of the key elements of a successful pavement management system. Nevertheless, periodic road condition surveys tend to be costly and time-consuming if being carried out in the conventional fashion, i.e., by human visual inspection.
Good road conditions are important in supporting the proper functions of a country’s economy, and many leading civil engineering organizations across the world, such as the American Society of Civil Engineers (ASCE), continue to emphasize the importance of maintaining the good conditions of the roads and other civil infrastructures. However, the existing situation is not optimistic. According to [1], approximately half of the major roads in the U.S. are currently considered as in poor conditions.
Pavement surface condition assessment has historically been conducted manually by state and/or local transportation agencies, which is still the predominant practice in most developing countries. This resource-demanding and time-consuming process has motivated the development of various automatic road condition assessment techniques and systems. With the growing interest in automating the pavement condition survey and rapid advancement in computer vision and deep learning, the past decade has witnessed an emerging popularity of automatic image-based road surface distress assessment systems. However, many existing automatic image-based systems still suffer from variation in environmental conditions, such as lighting condition, shadow cast by nearby objects, rainy weather, etc., and reliability issues associated with underlying image-based cracks detection algorithms, ranging from simple pixel level filtering to complex deep convolutional neural networks [2].
Automating the detection and classification of pavement surface cracks has been a challenging task due to multifaceted crack patterns arising from a multitude of factors underlying different crack initiation and propagation mechanisms, such as traffic loading spectrums, weather-induced material properties, pavement design, etc. Many local agencies still rely on the traditional visual inspection method for assessment of their local road networks.
In this study, we aim to leverage the availability of pavement surface images collected as part of the pavement survey process and apply unsupervised learning techniques (e.g., PCA and cluster analysis) to cluster similar crack types/patterns. Being able to identify different classes of cracks in an unsupervised fashion helps to understand the local dominant crack types and potential contributing factors, and to infer proper treatment strategies. For conceptual proof of this idea, the surface images collected by a survey vehicle from three local roads in Athens, Georgia were utilized.
2 Literature Review
Automated detection of distresses from pavement images is a challenging problem that has been studied by the computer vision research community for more than three decades. The challenges associated with 2D pavement image-based method include variations in image sources (digital cameras, smartphones, unmanned aerial vehicles, etc.), non-uniformity of cracks, surface texture, lack of sufficient background illumination, and the presence of other features such as joints, among others.
Traditional image analysis approaches rely on hand-crafted features. For example, a novel local binary pattern (LBP) based operator for pavement crack detection was introduced in [3] and a crack detection method using Gabor filter was presented in [4]. An automatic crack detection method referred to as CrackTree is developed in [5]. An integrated model for crack detection and characterization is presented in [6] and a variety of image processing algorithms aimed at detecting and characterizing road surface crack distresses are described in [7].
The recent advancement in machine learning (ML), especially in deep learning (DL) with vision tasks (e.g., object detection and classification), has reignited the research in image-based pavement distress detection and classification. In a recent study, Hoang et al. [8] developed a multiclass support vector machine model optimized by artificial bee colony algorithm to achieve an over 96% overall accuracy in classifying longitudinal crack, transverse crack, alligator crack, and no crack. A review on deep learning applications in pavement image analysis and distress detection was conducted by [9]. Specifically, Zhang et al. [10] developed a crack detection method where the discriminative features are learned from raw image patches using ConvNets. Likewise, [11] proposed a deep architecture of CNNs for detecting concrete cracks. More recently, Maeda et al. [12] evaluated Single Shot MultiBox Detector (SSD) with Inception V2 and MobileNet architectures, for detection eight different road damage types.
However, most previous studies on crack detection and classification relied on hand-crafted features or were conducted in supervised learning settings. In contrast, our study focuses on clustering pavement surface images that are congruous with the underlying pavement distress classes in an unsupervised fashion through cluster analysis in a low-dimensional feature space of processed images.
3 Data Set
Pavement surface images (800-by-600) were acquired through a dash camera mounted on a survey vehicle. The dataset used in our experiment consists of 1,125 pavement surface images collected from three local roads in Athens, GA, USA.
4 Analysis
Given the complexity of asphalt pavement images, which are typically endowed with various patterns of micro-texture, nonuniform color arising from the mixture of pavement materials, pavement markings, and shadows casted by nearby utility poles, vegetation, etc., preprocessing of the images is usually required to remove noises and highlight cracks. In this study, a series of image processing techniques, including logarithmic transformation, bilateral filters, Canny edge detector [13], and morphological operations, were applied, resulting in “cleaner” images with more distinct crack patterns. The resultant images were further processed by Principal Component Analysis (PCA) to reduce dimensionality of the feature space, followed by a K mean algorithm [14, 15] to cluster the images in the low-dimensional feature space.
4.1 Image Processing
The interference of the image taking with the dynamic environment often induces unwanted noises. Therefore, it is desirable to apply image preprocessing operations first to remove environmental interference as much as possible. In this study, the open-source computer vision library, OpenCV [16] was utilized, which implemented many popular and commonly used image processing and computer vision algorithms. In our experiment, the images were processed in several steps as described below.
First, the originally captured RGB images were resized from 800-by-600 to 100-by-100 to reduce unnecessary resolution as well as computational footprint and converted to grayscale images since colors do not contribute much to crack features (Fig. 1a). Then, the logarithmic transformation was applied, resulting in the range of darker pixels being expanded while brighter pixels being compressed (Fig. 1b).
Random noise is one of the main problems which may affect the results. Due to pavement physical characteristics, the naturally captured images are more or less affected by random noises (e.g., induced by uneven micro-texture) and/or uniform noise (e.g., a result of shadow), which need to be removed. A bilateral filter was applied to smooth images without losing edge precision. Figure 1c shows the result of the same image after applying the bilateral filtering. Finally, a canny edge detector was applied, followed by morphological operations (erosion and dilation) to highlight crack tracks (Fig. 1d).
4.2 Dimension Reduction
With the previously processed images, principal component analysis (PCA) was applied for dimension reduction and feature extraction. In PCA, the high-dimensional features are linearly projected to a low-dimensional space without losing much information. In other words, PCA aims to find a linear subspace of the original feature space that captures the most data variance. In PCA, the principal components (PCs) are created in a descending order of the amount of data variation and all PCs are orthogonal to one another. The eigenvectors of covariance matrix reveal the PC directions while the eigenvalues provide the relative magnitudes of variance explained by corresponding PCs. A limitation of PCA lies in its linearity, which can be addressed by extending the linear kernel to nonlinear kernels through the kernel trick, referred to as kernel PCA [17]. After applying PCA to our dataset, the cumulative variance explained was plotted against the number of components, as shown in Fig. 2. The top 700 PCs, which explained over 80% of the total variance, were retained for cluster analysis.
4.3 Cluster Analysis
A variety of cluster analysis methods have been developed with K-means (Lloyd 1957; MacQueen 1967) being the most popular one that is flexible and works well in many settings. In K-means, the number of clusters, K, needs to be prespecified. The elbow method has been typically used to determine the K value that corresponds to the point of the maximum curvature in the inertia plot. Given a dataset, the goal of K-means is to find the K clusters with the minimal within-cluster variation. Using the Euclidean distance as the proximity measure, K-means algorithm can be expressed as an optimization problem in Eq. 1.
where, \(x_{ij}\) = the jth feature of observation i; p = the number of features; \(\left| {C_{k} } \right|\) = the number of observations in the kth cluster.
The within-cluster sum-of-squares in Eq. 1 is often referred to as inertia. Applying K-means to our dataset in the reduced feature space, the resulting inertia is plotted against the number of clusters (K) in Fig. 3. Per the elbow method, three clusters (K = 3) were chosen.
For visualization purposes, the cluster results are plotted in the 2D plane formed by the first two principal components (see Fig. 4).
The clusters are indicated by colors in Fig. 4. Cluster 0 (blue) represents multi-directional cracks, Cluster 1 (orange) coincides with longitudinal crack, Cluster 2 (green) contains mainly images without cracks. Exemplar images from each cluster are shown in Fig. 5.
5 Conclusions
As a popular nonintrusive technology, research in image-based pavement condition assessment has recently regained momentum by leveraging modern machine learning methods and techniques. This paper presents preliminary results from an ongoing study, demonstrating the use of simple unsupervised machine-learning methods (i.e., PCA for feature extraction and K-means for cluster analysis) for clustering pavement surface distresses. The results revealed a meaningful correlation between derived clusters and crack types. The simple approach could be used to help local jurisdictions understand their predominant pavement distress types and plan for M&R activities. The cluster outcome can also be used as a pre-step to assist with image annotation for developing supervised classification models. Given the limitation of PCA in its linear projection, further studies should consider nonlinearity in feature extraction, such as through kernel PCA, variational autoencoders, etc.
Besides the pavement surface images, modern pavement survey practices also tend to capture subsurface profiles using the ground penetrating radar (GPR) technology. Surface images emphasize functional failure while subsurface GPR scan images can reveal structural failure. For effective diagnosis of pavement structure, surface images and GPR scan images can be jointly used to enrich the feature space through feature engineering techniques to properly capture the complementary aspects of pavement structure.
References
AASHTO: Rough Roads Ahead: Fix Them Now or Pay for it Later. Joint Publication of the American Association of State Highway and Transportation Officials and National Transportation Research Group, Washington, DC (2009)
Riid, A.; Lõuk, R.; Pihlak, R.; Tepljakov, A.; Vassiljeva, K.: Pavement distress detection with deep learning using the orthoframes acquired by a mobile mapping system. Appl. Sci. 9, 4829 (2019)
Hu, Y., Zhao, C.X.: A novel LBP based methods for pavement crack detection. J. Pattern Recogn. Res. 5(1), 140–147 (2010)
Salman, M., Mathavan, S., Kamal, K., Rahman, M.: Pavement crack detection using the gabor filter. In: Proceedings of IEEE International Conference on Intelligent Transportation Systems, pp. 2039–2044 (2013)
Zou, Q., Cao, Y., Li, Q., Mao, Q., Wang, S.: CrackTree: automatic crack detection from pavement images. Pattern Recogn. Lett. 33(3), 227–238 (2012)
Oliveira, H., Correia, P.L.: Automatic road crack detection and characterization. IEEE Trans. Intell. Transp. Syst. 14(1), 155–168 (2013)
Oliveira, H., Correia, P.L.: Crackit-an image processing toolbox for crack detection and characterization. In: Proceedings of IEEE International Conference on Image Processing, pp. 798–802 (2014)
Hoang, N.-D., Nguyen, Q.-L., Bui, D.T.: Image processing-based classification of asphalt pavement cracks using support vector machine optimized by artificial bee colony. J. Comput. Civ. Eng. 32, 04018037 (2018)
Gopalakrishnan, K.: Deep Learning in data-driven pavement image analysis and automated distress detection: a review. MDPI J. Data 3, 28 (2018)
Zhang, L., Yang, F., Zhang, D., Zhu, Y.J.: Road crack detection using deep convolutional neural network. In: Image Processing (ICIP), IEEE International Conference, pp. 3708–3712 (2016)
Cha, Y.J., Choi, W., Büyüköztürk, O.: Deep learning-based crack damage detection using convolutional neural networks. Comput. Aided Civ. Infrastructure Eng. 32(5), 361–378 (2017)
Maeda, H., Sekimoto, Y., Seto, T., Kashiyama, T., Omata, H.: Road damage detection and classification using deep neural networks with smartphone images. Comput.‐Aided Civ. Infrastructure Eng. 33, 1127 (2018)
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Lloyd, S.P.: Least squares quantization in PCM. Technical Report RR-5497, Bell Lab, September 1957. Google Scholar
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Le Cam, L.M., Neyman, J. (eds.) Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, California
OpenCV: Open Source Computer Vision Library (2021)
Schölkopf, B., Smola, A., Müller, K.-R.: Kernel principal component analysis. In: Gerstner, W., Germond, A., Hasler, M., Nicoud, J.-D. (eds.) ICANN 1997. LNCS, vol. 1327, pp. 583–588. Springer, Heidelberg (1997). https://doi.org/10.1007/BFb0020217
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Abdelmawla, A., Yang, J.J., Kim, S.S. (2021). Unsupervised Learning of Pavement Distresses from Surface Images. In: Liu, Y., Cuomo, S., Yang, J. (eds) Advances in Innovative Geotechnical Engineering. GeoChina 2021. Sustainable Civil Infrastructures. Springer, Cham. https://doi.org/10.1007/978-3-030-80316-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-80316-2_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-80315-5
Online ISBN: 978-3-030-80316-2
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)