A Review of the Application of CNN-Based Computer Vision in Civil Infrastructure Maintenance

Cai, Ruying; Li, Jingru; Li, Geng; Tang, Dongdong; Tan, Yi

doi:10.1007/978-981-16-3587-8_42

Ruying Cai⁵,
Jingru Li⁵,
Geng Li⁵,
Dongdong Tang⁵ &
…
Yi Tan⁵

Included in the following conference series:

International Symposium on Advancement of Construction Management and Real Estate

1920 Accesses
2 Citations

Abstract

Computer-vision and deep-learning techniques are being increasingly applied to the maintenance of civil infrastructure, such as inspecting, monitoring, and assessing infrastructure conditions, which overcome time-consuming and laborious compared with traditional technology. In this paper, the research progress of deep learning, the developments of convolutional neural network (CNN)-based computer vision in improving accuracy, reliability and generalized object detection capability and its application in civil infrastructure maintenance are reviewed. The main objectives are as follows: (1) clarify the application of deep learning in computer vision to help researchers systematically understand deep learning; (2) review the application of computer vision in civil infrastructure maintenance to help researchers pay more attention to its advantages; (3) encourage relevant personnel to use this research as a reference, take deep learning as an important method at the forefront of engineering management, generate more innovations in the construction field, and promote the development of the construction industry.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Applications of Deep Learning in Intelligent Construction

A review of the research and application of deep learning-based computer vision in structural damage detection

Article 20 January 2022

Structural Damage Detection using Deep Convolutional Neural Network and Transfer Learning

Article 03 September 2019

Keywords

1 Introduction

Civil infrastructure can provide good services to the citizens as the operation and management activities. If maintenance is not carried out in time, it will not only cause potential hazards and hidden dangers to the civil infrastructure and its ancillary facilities, but also threaten citizen lives. Therefore, it’s essential for real-time monitoring the condition of the infrastructure, so that necessary repairs and maintenance work can be carried out proactively and timely before it becomes too dangerous and expensive. Conventional manual monitoring is extensively time-consuming, laborious, expensive and has healthy and safety problems, particularly for the aerial working environment where detection is difficult to conduct [1].

In the past years, deep learning techniques, especially convolutional neural network (CNN), have been shown to outperform previous state-of-the-art machine learning techniques in several fields, with computer vision being one of the most prominent cases [2]. Computer vision is changing processes of the construction management as it enables the automatic acquisition, processing, analysis of digital images, and the extraction of high-dimensional data from the real world to produce useful information to improve managerial decision-making [3].

Deep learning has obtained promising performance in various computer vision tasks such as image classification [4], object detection [5] and object segmentation [6]. These three tasks are not only related to each other, but also progressive. The connection is that they are all based on the basic idea of CNN. The progressive relationship increases difficulties of three tasks. Both object detection and segmentation use some basic network models from image classification. The CNN-based image classification algorithm provides many new ideas for object detection and segmentation, and has achieved good results. This paper will briefly describe these three tasks and make a general comparison. From the beginning, these tasks have been applied to the industrial field, until now, they have been applied to many other fields and made great achievements, and have great application prospects in civil infrastructure maintenance. The application of deep learning based on CNN in the automatic detection and location of defects in civil infrastructures [7], such as bridges [8], roads [9] and sewage pipes [6], can solve these problems.

The remainder of this paper is organized as follows. Figure 1 is the structure of this review work. In Sect. 2, the research progress of deep learning, including the structure of deep learning, is described. Section 3 described the use of deep learning methods to address key tasks in computer vision, such as image classification, object detection, and object segmentation. In Sect. 4, the application of deep learning-based computer vision in civil infrastructure maintenance are reviewed.

2 Research Progress of Deep Learning

2.1 Important Milestones of Deep Learning

Deep learning methods usually addresses rich and complex data from different sources, and they have performed better than previous technologies in multiple tasks, attracting increasing attention. Where does it start? How does it determine whether a particular deep learning model is suitable for their problem? How to train and deploy them? With these questions, the important milestones leading up to the era of deep learning [2] are firstly summarized in Table 1. The MCP model and Neocognitron are the beginning of the artificial neural networks (ANN) and CNN, respectively. However, AlexNet [10] won the ImageNet contest in 2012 with an absolute advantage of 10.9 percentage points over the second place. Since then, deep learning and convolutional neural networks rose to prominence with AlexNet. An overview of deep learning structure based on CNN is presented next.

Table 1 Important milestones

Full size table

2.2 Deep Learning Structure Based on CNN

CNNs were inspired by the visual system’s structure, in particular by its proposed models [11]. A CNN consists of three main types of layers, namely, convolutional layers, pooling layers and fully connected layers. Each type of layers has a different task. Figure 2 shows a general CNN architecture for an image classification task. In addition, CNN also has activation function, Batch Normalization and Regularization.

(i)
Convolutional layers

In the convolutional layers, various kernels are used to convolve the input data to generate feature maps. The convolution operation is to cover the entire image step by step with the convolution kernel according to the step size, and the value of the filter is multiplied by the pixel value of the corresponding position of the image and then summed. The value obtained is the value of the target pixel in the output image.
(ii)
Pooling layers

The pooling layer reduces the spatial size (width × height) of the input volume of the next convolutional layer through maximum pooling or average pooling, but does not affect its depth. This operation can reduce the number of parameters in the network, reduce the consumption of computing resources, and can also effectively control overfitting. The operation process of the pooling layer is to first slide the input data through the spatial window, and select the maximum or average value as the output result, and then continue to slide the window until the entire input data is covered, and finally the output results of each sliding are in order arrange to obtain the final complete output data. In the whole process, reduce the spatial size of the input data. The size of the sliding window and the sliding step size will affect the output data, so it is necessary to use the appropriate size and step size for the accuracy of the results.
(iii)
Fully connected layers

Following several convolutional and pooling layers, the high-level reasoning in the neural network is performed via fully connected layers. Fully connected layers play the role of classifier in the entire CNN.
(iv)
Activation function

The emergence of the activation function Rectified Linear Units (ReLU) solves the problem that sigmoid and tanh are prone to disappearing gradients, which is currently the most commonly used activation function. Generally, the activation function is used after each convolution.
(v)
Batch Normalization and Regularization

Batch Normalization is to force the distribution of the input value back to a standard normal distribution with a mean of 0 and a variance of 1, to avoid the problem of vanishing gradients. Dropout is a convenient but powerful regularization method, which randomly deletes some nodes in each iteration, and only train the remaining nodes to suppress overfitting.

2.3 The Relationship Between Machine Learning, Deep Learning, CNN, Computer Vision and Civil Infrastructure

Understanding the relationship between machine learning, deep learning, CNN, computer vision and civil infrastructure can help researchers understand this paper. For machine learning, the way to solve the problem is to find out the mapping relationship between X and Y through the model, among which the available models are logistic regression, linear regression, support vector machine (SVM) and others. While, using the type of neural network model is called deep learning, which including convolutional neural networks. The application of convolutional neural network to computer vision mainly has three major tasks, including Image classification, Object detection, Object segmentation. Then, these three tasks are applied to civil infrastructure, as can be seen from Fig. 3.

3 Application of CNN in Computer Vision

Deep learning has been widely adopted in various directions of computer vision, such as image classification, object detection and segmentation, which are key tasks for image understanding. The differences among the three tasks can be seen intuitively from Fig. 4, taking crack images of sewage pipes [12] as an example. In this part, the developments of deep learning in above-mentioned three tasks, especially the CNN- based algorithms, will be briefly summarized.

3.1 Image Classification

The image classification task means that image is labeled with a probability of the presence of a particular visual object class [13], which is the simplest and most basic image understanding task. The task of the deep learning model is to achieve the first breakthrough and realize large-scale application.

In general, CNN is the most advanced compared to classical algorithms [14]. Through the continuous research and improvement of its structure, a series of network models have been formed and successfully applied in a wide range of practical applications, such as AlexNet, VGGNet [15], GoogleNet [16] and ResNet [17] as shown in Table 2. It can be seen from the table that more and more optimizations are applied to network design, such as Dropout, Local response normalization (LRN), and Batch normalization. The state-of-the-art results of the top-5 error rate tested by ImageNet since 2012 are also presented in Fig. 5. The model CNN-based is also used in the cracks of civil infrastructures, for example, Zhou and Song developed Deep CNN structures with different layouts for fracture classification based on laser scanning range images [4], Wang et al. proposed a CNN-based damage classification technology for deep buildings targeting masonry historical structures [18].

Table 2 Structure of typical convolutional neural networks models

Full size table

3.2 Object Detection

Image classification is the basis of computer vision, but only classification is not enough. Object detection is different from but closely related to the image classification task. Object recognition and segmentation are more difficult but meaningful. The classification task is only concerned with classification, while the detection task is not only focus on classification, but also required to obtain the location of the detected object.

Object detection research has been conducted for many years, and there are many methods that have been widely recognized and applied in the industry. Several typical detection models and their feature are introduced in Table 3. Object detection is usually divided into two categories, one category is one-stage network, such as You Only Look Once (YOLO) [19,20,21] series and Single Shot MultiBox Detector (SSD) [22], the other is two-stage network, such as Regions with CNN features (RCNN) series [23]. In general, one-stage is faster, two-stage is more precise. Some scholars have applied these two types of detection networks to sewage pipes and have reached a consistent conclusion [24]. In addition, these algorithms have attracted the attention of the researchers, for example, YOLO was applied in various kinds of defects automatic detection [25], YOLO have also been used in detecting multiple damage on the surface of the concrete bridge [26, 27], Faster RCNN was used to detect and preliminarily evaluate the damage caused by earthquake to buildings [28].

Table 3 Object detection model feature

Full size table

3.3 Object Segmentation

In addition to classification and object detection, it is also necessary to separate out all the pixels related to the object and give the categories even though it’s more difficult, which is called object segmentation.

Object segmentation consists of semantic segmentation and instance segmentation. The former is an extension of the pre-background segmentation, requiring the separation of image parts with different semantics [29]. Figure 6 shows the scores of its typical model in the VOC2012 dataset. While the latter is an extension of the detection task, which requires the outline of the objects and more refined than the detection frame. The MASK R-CNN and FCIS [30] are the most significant research outcomes in recent years. Compared with semantic segmentation, instance segmentation can label different individuals of the same type of object on the image, which is a comprehensive task combining image classification, object detection, and semantic segmentation.

In general, Object segmentation is a pixel-level description of an image [14], which is suitable for scenes with high requirements for understanding. Such as the segmentation of roads and non-roads in auto pilot.

3.4 Typical Experimental Tools and Model Evaluation

Good tools such as datasets and computing platform, can make the research process more effective and successful. The development of deep learning is inseparable from the development of datasets. The typical datasets of image processing fields, including MNIST, PASCAL VOC, CIFAR, ImageNet, COCO, Open Image, and Youtube8M play an important role in the recent neural network researches in industry application, academic research and other fields. Programming tools that support deep learning are also very popular, such as TensorFlow, MXNet, PaddlePaddle, Caffe, Torch, and Theano, providing rich convenient interfaces for mathematical computation.

Deep learning is a branch of machine learning, and precision and recall are typical indicators for most machine learning. However, due to the uneven distribution of prior targets, traditional evaluation indicators are not suitable for multi-object detection models. Therefore, different types of classification errors should be considered when evaluating object detection models [5]. The performance of the model is summarized as two aspects: (1) accuracy. The precise recall, average accuracy (AP), mean AP and missing rate belong to accuracy; (2) calculating cost. Detection speed and training time belong to calculating cost.

4 Application in Civil Infrastructure

Civil infrastructures, including bridges, roads, tunnels, and underground utilities like sewage pipe, are becoming susceptible to losing their designed functions due to deterioration caused by use [7]. This inevitable situation means urgent maintenance is required. The condition monitoring of concrete surface plays a significant role in civil infrastructure management system [31]. Defects are the main threat to concrete surface of infrastructure. Traditional vision-based methods of crack detection lack accuracy and generalization when working on complicated infrastructural conditions [32]. At present, a number of computer vision-based crack detection techniques have been developed to enhance the efficiency, speed, and objectivity of inspection [33] and manage a large number of structures [34]. For example, as shown in Fig. 7, three computer vision tasks based on CNN, including image classification, object detection and object segmentation, are employed in three civil infrastructures, including sewage pipe, bridge and road, respectively.

4.1 Sewage Pipe Inspection

As an important component of civil infrastructure, sanitary sewer systems are designed to collect and transport sanitary wastewater and stormwater. Sewer defect inspection is the key in identifying both the type and location of pipe defects to maintain the normal sewer operations [5] for maintenance of urban underground infrastructure [35].

For sewage pipe defect inspection, a CNN was initially used to detect and characterize cracks on an autonomous sewer inspection robot [36]. Currently, closed-circuit television (CCTV) and other visual inspection technologies have been widely used in the inspection of underground sewage pipelines. However, it’s time-consuming and the results are subjective [37] when relying on manual interpretation of the images or videos. However, the deep learning-based approach can automatically extract image features and improve the accuracy and efficiency, and it does not require much for image preprocessing. Therefore, several studies of deep learning-based approach exploration have been performed. For example, the method of image classification is applied to sewage pipe detection with a sufficiently large dataset (over 2 million CCTV images) by Dirk Meijer [38]. A deep learning-based approach is developed for sewer pipe defect detection using faster region-based convolutional neural network (Faster R-CNN) [12]. With the development of deep learning techniques, Yin et al. employ a state-of-the art convolutional neural network (CNN) based object detector, namely YOLOv3 network, for detection system of sewer pipes [39]. A unified neural network, namely DilaSeg-CRF, is proposed by fully integrating a deep convolutional neural network (CNN) with dense conditional random field (CRF) and applied to sewer pipe [6].

4.2 Bridge Inspection

Bridges play an important role in civil infrastructure. Periodic bridge inspections are very important to maintain the functionality, safety and reliability of the bridge structure. It’s essential for the continuous monitoring and maintenance of bridges. As bridges become obsolete, the number of bridges that need to be inspected increases, which requires a lot of maintenance costs. If postponing the cost of bridge maintenance, more costs will be required in the near future [40].

Traditional bridge detection methods rely on human visual inspection [41], which remains the most adopted approach among all nondestructive evaluation techniques that can be used to identify and monitor defects [42]. This method has limitations that the performance is highly related to the experience of the inspector, time consumption and accessible areas [40]. In this case, detection technology based on computer vision [43] and the idea of images obtained from drones [44] are proposed.

Zhang et al. use the applicability of the state-of-the-art single-stage detector YOLOv3 to identify various types of defects in concrete bridges and improve its performance in terms of detection accuracy [42]. Some researchers also use other deep learning-based methods to detect bridge damage and achieve better results, such as CNN [45,46,47,48], region with convolutional neural networks (R-CNN)-based transfer learning [40] when the dataset is not enough.

4.3 Road Inspection

With the rapid development of road traffic, road surface cracks not only affect the transportation efficiency but also pose a potential threat to vehicle safety. The importance of road maintenance has attracted increasing attention. It is crucial to repair the roads in time when potholes are appeared to prevent accidents in advance [49]. In reality, however, due to limited human resources, it is difficult to detect and repair potholes in time. A lot of research has focused on road damage detection, and there are three main methods: vibration sensor-based, laser scanning-based, and computer vision-based methods [50].

With the advent of CNN [51], image processing technology has made significant progress recently, and computer vision-based methods are widely utilized to research road defects. Image processing algorithms [52] mainly include threshold segmentation [53], edge detection [54] and region growth methods [55] for image processing and crack feature recognition. CNN algorithm is applied to concrete pavement crack detection [56, 57]. Chun et al. proposed Fully CNN-based road surface damage detection with semi-supervised learning to detect road damage [49]. Hybrid deep CNN is applied to the detection and location of moisture damage in asphalt pavements, including ResNet50 network for feature extraction, YOLOv2 network for identification, and detection and location of moisture damage [9].

4.4 Other Civil Infrastructures Inspection

In addition to the civil infrastructure mentioned above, other infrastructures also applied deep learning methods to detect damage. Structural health monitoring (SHM) is used to manage and maintain civil infrastructure, which generated a large amount of data. Traditional detection technology cannot effectively analyze these data, and it is time-consuming, laborious, and inefficient. Therefore, how to effectively monitor, mine and use the data requires in-depth research, which considers the introduction of deep learning-based methods for detection. Deep learning-based method is also used to detect crack from concrete surface [58,59,60,61,62,63], structure [64, 65], buildings [66, 67]. In addition, deep learning is also used to identify unsafe behavior from two-dimensional images that appear on construction site [68, 69]. Their experimental results show that the method has a significant improvement in accuracy and efficiency. In summary, deep learning has good application prospects in the field of construction.

Through the review, we found that although many people apply cutting-edge technologies such as computer vision to civilian infrastructure, they have not been implemented in practice and have not achieved real-time detection technology.

5 Conclusions

Computer vision has attracted the increasing attention of researchers and practitioners. This paper gives a brief review of the application of deep learning-based computer vision in civil infrastructure maintenance. Firstly, the research progress of deep learning was reviewed, including the important milestones and deep learning architectures. Deep learning is widely used in the three major directions of computer vision, image classification, object detection, and object segmentation. Secondly, the models used in these three aspects are summarized. Finally, the applications of deep learning-based computer vision for damage detection in the maintenance phase of civil infrastructure, including sewage pipes, bridges and roads, were reviewed.

Through the review, we can find that more and more people are paying attention to automation and intelligence. The application of cutting-edge technology to the construction industry is a measure that conforms to the times. Moving to the forefront of technology is a necessary condition for the development of automation and intelligence in the construction industry. Prosperous application prospects in other aspects of the construction industry. In recent years, these reviewed models have become new hotspots for deep learning and CNN to effectively applied in computer vision, multi-object classification and related fields. They are considered effective methods and tool by the industry and academia. Applying deep learning-based computer vision technology to the construction management field can achieve greater and more innovation and promote the transformation and development of the construction industry. However, we found that although many people apply cutting-edge technologies such as computer vision to civil infrastructure, the implementation is limited in practice, and the real-time detection is also still limited.

References

Perez, H., Tah, J. H. M., & Mosavi, A. (2019). Deep learning for detecting building defects using convolutional neural networks. Sensors, 19(16), 22.
Article Google Scholar
Voulodimos, A., et al. (2018). Deep learning for computer vision: A brief review. Computational Intelligence and Neuroscience, 13.
Google Scholar
Zhong, B. T., et al. (2019). Mapping computer vision research in construction: Developments, knowledge gaps and implications for research. Automation in Construction, 107.
Google Scholar
Zhou, S. L., & Song, W. (2020). Deep learning-based roadway crack classification using laser-scanned range images: A comparative study on hyperparameter selection. Automation in Construction, 114, 17.
Article Google Scholar
Cheng, J. C. P., & Wang, M. Z. (2018). Automated detection of sewer pipe defects in closed-circuit television images using deep learning techniques. Automation in Construction, 95, 155–171.
Article Google Scholar
Wang, M. Z., & Cheng, J. C. P. (2020). A unified convolutional neural network integrated with conditional random field for pipe defect segmentation. Computer-Aided Civil and Infrastructure Engineering, 35(2), 162–177.
Article Google Scholar
Cha, Y. J., Choi, W., & Buyukozturk, O. (2017). Deep learning-based crack damage detection using convolutional neural networks. Computer-Aided Civil and Infrastructure Engineering, 32(5), 361–378.
Article Google Scholar
Lee, K. H., Byun, N., & Shin, D. (2020). A damage localization approach for Rahmen Bridge based on convolutional neural network. Ksce Journal of Civil Engineering, 24(1), 1–9.
Article Google Scholar
Zhang, J., et al. (2020). Automatic detection of moisture damages in asphalt pavements from GPR data with deep CNN and IRS method. Automation in Construction, 113, 11.
Article Google Scholar
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017). ImageNet classification with deep convolutional neural networks. Communications of the Acm, 60(6), 84–90.
Article Google Scholar
Hubel, D. H., & Wiesel, T. N. (1962). Receptive fields, binocular interaction and functional architecture in cats visual cortex. Journal of Physiology-London, 160(1), 106.
Google Scholar
Wang, M. Z., & Cheng, J. C. P. (2018). Development and improvement of deep learning based automated defect detection for sewer pipe inspection using faster R-CNN. In I. F. C. Smith & B. Domer (Eds.), Advanced computing strategies for engineering, Pt Ii (pp. 171–192). Springer International Publishing Ag.
Chapter Google Scholar
Guo, Y. M., et al. (2016). Deep learning for visual understanding: A review. Neurocomputing, 187, 27–48.
Article Google Scholar
Wu, H., Liu, Q., & Liu, X. D. (2019). A review on deep learning approaches to image classification and object segmentation. Cmc-Computers Materials and Continua, 60(2), 575–597.
Article Google Scholar
Simonyan, K., & Zisserman, A. J. A. E.-P. (2014). Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.
Szegedy, C., et al. (2015). Going deeper with convolutions. 2015 Ieee Conference on Computer Vision and Pattern Recognition (pp. 1–9). Ieee.
Google Scholar
Kaiming, H., et al. (2015). Deep residual learning for image recognition (pp. 12).
Google Scholar
Wang, N. N., et al. (2018). Damage classification for masonry historic structures using convolutional neural networks based on still images. Computer-Aided Civil and Infrastructure Engineering, 33(12), 1073–1089.
Article Google Scholar
Redmon, J., & Farhadi, A. (2018). YOLOv3: An incremental improvement (pp. 1–6).
Google Scholar
Redmon, J., Farhadi, A., & IEEE. (2017). YOLO9000: Better, faster, stronger. In 30th Ieee Conference on Computer Vision and Pattern Recognition (pp. 6517–6525).
Google Scholar
Redmon, J., et al. (2016). You only look once: unified, real-time object detection. In 2016 Ieee Conference on Computer Vision and Pattern Recognition (pp. 779–788).
Google Scholar
Liu, W., et al. (2015). SSD: Single shot MultiBox detector. arXiv:1512.02325.
Ren, S. Q., et al. (2017). Faster R-CNN: Towards real-time object detection with region proposal networks. Ieee Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137–1149.
Article Google Scholar
Kumar, S. S., et al. (2020). Deep learning-based automated detection of sewer defects in CCTV videos. Journal of Computing in Civil Engineering, 34(1), 13.
Article Google Scholar
Li, L.J., et al. (2019). Dam surface crack detection based on deep learning. In Proceedings of the 2019 International Conference on Robotics, Intelligent Control and Artificial Intelligence (pp. 738–743). Assoc Computing Machinery.
Google Scholar
Zhang, C. B., Chang, C. C., & Jamshidi, M. (2020). Concrete bridge surface damage detection using a single-stage detector. Computer-Aided Civil and Infrastructure Engineering, 35(4), 389–409.
Article Google Scholar
Zhang, C. B., & Chang, C. C. (2019). Surface damage detection for concrete bridges using single-stage convolutional neural networks. In P. Fromme & Z. Su (Eds.), Health Monitoring of Structural and Biological Systems (pp. Xiii). Spie-Int Soc Optical Engineering.
Google Scholar
Mondal, T. G., et al. (2020). Deep learning-based multi-class damage detection for autonomous post-disaster reconnaissance. Structural Control and Health Monitoring, 27(4), 15.
Google Scholar
Shelhamer, E., Long, J., & Darrell, T. (2017). Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(4), 640–651.
Article Google Scholar
Li, Y., et al. (2017). Fully convolutional instance-aware semantic segmentation. 30th Ieee Conference on Computer Vision and Pattern Recognition (pp. 4438–4446). Ieee.
Google Scholar
Guo, L., et al. (2020). Automatic crack distress classification from concrete surface images using a novel deep-width network architecture. Neurocomputing, 397, 383–392.
Article Google Scholar
Zhang, X. X., Rajan, D., & Story, B. (2019). Concrete crack detection using context-aware deep semantic segmentation network. Computer-Aided Civil and Infrastructure Engineering, 34(11), 951–971.
Article Google Scholar
Kim, B., & Cho, S. (2019). Image-based concrete crack assessment using mask and region-based convolutional neural network. Structural Control and Health Monitoring, 26(8), 15.
Google Scholar
Kim, B., & Cho, S. (2018). Automated vision-based detection of cracks on concrete surfaces using a deep learning technique. Sensors, 18(10), 18.
Article Google Scholar
Xie, Q., et al. (2019). Automatic detection and classification of sewer defects via hierarchical deep learning. Ieee Transactions on Automation Science and Engineering, 16(4), 1836–1847.
Article Google Scholar
Browne, M., & Ghidary, S. S. (2003). Convolutional neural networks for image processing: An application in robot vision. In T. D. Gedeon & L. C. C. Fung (Eds.), Ai 2003: Advances in artificial intelligence (pp. 641–652). Springer.
Google Scholar
Hassan, S. I., et al. (2019). Underground sewer pipe condition assessment based on convolutional neural networks. Automation in Construction, 106, 12.
Article Google Scholar
Meijer, D., et al. (2019). A defect classification methodology for sewer image sets with convolutional neural networks. Automation in Construction, 104, 281–298.
Article Google Scholar
Yin, X. F., et al. (2020). A deep learning-based framework for an automated defect detection system for sewer pipes. Automation in Construction, 109, 17.
Article Google Scholar
Kim, I. H., et al. (2018). Application of crack identification techniques for an aging concrete bridge inspection using an unmanned aerial vehicle. Sensors, 18(6), 14.
Article Google Scholar
Xu, H. Y., et al. (2019). Automatic bridge crack detection using a convolutional neural network. Applied Sciences-Basel, 9(14), 14.
Google Scholar
Jiang, S., & Zhang, J. (2020). Real-time crack assessment using deep neural networks with wall-climbing unmanned aerial system. Computer-Aided Civil and Infrastructure Engineering, 35(6), 549–564.
Article Google Scholar
Dinh, K., Gucunski, N., & Duong, T. H. (2018). An algorithm for automatic localization and detection of rebars from GPR data of concrete bridge decks. Automation in Construction, 89, 292–298.
Article Google Scholar
Jung, H. J., et al. (2019). Bridge inspection and condition assessment using unmanned aerial vehicles (UAVs): Major challenges and solutions from a practical perspective. Smart Structures and Systems, 24(5), 669–681.
Google Scholar
Kang, D. H., & Cha, Y. J. (2018). Autonomous UAVs for structural health monitoring using deep learning and an ultrasonic beacon system with geo-tagging. Computer-Aided Civil and Infrastructure Engineering, 33(10), 885–902.
Article Google Scholar
Liu, H., & Zhang, Y. F. Bridge condition rating data modeling using deep learning algorithm. Structure and Infrastructure Engineering, 14.
Google Scholar
Modarres, C., et al. (2018). Convolutional neural networks for automated damage recognition and damage type identification. Structural Control and Health Monitoring, 25(10), 17.
Article Google Scholar
Zhu, J. S., & Song, J. B. (2020). An intelligent classification model for surface defects on cement concrete bridges. Applied Sciences-Basel, 10(3), 15.
Google Scholar
Chun, C., & Ryu, S. K. (2019). Road surface damage detection using fully convolutional neural networks and semi-supervised learning. Sensors, 19(24), 15.
Article Google Scholar
Augustauskas, R., & Lipnickas, A. (2020). Improved pixel-level pavement-defect segmentation using a deep autoencoder. Sensors, 20(9), 21.
Article Google Scholar
Eguchi, M., et al. (2019). A simplified method of detecting spot surface defects by using Quasi-3D data from a conventional road profiler. Transportation Research Record, 2673(11), 377–387.
Article Google Scholar
Cao, W. M., Liu, Q. F., & He, Z. Q. (2020). Review of pavement defect detection methods. Ieee Access, 8, 14531–14544.
Article Google Scholar
Zhu, S. P., et al. (2007). An image segmentation algorithm in image processing based on threshold segmentation. In Sitis 2007: Proceedings of the International Conference on Signal Image Technologies & Internet Based Systems (pp. 673+). Ieee Computer Soc.
Google Scholar
Huili, Z., Guofeng, Q., & Xingjian, W. (2010). Improvement of canny algorithm based on pavement edge detection. In Proceedings of the 2010 3rd International Congress on Image and Signal Processing (CISP 2010) (pp. 964–967).
Google Scholar
Zhou, Y., et al. (2016). Seed-based approach for automated crack detection from pavement images. Transportation Research Record, 2589, 162–171.
Article Google Scholar
Qu, Z., et al. (2020). Crack detection of concrete pavement with cross-entropy loss function and improved VGG16 network model. Ieee Access, 8, 54564–54573.
Article Google Scholar
Riid, A., et al. (2019). Pavement distress detection with deep learning using the orthoframes acquired by a mobile mapping system. Applied Sciences-Basel, 9(22), 22.
Google Scholar
Cha, Y. J., et al. (2018). Autonomous structural visual inspection using region-based deep learning for detecting multiple damage types. Computer-Aided Civil and Infrastructure Engineering, 33(9), 731–747.
Article Google Scholar
Deng, L., et al. (2020). Region-based CNN method with deformable modules for visually classifying concrete cracks. Applied Sciences-Basel, 10(7), 18.
Google Scholar
Lee, J., et al. (2019). Learning to detect cracks on damaged concrete surfaces using two-branched convolutional neural network. Sensors, 19(21), 18.
Article Google Scholar
Li, S. Y., & Zhao, X. F. (2019). Image-based concrete crack detection using convolutional neural network and exhaustive search technique. Advances in Civil Engineering, 2019, 12.
Google Scholar
Wei, F. J., et al. (2019). Instance-level recognition and quantification for concrete surface bughole based on deep learning. Automation in Construction, 107, 13.
Article Google Scholar
Ryu, E., et al. (2020). Automated detection of surface cracks and numerical correlation with thermal-structural behaviors of fire damaged concrete beams. International Journal of Concrete Structures and Materials, 14(1), 12.
Article Google Scholar
Ye, X. W., Jin, T., & Chen, P. Y. (2019). Structural crack detection using deep learning-based fully convolutional networks. Advances in Structural Engineering, 22(16), 3412–3419.
Article Google Scholar
Wei, S. Y., Bao, Y. Q., & Li, H. (2020). Optimal policy for structure maintenance: A deep reinforcement learning framework. Structural Safety, 83, 13.
Article Google Scholar
Perry, B. J., et al. (2020). Streamlined bridge inspection system utilizing unmanned aerial vehicles (UAVs) and machine learning. Measurement, 164, 14.
Article Google Scholar
Liu, L. L., et al. (2017). Transfer learning on convolutional activation feature as applied to a building quality assessment robot. International Journal of Advanced Robotic Systems, 14(3), 12.
Article Google Scholar
Fang, W. L., et al. (2020). Computer vision for behaviour-based safety in construction: A review and future directions. Advanced Engineering Informatics, 43, 13.
Article Google Scholar
Fang, W. L., et al. (2020). Computer vision applications in construction safety assurance. Automation in Construction, 110, 10.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Sino-Australia Joint Research Center in BIM and Smart Construction, Shenzhen University, Shenzhen, China
Ruying Cai, Jingru Li, Geng Li, Dongdong Tang & Yi Tan

Authors

Ruying Cai
View author publications
You can also search for this author in PubMed Google Scholar
Jingru Li
View author publications
You can also search for this author in PubMed Google Scholar
Geng Li
View author publications
You can also search for this author in PubMed Google Scholar
Dongdong Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Tan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yi Tan .

Editor information

Editors and Affiliations

Central China Normal University, Wuhan, China
Xinhai Lu
Central China Normal University, Wuhan, China
Zuo Zhang
University of Hong Kong, Hong Kong, China
Weisheng Lu
Zhejiang University of Finance and Economics, Hangzhou, China
Yi Peng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cai, R., Li, J., Li, G., Tang, D., Tan, Y. (2021). A Review of the Application of CNN-Based Computer Vision in Civil Infrastructure Maintenance. In: Lu, X., Zhang, Z., Lu, W., Peng, Y. (eds) Proceedings of the 25th International Symposium on Advancement of Construction Management and Real Estate. CRIOCM 2020. Springer, Singapore. https://doi.org/10.1007/978-981-16-3587-8_42

Download citation

DOI: https://doi.org/10.1007/978-981-16-3587-8_42
Published: 12 October 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3586-1
Online ISBN: 978-981-16-3587-8
eBook Packages: Business and ManagementBusiness and Management (R0)

Publish with us

Policies and ethics

A Review of the Application of CNN-Based Computer Vision in Civil Infrastructure Maintenance

Abstract

Similar content being viewed by others

Applications of Deep Learning in Intelligent Construction

A review of the research and application of deep learning-based computer vision in structural damage detection

Structural Damage Detection using Deep Convolutional Neural Network and Transfer Learning

Keywords

1 Introduction

2 Research Progress of Deep Learning