Abstract
Computer-vision and deep-learning techniques are being increasingly applied to the maintenance of civil infrastructure, such as inspecting, monitoring, and assessing infrastructure conditions, which overcome time-consuming and laborious compared with traditional technology. In this paper, the research progress of deep learning, the developments of convolutional neural network (CNN)-based computer vision in improving accuracy, reliability and generalized object detection capability and its application in civil infrastructure maintenance are reviewed. The main objectives are as follows: (1) clarify the application of deep learning in computer vision to help researchers systematically understand deep learning; (2) review the application of computer vision in civil infrastructure maintenance to help researchers pay more attention to its advantages; (3) encourage relevant personnel to use this research as a reference, take deep learning as an important method at the forefront of engineering management, generate more innovations in the construction field, and promote the development of the construction industry.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Civil infrastructure can provide good services to the citizens as the operation and management activities. If maintenance is not carried out in time, it will not only cause potential hazards and hidden dangers to the civil infrastructure and its ancillary facilities, but also threaten citizen lives. Therefore, it’s essential for real-time monitoring the condition of the infrastructure, so that necessary repairs and maintenance work can be carried out proactively and timely before it becomes too dangerous and expensive. Conventional manual monitoring is extensively time-consuming, laborious, expensive and has healthy and safety problems, particularly for the aerial working environment where detection is difficult to conduct [1].
In the past years, deep learning techniques, especially convolutional neural network (CNN), have been shown to outperform previous state-of-the-art machine learning techniques in several fields, with computer vision being one of the most prominent cases [2]. Computer vision is changing processes of the construction management as it enables the automatic acquisition, processing, analysis of digital images, and the extraction of high-dimensional data from the real world to produce useful information to improve managerial decision-making [3].
Deep learning has obtained promising performance in various computer vision tasks such as image classification [4], object detection [5] and object segmentation [6]. These three tasks are not only related to each other, but also progressive. The connection is that they are all based on the basic idea of CNN. The progressive relationship increases difficulties of three tasks. Both object detection and segmentation use some basic network models from image classification. The CNN-based image classification algorithm provides many new ideas for object detection and segmentation, and has achieved good results. This paper will briefly describe these three tasks and make a general comparison. From the beginning, these tasks have been applied to the industrial field, until now, they have been applied to many other fields and made great achievements, and have great application prospects in civil infrastructure maintenance. The application of deep learning based on CNN in the automatic detection and location of defects in civil infrastructures [7], such as bridges [8], roads [9] and sewage pipes [6], can solve these problems.
The remainder of this paper is organized as follows. Figure 1 is the structure of this review work. In Sect. 2, the research progress of deep learning, including the structure of deep learning, is described. Section 3 described the use of deep learning methods to address key tasks in computer vision, such as image classification, object detection, and object segmentation. In Sect. 4, the application of deep learning-based computer vision in civil infrastructure maintenance are reviewed.
2 Research Progress of Deep Learning
2.1 Important Milestones of Deep Learning
Deep learning methods usually addresses rich and complex data from different sources, and they have performed better than previous technologies in multiple tasks, attracting increasing attention. Where does it start? How does it determine whether a particular deep learning model is suitable for their problem? How to train and deploy them? With these questions, the important milestones leading up to the era of deep learning [2] are firstly summarized in Table 1. The MCP model and Neocognitron are the beginning of the artificial neural networks (ANN) and CNN, respectively. However, AlexNet [10] won the ImageNet contest in 2012 with an absolute advantage of 10.9 percentage points over the second place. Since then, deep learning and convolutional neural networks rose to prominence with AlexNet. An overview of deep learning structure based on CNN is presented next.
2.2 Deep Learning Structure Based on CNN
CNNs were inspired by the visual system’s structure, in particular by its proposed models [11]. A CNN consists of three main types of layers, namely, convolutional layers, pooling layers and fully connected layers. Each type of layers has a different task. Figure 2 shows a general CNN architecture for an image classification task. In addition, CNN also has activation function, Batch Normalization and Regularization.
-
(i)
Convolutional layers
In the convolutional layers, various kernels are used to convolve the input data to generate feature maps. The convolution operation is to cover the entire image step by step with the convolution kernel according to the step size, and the value of the filter is multiplied by the pixel value of the corresponding position of the image and then summed. The value obtained is the value of the target pixel in the output image.
-
(ii)
Pooling layers
The pooling layer reduces the spatial size (width × height) of the input volume of the next convolutional layer through maximum pooling or average pooling, but does not affect its depth. This operation can reduce the number of parameters in the network, reduce the consumption of computing resources, and can also effectively control overfitting. The operation process of the pooling layer is to first slide the input data through the spatial window, and select the maximum or average value as the output result, and then continue to slide the window until the entire input data is covered, and finally the output results of each sliding are in order arrange to obtain the final complete output data. In the whole process, reduce the spatial size of the input data. The size of the sliding window and the sliding step size will affect the output data, so it is necessary to use the appropriate size and step size for the accuracy of the results.
-
(iii)
Fully connected layers
Following several convolutional and pooling layers, the high-level reasoning in the neural network is performed via fully connected layers. Fully connected layers play the role of classifier in the entire CNN.
-
(iv)
Activation function
The emergence of the activation function Rectified Linear Units (ReLU) solves the problem that sigmoid and tanh are prone to disappearing gradients, which is currently the most commonly used activation function. Generally, the activation function is used after each convolution.
-
(v)
Batch Normalization and Regularization
Batch Normalization is to force the distribution of the input value back to a standard normal distribution with a mean of 0 and a variance of 1, to avoid the problem of vanishing gradients. Dropout is a convenient but powerful regularization method, which randomly deletes some nodes in each iteration, and only train the remaining nodes to suppress overfitting.
2.3 The Relationship Between Machine Learning, Deep Learning, CNN, Computer Vision and Civil Infrastructure
Understanding the relationship between machine learning, deep learning, CNN, computer vision and civil infrastructure can help researchers understand this paper. For machine learning, the way to solve the problem is to find out the mapping relationship between X and Y through the model, among which the available models are logistic regression, linear regression, support vector machine (SVM) and others. While, using the type of neural network model is called deep learning, which including convolutional neural networks. The application of convolutional neural network to computer vision mainly has three major tasks, including Image classification, Object detection, Object segmentation. Then, these three tasks are applied to civil infrastructure, as can be seen from Fig. 3.
3 Application of CNN in Computer Vision
Deep learning has been widely adopted in various directions of computer vision, such as image classification, object detection and segmentation, which are key tasks for image understanding. The differences among the three tasks can be seen intuitively from Fig. 4, taking crack images of sewage pipes [12] as an example. In this part, the developments of deep learning in above-mentioned three tasks, especially the CNN- based algorithms, will be briefly summarized.
3.1 Image Classification
The image classification task means that image is labeled with a probability of the presence of a particular visual object class [13], which is the simplest and most basic image understanding task. The task of the deep learning model is to achieve the first breakthrough and realize large-scale application.
In general, CNN is the most advanced compared to classical algorithms [14]. Through the continuous research and improvement of its structure, a series of network models have been formed and successfully applied in a wide range of practical applications, such as AlexNet, VGGNet [15], GoogleNet [16] and ResNet [17] as shown in Table 2. It can be seen from the table that more and more optimizations are applied to network design, such as Dropout, Local response normalization (LRN), and Batch normalization. The state-of-the-art results of the top-5 error rate tested by ImageNet since 2012 are also presented in Fig. 5. The model CNN-based is also used in the cracks of civil infrastructures, for example, Zhou and Song developed Deep CNN structures with different layouts for fracture classification based on laser scanning range images [4], Wang et al. proposed a CNN-based damage classification technology for deep buildings targeting masonry historical structures [18].
3.2 Object Detection
Image classification is the basis of computer vision, but only classification is not enough. Object detection is different from but closely related to the image classification task. Object recognition and segmentation are more difficult but meaningful. The classification task is only concerned with classification, while the detection task is not only focus on classification, but also required to obtain the location of the detected object.
Object detection research has been conducted for many years, and there are many methods that have been widely recognized and applied in the industry. Several typical detection models and their feature are introduced in Table 3. Object detection is usually divided into two categories, one category is one-stage network, such as You Only Look Once (YOLO) [19,20,21] series and Single Shot MultiBox Detector (SSD) [22], the other is two-stage network, such as Regions with CNN features (RCNN) series [23]. In general, one-stage is faster, two-stage is more precise. Some scholars have applied these two types of detection networks to sewage pipes and have reached a consistent conclusion [24]. In addition, these algorithms have attracted the attention of the researchers, for example, YOLO was applied in various kinds of defects automatic detection [25], YOLO have also been used in detecting multiple damage on the surface of the concrete bridge [26, 27], Faster RCNN was used to detect and preliminarily evaluate the damage caused by earthquake to buildings [28].
3.3 Object Segmentation
In addition to classification and object detection, it is also necessary to separate out all the pixels related to the object and give the categories even though it’s more difficult, which is called object segmentation.
Object segmentation consists of semantic segmentation and instance segmentation. The former is an extension of the pre-background segmentation, requiring the separation of image parts with different semantics [29]. Figure 6 shows the scores of its typical model in the VOC2012 dataset. While the latter is an extension of the detection task, which requires the outline of the objects and more refined than the detection frame. The MASK R-CNN and FCIS [30] are the most significant research outcomes in recent years. Compared with semantic segmentation, instance segmentation can label different individuals of the same type of object on the image, which is a comprehensive task combining image classification, object detection, and semantic segmentation.
In general, Object segmentation is a pixel-level description of an image [14], which is suitable for scenes with high requirements for understanding. Such as the segmentation of roads and non-roads in auto pilot.
3.4 Typical Experimental Tools and Model Evaluation
Good tools such as datasets and computing platform, can make the research process more effective and successful. The development of deep learning is inseparable from the development of datasets. The typical datasets of image processing fields, including MNIST, PASCAL VOC, CIFAR, ImageNet, COCO, Open Image, and Youtube8M play an important role in the recent neural network researches in industry application, academic research and other fields. Programming tools that support deep learning are also very popular, such as TensorFlow, MXNet, PaddlePaddle, Caffe, Torch, and Theano, providing rich convenient interfaces for mathematical computation.
Deep learning is a branch of machine learning, and precision and recall are typical indicators for most machine learning. However, due to the uneven distribution of prior targets, traditional evaluation indicators are not suitable for multi-object detection models. Therefore, different types of classification errors should be considered when evaluating object detection models [5]. The performance of the model is summarized as two aspects: (1) accuracy. The precise recall, average accuracy (AP), mean AP and missing rate belong to accuracy; (2) calculating cost. Detection speed and training time belong to calculating cost.
4 Application in Civil Infrastructure
Civil infrastructures, including bridges, roads, tunnels, and underground utilities like sewage pipe, are becoming susceptible to losing their designed functions due to deterioration caused by use [7]. This inevitable situation means urgent maintenance is required. The condition monitoring of concrete surface plays a significant role in civil infrastructure management system [31]. Defects are the main threat to concrete surface of infrastructure. Traditional vision-based methods of crack detection lack accuracy and generalization when working on complicated infrastructural conditions [32]. At present, a number of computer vision-based crack detection techniques have been developed to enhance the efficiency, speed, and objectivity of inspection [33] and manage a large number of structures [34]. For example, as shown in Fig. 7, three computer vision tasks based on CNN, including image classification, object detection and object segmentation, are employed in three civil infrastructures, including sewage pipe, bridge and road, respectively.
4.1 Sewage Pipe Inspection
As an important component of civil infrastructure, sanitary sewer systems are designed to collect and transport sanitary wastewater and stormwater. Sewer defect inspection is the key in identifying both the type and location of pipe defects to maintain the normal sewer operations [5] for maintenance of urban underground infrastructure [35].
For sewage pipe defect inspection, a CNN was initially used to detect and characterize cracks on an autonomous sewer inspection robot [36]. Currently, closed-circuit television (CCTV) and other visual inspection technologies have been widely used in the inspection of underground sewage pipelines. However, it’s time-consuming and the results are subjective [37] when relying on manual interpretation of the images or videos. However, the deep learning-based approach can automatically extract image features and improve the accuracy and efficiency, and it does not require much for image preprocessing. Therefore, several studies of deep learning-based approach exploration have been performed. For example, the method of image classification is applied to sewage pipe detection with a sufficiently large dataset (over 2 million CCTV images) by Dirk Meijer [38]. A deep learning-based approach is developed for sewer pipe defect detection using faster region-based convolutional neural network (Faster R-CNN) [12]. With the development of deep learning techniques, Yin et al. employ a state-of-the art convolutional neural network (CNN) based object detector, namely YOLOv3 network, for detection system of sewer pipes [39]. A unified neural network, namely DilaSeg-CRF, is proposed by fully integrating a deep convolutional neural network (CNN) with dense conditional random field (CRF) and applied to sewer pipe [6].
4.2 Bridge Inspection
Bridges play an important role in civil infrastructure. Periodic bridge inspections are very important to maintain the functionality, safety and reliability of the bridge structure. It’s essential for the continuous monitoring and maintenance of bridges. As bridges become obsolete, the number of bridges that need to be inspected increases, which requires a lot of maintenance costs. If postponing the cost of bridge maintenance, more costs will be required in the near future [40].
Traditional bridge detection methods rely on human visual inspection [41], which remains the most adopted approach among all nondestructive evaluation techniques that can be used to identify and monitor defects [42]. This method has limitations that the performance is highly related to the experience of the inspector, time consumption and accessible areas [40]. In this case, detection technology based on computer vision [43] and the idea of images obtained from drones [44] are proposed.
Zhang et al. use the applicability of the state-of-the-art single-stage detector YOLOv3 to identify various types of defects in concrete bridges and improve its performance in terms of detection accuracy [42]. Some researchers also use other deep learning-based methods to detect bridge damage and achieve better results, such as CNN [45,46,47,48], region with convolutional neural networks (R-CNN)-based transfer learning [40] when the dataset is not enough.
4.3 Road Inspection
With the rapid development of road traffic, road surface cracks not only affect the transportation efficiency but also pose a potential threat to vehicle safety. The importance of road maintenance has attracted increasing attention. It is crucial to repair the roads in time when potholes are appeared to prevent accidents in advance [49]. In reality, however, due to limited human resources, it is difficult to detect and repair potholes in time. A lot of research has focused on road damage detection, and there are three main methods: vibration sensor-based, laser scanning-based, and computer vision-based methods [50].
With the advent of CNN [51], image processing technology has made significant progress recently, and computer vision-based methods are widely utilized to research road defects. Image processing algorithms [52] mainly include threshold segmentation [53], edge detection [54] and region growth methods [55] for image processing and crack feature recognition. CNN algorithm is applied to concrete pavement crack detection [56, 57]. Chun et al. proposed Fully CNN-based road surface damage detection with semi-supervised learning to detect road damage [49]. Hybrid deep CNN is applied to the detection and location of moisture damage in asphalt pavements, including ResNet50 network for feature extraction, YOLOv2 network for identification, and detection and location of moisture damage [9].
4.4 Other Civil Infrastructures Inspection
In addition to the civil infrastructure mentioned above, other infrastructures also applied deep learning methods to detect damage. Structural health monitoring (SHM) is used to manage and maintain civil infrastructure, which generated a large amount of data. Traditional detection technology cannot effectively analyze these data, and it is time-consuming, laborious, and inefficient. Therefore, how to effectively monitor, mine and use the data requires in-depth research, which considers the introduction of deep learning-based methods for detection. Deep learning-based method is also used to detect crack from concrete surface [58,59,60,61,62,63], structure [64, 65], buildings [66, 67]. In addition, deep learning is also used to identify unsafe behavior from two-dimensional images that appear on construction site [68, 69]. Their experimental results show that the method has a significant improvement in accuracy and efficiency. In summary, deep learning has good application prospects in the field of construction.
Through the review, we found that although many people apply cutting-edge technologies such as computer vision to civilian infrastructure, they have not been implemented in practice and have not achieved real-time detection technology.
5 Conclusions
Computer vision has attracted the increasing attention of researchers and practitioners. This paper gives a brief review of the application of deep learning-based computer vision in civil infrastructure maintenance. Firstly, the research progress of deep learning was reviewed, including the important milestones and deep learning architectures. Deep learning is widely used in the three major directions of computer vision, image classification, object detection, and object segmentation. Secondly, the models used in these three aspects are summarized. Finally, the applications of deep learning-based computer vision for damage detection in the maintenance phase of civil infrastructure, including sewage pipes, bridges and roads, were reviewed.
Through the review, we can find that more and more people are paying attention to automation and intelligence. The application of cutting-edge technology to the construction industry is a measure that conforms to the times. Moving to the forefront of technology is a necessary condition for the development of automation and intelligence in the construction industry. Prosperous application prospects in other aspects of the construction industry. In recent years, these reviewed models have become new hotspots for deep learning and CNN to effectively applied in computer vision, multi-object classification and related fields. They are considered effective methods and tool by the industry and academia. Applying deep learning-based computer vision technology to the construction management field can achieve greater and more innovation and promote the transformation and development of the construction industry. However, we found that although many people apply cutting-edge technologies such as computer vision to civil infrastructure, the implementation is limited in practice, and the real-time detection is also still limited.
References
Perez, H., Tah, J. H. M., & Mosavi, A. (2019). Deep learning for detecting building defects using convolutional neural networks. Sensors, 19(16), 22.
Voulodimos, A., et al. (2018). Deep learning for computer vision: A brief review. Computational Intelligence and Neuroscience, 13.
Zhong, B. T., et al. (2019). Mapping computer vision research in construction: Developments, knowledge gaps and implications for research. Automation in Construction, 107.
Zhou, S. L., & Song, W. (2020). Deep learning-based roadway crack classification using laser-scanned range images: A comparative study on hyperparameter selection. Automation in Construction, 114, 17.
Cheng, J. C. P., & Wang, M. Z. (2018). Automated detection of sewer pipe defects in closed-circuit television images using deep learning techniques. Automation in Construction, 95, 155–171.
Wang, M. Z., & Cheng, J. C. P. (2020). A unified convolutional neural network integrated with conditional random field for pipe defect segmentation. Computer-Aided Civil and Infrastructure Engineering, 35(2), 162–177.
Cha, Y. J., Choi, W., & Buyukozturk, O. (2017). Deep learning-based crack damage detection using convolutional neural networks. Computer-Aided Civil and Infrastructure Engineering, 32(5), 361–378.
Lee, K. H., Byun, N., & Shin, D. (2020). A damage localization approach for Rahmen Bridge based on convolutional neural network. Ksce Journal of Civil Engineering, 24(1), 1–9.
Zhang, J., et al. (2020). Automatic detection of moisture damages in asphalt pavements from GPR data with deep CNN and IRS method. Automation in Construction, 113, 11.
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017). ImageNet classification with deep convolutional neural networks. Communications of the Acm, 60(6), 84–90.
Hubel, D. H., & Wiesel, T. N. (1962). Receptive fields, binocular interaction and functional architecture in cats visual cortex. Journal of Physiology-London, 160(1), 106.
Wang, M. Z., & Cheng, J. C. P. (2018). Development and improvement of deep learning based automated defect detection for sewer pipe inspection using faster R-CNN. In I. F. C. Smith & B. Domer (Eds.), Advanced computing strategies for engineering, Pt Ii (pp. 171–192). Springer International Publishing Ag.
Guo, Y. M., et al. (2016). Deep learning for visual understanding: A review. Neurocomputing, 187, 27–48.
Wu, H., Liu, Q., & Liu, X. D. (2019). A review on deep learning approaches to image classification and object segmentation. Cmc-Computers Materials and Continua, 60(2), 575–597.
Simonyan, K., & Zisserman, A. J. A. E.-P. (2014). Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.
Szegedy, C., et al. (2015). Going deeper with convolutions. 2015 Ieee Conference on Computer Vision and Pattern Recognition (pp. 1–9). Ieee.
Kaiming, H., et al. (2015). Deep residual learning for image recognition (pp. 12).
Wang, N. N., et al. (2018). Damage classification for masonry historic structures using convolutional neural networks based on still images. Computer-Aided Civil and Infrastructure Engineering, 33(12), 1073–1089.
Redmon, J., & Farhadi, A. (2018). YOLOv3: An incremental improvement (pp. 1–6).
Redmon, J., Farhadi, A., & IEEE. (2017). YOLO9000: Better, faster, stronger. In 30th Ieee Conference on Computer Vision and Pattern Recognition (pp. 6517–6525).
Redmon, J., et al. (2016). You only look once: unified, real-time object detection. In 2016 Ieee Conference on Computer Vision and Pattern Recognition (pp. 779–788).
Liu, W., et al. (2015). SSD: Single shot MultiBox detector. arXiv:1512.02325.
Ren, S. Q., et al. (2017). Faster R-CNN: Towards real-time object detection with region proposal networks. Ieee Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137–1149.
Kumar, S. S., et al. (2020). Deep learning-based automated detection of sewer defects in CCTV videos. Journal of Computing in Civil Engineering, 34(1), 13.
Li, L.J., et al. (2019). Dam surface crack detection based on deep learning. In Proceedings of the 2019 International Conference on Robotics, Intelligent Control and Artificial Intelligence (pp. 738–743). Assoc Computing Machinery.
Zhang, C. B., Chang, C. C., & Jamshidi, M. (2020). Concrete bridge surface damage detection using a single-stage detector. Computer-Aided Civil and Infrastructure Engineering, 35(4), 389–409.
Zhang, C. B., & Chang, C. C. (2019). Surface damage detection for concrete bridges using single-stage convolutional neural networks. In P. Fromme & Z. Su (Eds.), Health Monitoring of Structural and Biological Systems (pp. Xiii). Spie-Int Soc Optical Engineering.
Mondal, T. G., et al. (2020). Deep learning-based multi-class damage detection for autonomous post-disaster reconnaissance. Structural Control and Health Monitoring, 27(4), 15.
Shelhamer, E., Long, J., & Darrell, T. (2017). Fully convolutional networks for semantic segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(4), 640–651.
Li, Y., et al. (2017). Fully convolutional instance-aware semantic segmentation. 30th Ieee Conference on Computer Vision and Pattern Recognition (pp. 4438–4446). Ieee.
Guo, L., et al. (2020). Automatic crack distress classification from concrete surface images using a novel deep-width network architecture. Neurocomputing, 397, 383–392.
Zhang, X. X., Rajan, D., & Story, B. (2019). Concrete crack detection using context-aware deep semantic segmentation network. Computer-Aided Civil and Infrastructure Engineering, 34(11), 951–971.
Kim, B., & Cho, S. (2019). Image-based concrete crack assessment using mask and region-based convolutional neural network. Structural Control and Health Monitoring, 26(8), 15.
Kim, B., & Cho, S. (2018). Automated vision-based detection of cracks on concrete surfaces using a deep learning technique. Sensors, 18(10), 18.
Xie, Q., et al. (2019). Automatic detection and classification of sewer defects via hierarchical deep learning. Ieee Transactions on Automation Science and Engineering, 16(4), 1836–1847.
Browne, M., & Ghidary, S. S. (2003). Convolutional neural networks for image processing: An application in robot vision. In T. D. Gedeon & L. C. C. Fung (Eds.), Ai 2003: Advances in artificial intelligence (pp. 641–652). Springer.
Hassan, S. I., et al. (2019). Underground sewer pipe condition assessment based on convolutional neural networks. Automation in Construction, 106, 12.
Meijer, D., et al. (2019). A defect classification methodology for sewer image sets with convolutional neural networks. Automation in Construction, 104, 281–298.
Yin, X. F., et al. (2020). A deep learning-based framework for an automated defect detection system for sewer pipes. Automation in Construction, 109, 17.
Kim, I. H., et al. (2018). Application of crack identification techniques for an aging concrete bridge inspection using an unmanned aerial vehicle. Sensors, 18(6), 14.
Xu, H. Y., et al. (2019). Automatic bridge crack detection using a convolutional neural network. Applied Sciences-Basel, 9(14), 14.
Jiang, S., & Zhang, J. (2020). Real-time crack assessment using deep neural networks with wall-climbing unmanned aerial system. Computer-Aided Civil and Infrastructure Engineering, 35(6), 549–564.
Dinh, K., Gucunski, N., & Duong, T. H. (2018). An algorithm for automatic localization and detection of rebars from GPR data of concrete bridge decks. Automation in Construction, 89, 292–298.
Jung, H. J., et al. (2019). Bridge inspection and condition assessment using unmanned aerial vehicles (UAVs): Major challenges and solutions from a practical perspective. Smart Structures and Systems, 24(5), 669–681.
Kang, D. H., & Cha, Y. J. (2018). Autonomous UAVs for structural health monitoring using deep learning and an ultrasonic beacon system with geo-tagging. Computer-Aided Civil and Infrastructure Engineering, 33(10), 885–902.
Liu, H., & Zhang, Y. F. Bridge condition rating data modeling using deep learning algorithm. Structure and Infrastructure Engineering, 14.
Modarres, C., et al. (2018). Convolutional neural networks for automated damage recognition and damage type identification. Structural Control and Health Monitoring, 25(10), 17.
Zhu, J. S., & Song, J. B. (2020). An intelligent classification model for surface defects on cement concrete bridges. Applied Sciences-Basel, 10(3), 15.
Chun, C., & Ryu, S. K. (2019). Road surface damage detection using fully convolutional neural networks and semi-supervised learning. Sensors, 19(24), 15.
Augustauskas, R., & Lipnickas, A. (2020). Improved pixel-level pavement-defect segmentation using a deep autoencoder. Sensors, 20(9), 21.
Eguchi, M., et al. (2019). A simplified method of detecting spot surface defects by using Quasi-3D data from a conventional road profiler. Transportation Research Record, 2673(11), 377–387.
Cao, W. M., Liu, Q. F., & He, Z. Q. (2020). Review of pavement defect detection methods. Ieee Access, 8, 14531–14544.
Zhu, S. P., et al. (2007). An image segmentation algorithm in image processing based on threshold segmentation. In Sitis 2007: Proceedings of the International Conference on Signal Image Technologies & Internet Based Systems (pp. 673+). Ieee Computer Soc.
Huili, Z., Guofeng, Q., & Xingjian, W. (2010). Improvement of canny algorithm based on pavement edge detection. In Proceedings of the 2010 3rd International Congress on Image and Signal Processing (CISP 2010) (pp. 964–967).
Zhou, Y., et al. (2016). Seed-based approach for automated crack detection from pavement images. Transportation Research Record, 2589, 162–171.
Qu, Z., et al. (2020). Crack detection of concrete pavement with cross-entropy loss function and improved VGG16 network model. Ieee Access, 8, 54564–54573.
Riid, A., et al. (2019). Pavement distress detection with deep learning using the orthoframes acquired by a mobile mapping system. Applied Sciences-Basel, 9(22), 22.
Cha, Y. J., et al. (2018). Autonomous structural visual inspection using region-based deep learning for detecting multiple damage types. Computer-Aided Civil and Infrastructure Engineering, 33(9), 731–747.
Deng, L., et al. (2020). Region-based CNN method with deformable modules for visually classifying concrete cracks. Applied Sciences-Basel, 10(7), 18.
Lee, J., et al. (2019). Learning to detect cracks on damaged concrete surfaces using two-branched convolutional neural network. Sensors, 19(21), 18.
Li, S. Y., & Zhao, X. F. (2019). Image-based concrete crack detection using convolutional neural network and exhaustive search technique. Advances in Civil Engineering, 2019, 12.
Wei, F. J., et al. (2019). Instance-level recognition and quantification for concrete surface bughole based on deep learning. Automation in Construction, 107, 13.
Ryu, E., et al. (2020). Automated detection of surface cracks and numerical correlation with thermal-structural behaviors of fire damaged concrete beams. International Journal of Concrete Structures and Materials, 14(1), 12.
Ye, X. W., Jin, T., & Chen, P. Y. (2019). Structural crack detection using deep learning-based fully convolutional networks. Advances in Structural Engineering, 22(16), 3412–3419.
Wei, S. Y., Bao, Y. Q., & Li, H. (2020). Optimal policy for structure maintenance: A deep reinforcement learning framework. Structural Safety, 83, 13.
Perry, B. J., et al. (2020). Streamlined bridge inspection system utilizing unmanned aerial vehicles (UAVs) and machine learning. Measurement, 164, 14.
Liu, L. L., et al. (2017). Transfer learning on convolutional activation feature as applied to a building quality assessment robot. International Journal of Advanced Robotic Systems, 14(3), 12.
Fang, W. L., et al. (2020). Computer vision for behaviour-based safety in construction: A review and future directions. Advanced Engineering Informatics, 43, 13.
Fang, W. L., et al. (2020). Computer vision applications in construction safety assurance. Automation in Construction, 110, 10.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Cai, R., Li, J., Li, G., Tang, D., Tan, Y. (2021). A Review of the Application of CNN-Based Computer Vision in Civil Infrastructure Maintenance. In: Lu, X., Zhang, Z., Lu, W., Peng, Y. (eds) Proceedings of the 25th International Symposium on Advancement of Construction Management and Real Estate. CRIOCM 2020. Springer, Singapore. https://doi.org/10.1007/978-981-16-3587-8_42
Download citation
DOI: https://doi.org/10.1007/978-981-16-3587-8_42
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3586-1
Online ISBN: 978-981-16-3587-8
eBook Packages: Business and ManagementBusiness and Management (R0)