1 Introduction

Data mining is the process of discovering patterns in large data sets using methods at the intersection of machine learning, statistics, and database systems. It is the practice of analyzing large pre-existing databases in order to generate new information. Data mining is the analysis step of the "knowledge discovery in databases" process [1, 2]. It is an interdisciplinary subfield of computer science and statistics whose overall aim is to extract information from a data set and transform it into an understandable structure for further use. The technique is also used in business applications such as market segmentation, fraud detection, and credit risk assessment, among many others [3]. Construction and manufacturing companies collect data on a daily basis during their operations. Because data mining can find patterns stored within such data, it is now regarded as a catalyst for improving business processes by avoiding failure patterns [4, 5].

1.1 Methods and Approach

Some of the applications of data mining in the field of civil engineering are as follows [6,7,8,9]:

  i. Strength Prediction: Predicting strength based on the concrete mixture, soil, and several other variables.

  ii. Structural Health Monitoring: Monitoring the health of infrastructure sub- and superstructures based on data received from sensors.

  iii. Traffic Engineering: Data sensing, analysis, and mining can be used to facilitate decision-making and intelligent transportation systems.

  iv. Site Selection: Spatial data mining can be used to find the best possible location for construction.

  v. Water Resources Engineering: Data mining is used to identify chemical trends in water quality samples.

  vi. Construction Management: Data mining is used to estimate the maintenance cost and quality of construction.

  vii. Environmental Engineering: Data mining can be used in the study of environmental and natural resource science.

  viii. Highway and Transportation Engineering: Traffic and pavement management can be carried out with the help of data mining.

  ix. Hydraulics and Water Power Engineering: Dangerous risks can be predicted using data mining techniques.

  x. Materials Science and Engineering: Data mining techniques can be applied in many domains, such as astronomy, bioinformatics, chemistry, materials science, climate, fusion, and combustion.

  xi. Geotechnical Engineering: Data mining can be used to formulate several complex geotechnical engineering problems. The analysis of geotechnical behavior is complex because soil and rock responses depend on many variables.

  xii. Earthquake Engineering: Data mining would be extremely helpful in predicting natural calamities such as floods, tornadoes, hurricanes, volcanic eruptions, earthquakes, heat waves, and landslides.

  xiii. Coastal and Harbor Engineering: Data mining can be used to study the different wave behaviors in coastal areas, which is useful for harbor design.

  xiv. Tunnel Engineering: Variations in the patterns of geological data, stress–strain data of supporting structures, and deformation data of the surrounding rock can be interpreted using data mining.

  xv. Surveying and Geo-Spatial Engineering: Data mining can facilitate the use of new and developing technologies such as GPS, satellite imagery, laser mapping, and fast computing to create complex layers of interconnected geographic information.

  xvi. Geomatics: Data mining is used to deliver spatially referenced information from deposit data, covering the collection, processing, interpretation, and storage of data from mining and extraction plants.

  xvii. Geosciences: Data mining can be applied to petrophysical data, logging data, seismic data, and geological data.

  xviii. Remote Sensing: Data mining is used to extract information from aerial remote sensing imagery for automatic land-cover classification.

  xix. Geographical Information Systems: Data mining can automatically extract knowledge from the raw data of Geographical Information Systems to produce spatial analysis results.

The construction field and its related industries use big data technology to exploit the large quantities of information received and stored through advanced computing systems. Data comes from everywhere: computers, structures, people, sensors, and any device that generates records. In the construction field, every structure that has been erected embodies huge quantities of data [6, 10, 11]. However, managers and civil engineers need to explore the answers hidden in unstructured data. Big data technology gives civil engineers the ability to use unstructured data, which is difficult to collect and analyze manually in any useful way. In fact, data is of little use without proper analysis and verification [10, 12].

2 Application of Data Mining Technology on Construction Project Cost Control System

With the development of construction management, it is essential that costing-related tasks are done correctly. Applying data mining in the development of a cost control framework is highly beneficial for resolving the deficiencies of conventional management [13,14,15].

In addition, cost management programs focus mainly on direct expenses on items rather than on overhead costs. Rational cost and the cost of item quantities are often neglected because much more importance is given to financing [16]. There is great redundancy of data in the information system, which makes it difficult to detect variation in the information. Selecting the important data from a huge volume of data is a common problem. Data mining technology offers a solution to this problem [17].

Data mining (DM) facilitates sorting the important data out of large data stores and extracting relevant patterns so that problems can be solved with ease [18]. The user must have adequate knowledge to explore such databases. As a result, data mining technology is gaining importance wherever multiple costing parameters are involved.

Data mining technology plays a major role in cost management. Several goals that it helps to achieve are as follows:

  • Rapid schedule

  • Reduction in cost

  • Availability of good quality data values

  • On-time targets.

Manual recording of data in a common system made no provision for accessibility, which led to the problem of missing data. As a result, data were classified along three dimensions:

  • Management field

  • Contractors field

  • Construction section.

Initially, the data were stored in one dimension. Three-dimensional projection and storage of the data has now overcome the problem of missing data and made access easier [19]. Figure 1 illustrates the three-dimensional data model.

Fig. 1
A three-dimensional data model diagram presents data with respect to management fields with cost M, schedule M, and quality M, contractors with contractors A, B, and C, and construction sections with sections A, B, and C.

Three-dimensional data model [20]

The management field in the 3D model can be further separated into three sections: (i) cost management, (ii) schedule management, and (iii) quality management. Figure 2 depicts the division of data in the three-dimensional model. Hence, specific information may be consolidated into a new datum [21].

Fig. 2
A three-dimensional data model diagram presents data with respect to management fields with cost M, schedule M, and quality M, contractors with contractors A, B, and C, and construction sections with sections A, B, and C.

Data dividing in three-dimensional model [22]
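As an illustration only, the three-dimensional model of Figs. 1 and 2 could be represented as a store keyed by management field, contractor, and construction section. The following Python sketch is not taken from the cited work: the class `DataCube`, its methods, and the record fields are hypothetical, and the axis values simply mirror the figure labels.

```python
from collections import defaultdict

# Axis values follow the figure labels; everything else is an assumption.
MANAGEMENT_FIELDS = ("cost", "schedule", "quality")
CONTRACTORS = ("A", "B", "C")
SECTIONS = ("A", "B", "C")


class DataCube:
    """Hypothetical store keyed by (management field, contractor, section)."""

    def __init__(self):
        self._cells = defaultdict(list)

    def add(self, field, contractor, section, record):
        """File a record under one cell of the cube, rejecting unknown axes."""
        for value, allowed in ((field, MANAGEMENT_FIELDS),
                               (contractor, CONTRACTORS),
                               (section, SECTIONS)):
            if value not in allowed:
                raise ValueError(f"{value!r} is not one of {allowed}")
        self._cells[(field, contractor, section)].append(record)

    def slice(self, field=None, contractor=None, section=None):
        """Return all records matching the given coordinates (None = any)."""
        wanted = (field, contractor, section)
        result = []
        for key, records in self._cells.items():
            if all(w is None or w == k for w, k in zip(wanted, key)):
                result.extend(records)
        return result

    def missing_cells(self):
        """List the coordinates that hold no record yet."""
        return [(f, c, s)
                for f in MANAGEMENT_FIELDS
                for c in CONTRACTORS
                for s in SECTIONS
                if not self._cells.get((f, c, s))]


# Usage with invented records:
cube = DataCube()
cube.add("cost", "A", "B", {"item": "formwork", "amount": 1200.0})
cube.add("schedule", "A", "B", {"task": "pour slab", "days": 3})
records_for_contractor_a = cube.slice(contractor="A")
empty_cells = cube.missing_cells()
```

Because every record must name all three coordinates, a gap in the data shows up as an empty cell rather than going unnoticed, which is one plausible reading of how the three-dimensional layout helps with missing data.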

3 Application of Big Data in Construction Industry

The application of big data in the construction industry involves three basic processes:

  • Design: Big data is used to analyze the building design and its modeling. It can also help decide where the building should be constructed and with which materials. In addition, old records can be studied to estimate the probability of risks at the construction site.

  • Build: During construction at the site, data collected on weather, traffic, and other activities are interpreted to plan the stages of construction.

  • Operate: The performance of the construction at each level can be tracked by collecting data from sensors built into the super- and substructures of the buildings. Scheduled maintenance can be planned by feeding the sensor data into the building information model.

Handling data becomes harder as its volume increases. Big data analytics can manage this growth, and it can raise inspection alerts well before risk levels are reached [23, 24].

Construction companies collect data daily on their activities and operations. For example, job overheads increase in proportion to the number of change orders issued during the month. Patterns of this kind hidden within the project database can be detected using data mining [25]. The flow chart of the data mining process is shown in Fig. 3.

Fig. 3
A vertical block flow chart represents the flow from data selection to knowledge through the processing of target data, transformation of processed data, data mining of transformed data, and interpretation of patterns.

Flow chart of data mining process [26]
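The stages in Fig. 3 (selection, processing, transformation, data mining, interpretation) can be sketched as a chain of small functions. This is a minimal Python sketch under stated assumptions, not the procedure of any cited work: the function names, the record fields (`type`, `cost`, `area`, `site`), and the sample values are all invented, and the "mining" step is a simple per-group mean standing in for a real algorithm.

```python
from statistics import mean

# Invented sample records; one has a missing cost on purpose.
SAMPLE_RECORDS = [
    {"type": "bridge", "cost": 900.0, "area": 300.0, "site": "X"},
    {"type": "bridge", "cost": 500.0, "area": 100.0, "site": "Y"},
    {"type": "road", "cost": 400.0, "area": 200.0, "site": "Z"},
    {"type": "road", "cost": None, "area": 150.0, "site": "W"},
]


def select(records, columns):
    """Selection: keep only the columns relevant to the question."""
    return [{c: r.get(c) for c in columns} for r in records]


def preprocess(records):
    """Processing: drop records that contain missing values."""
    return [r for r in records if all(v is not None for v in r.values())]


def transform(records):
    """Transformation: derive a cost-per-unit-area feature."""
    return [{"type": r["type"], "unit_cost": r["cost"] / r["area"]}
            for r in records]


def mine(records):
    """Data mining (stand-in): mean unit cost per project type."""
    groups = {}
    for r in records:
        groups.setdefault(r["type"], []).append(r["unit_cost"])
    return {t: mean(values) for t, values in groups.items()}


def interpret(patterns):
    """Interpretation: turn the pattern into a readable statement."""
    top = max(patterns, key=patterns.get)
    return f"Highest mean unit cost: {top}"


def run_pipeline(raw):
    """Run the five stages in the order shown in the flow chart."""
    target = select(raw, ["type", "cost", "area"])
    processed = preprocess(target)
    transformed = transform(processed)
    patterns = mine(transformed)
    return interpret(patterns)


summary = run_pipeline(SAMPLE_RECORDS)
```

Keeping each stage as a separate function mirrors the flow chart: any stage, most obviously `mine`, can be swapped for a clustering or classification routine without touching the others.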

Data mining serves two objectives: insight and prediction. Insight means recognizing structures and patterns in the data. Prediction means producing forecasts based on the data [27]. Many different types of data mining techniques, for example neural networks and conceptual clustering, are each used in particular domains. Data sensing, analysis, and mining can be used to facilitate decision-making and intelligent transportation systems [28].
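To make the prediction objective concrete, the sketch below fits a straight line by ordinary least squares and uses it to forecast an unseen value. It is an assumption-laden illustration rather than a method from the cited references: the functions `fit_line` and `predict` are hypothetical names, and the numbers (monthly change orders against job overhead) are invented to echo the relationship mentioned earlier in this section.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, x):
    """Evaluate the fitted line at a new x value."""
    slope, intercept = model
    return slope * x + intercept


# Invented data: change orders per month vs. job overhead (arbitrary units).
change_orders = [1, 2, 3, 4]
overhead = [12.0, 14.0, 16.0, 18.0]

model = fit_line(change_orders, overhead)
forecast = predict(model, 5)  # overhead expected for five change orders
```

A linear fit is only the simplest possible predictor; the neural networks mentioned above would take its place when the relationship is not linear.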

4 Data Mining in the Field of Transportation Engineering

The study of data mining in the field of transportation has great scope. Data mining has been used to address various problems, such as:

  • Efficient methods for handling large volumes of data with accurate results

  • Methods to avoid over-fitting

  • Handling missing and noisy data

  • Building connections between distinct fields

  • Better user interaction and use of prior knowledge.
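The missing and noisy data problem in the list above can be illustrated with two common cleaning steps: filling gaps with the median of the observed values, and smoothing spikes with a sliding median. This is a sketch under assumptions, not the procedure of the cited studies: the function names are hypothetical and the speed readings are invented.

```python
from statistics import median


def impute_missing(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("no observed values to impute from")
    fill = median(observed)
    return [fill if v is None else v for v in values]


def median_filter(values, window=3):
    """Smooth isolated spikes with a sliding median.

    The window is truncated at both ends of the series.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd number")
    half = window // 2
    return [median(values[max(0, i - half): i + half + 1])
            for i in range(len(values))]


# Invented speed readings (km/h): one gap and one implausible spike.
raw_speeds = [60, None, 62, 250, 61, 59]

filled = impute_missing(raw_speeds)
cleaned = median_filter(filled, window=3)
```

The median is used in both steps because, unlike the mean, it is not pulled toward a single extreme reading such as the 250 km/h spike.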

Specialized forms of data help to monitor cases such as drowsy drivers by sensing their driving state. They also help in designing roadways with the overtaking speed of vehicles and the accident rate in mind [29]. Table 1 lists the data characteristics in GIS applications of transportation engineering.

Table 1 Data characteristics in GIS application of transportation engineering [30]

A pavement management system (PMS) provides huge amounts of data for pavement design: the number of vehicles using a particular pavement, the types of materials used to construct it, the undulations present, and so on. Data provided by GPS help in tracking vehicles and checking for improper lane changes. Geographic information systems (GIS) for transportation bring together interconnected hardware, software, data, people, and the analysis of data, and this integration plays a vital role [31,32,33,34].

5 Data Mining in the Field of Environmental Engineering

Data mining can be used to great effect in the field of environmental engineering. Data mining techniques in environmental studies provide valuable knowledge and useful patterns that help in dealing with environmental problems such as air pollution and water pollution [35, 36]. Some of the beneficial aspects of data mining in environmental engineering are as follows:

  • Prediction of rainfall based on environmental parameters.

  • Rainfall and climate change can be predicted from the patterns obtained by data mining techniques [37, 38].

  • Ecological systems can be modeled by processing ecological and environmental data [39, 40]

  • Better control of air quality and water quality by analyzing environmental data.

  • Better waste management and recycling can be accomplished by discovering patterns through data mining [38, 41].

6 Conclusion

This paper presents various applications in civil engineering where big data plays a major role: project management, resource maintenance, job scheduling, and estimation of the time and cost of civil construction works. Big data analytics together with data mining can make the construction industry more productive and can suggest methods for improving the health of construction. Data available at various construction sites may be gathered to form big data, and data mining methods such as clustering and classification algorithms can be applied for better analysis, decision-making, and prediction of new estimates from past data. Data mining also has huge scope in the field of environmental engineering. It facilitates better control over air and water quality. Weather forecasting and climate change prediction can also be improved.