1 Introduction

Pavement performance and evaluation are essential for Pavement Management Systems (PMS). The condition and functional changes of pavements as they age are referred to as pavement performance [1]. In other words, the ability of pavements to withstand traffic intensity and climatic conditions, functionally and structurally, can be used to assess performance. Pavement structural failure occurs as a result of stresses caused by heavy traffic. When pavements fail to provide a smooth riding surface, this is referred to as functional failure. Uneven pavement causes discomfort for drivers and passengers and raises vehicle operating costs [2].

Several factors can be used to assess the condition and serviceability of pavements, including the Present Serviceability Index (PSI), the Pavement Condition Index (PCI), and the International Roughness Index (IRI). IRI, in particular, is a primary performance measure that highway agencies frequently use to forecast pavement performance. The current study aims to use field survey data to develop IRI prediction modelling using Multiple Linear Regression (MLR), Artificial Neural Network (ANN) and Fuzzy Inference Index (FIS) techniques. The IRI was created to provide a standardized global measurement for comparing pavement roughness. The average rectified slope (accumulated suspension motion to distance travelled) derived from a mathematical model of a standard quarter car passing over a measured profile at 80.5 km/h is defined as the IRI of pavement [3, 4]. The roughness or smoothness of the pavement is a comprehensive assessment indicator that considers the ride quality, comfort of the pavement and the presence of collective distresses. The IRI of the pavement increases as it ages, indicating deterioration.

Several techniques were used for pavement performance modelling, such as multiple linear regression, ANN, and fuzzy logic. ANN for modelling infrastructure deterioration are becoming more popular, and numerous studies have been conducted to evaluate their effectiveness.

According to the U.S. Department of Transportation, pavement ride quality based on IRI can be categorized into five groups, as shown in Table 1 [5].

Table 1 Pavement ride quality based on roughness

2 Literature Review

2.1 Multiple Linear Regression

MLR is commonly used to investigate the relationship between input and output variables. A traditional regression method is a comprehensive tool for analyzing input and output parameters’ relationships. Linear regression is one of the most commonly used statistical techniques [6]. The following equation gives the traditional linear regression model:

$$Y =C+{a}_{1}{X}_{1}+{a}_{2}{X}_{2}+\dots \dots \dots \dots \dots \dots + {a}_{n}{X}_{n},$$
(1)

where Y is the dependent variable, C is the constant, X is the (\({x }_{1}\),\({x }_{2}\),…,\({x }_{n}\)) independent variable, and \({a}_{1},{a}_{2},{a}_{3}\dots \dots \dots \dots {a}_{n}\)= coefficients.

According to Okine et al. [7], a technique for IRI prediction using Multivariate Adaptive Regression Splines (MARS) allowed them to determine the relative importance of pavement conditions, traffic, and environmental parameters.

Choi et al. [8] used the MLR method to predict the IRI model based on asphalt concrete properties. The model considered Asphalt Concrete (AC), the percent passing no 200 seize (P200) as input factors. According to Wang et al. [9], the Mechanistic-Empirical Pavement Design Guide employs IRI as a primary mode of assessing pavement condition and as one of the primary functional performance indicators (MEPDG).

Owolabi et al. [10] used the MLR method to predict performance models. The models took into account the input factors of fatigue cracking, longitudinal cracks, patching, potholes, and rut depth. The MLR method was used by Khattak et al. [11] to develop IRI models for hot mix asphalt.

Mubaraki [12] investigated the relationship between IRI and pavement damage on the highway connecting Jeddah and Jazan in Saudi Arabia. The study's findings revealed a significant relationship between IRI and cracking and IRI and ravelling. The results also show that rutting has no significant association with IRI values.

2.2 Artificial Neural Network (ANN)

ANNs are recent computational models defined in analogy with biological characteristics to simulate the decision process in the brain. This technique helps approximate and estimate unknown functions depending on various and numerous input values. One of the most important characteristics of this method is that it represents a method for solving extremely complex and nonlinear problems using only a few simple mathematical operations [13].

Machine Learning (ML) techniques are one of the most important methods for predicting pavement performance indicators. Chen et al. [14] genetic algorithms programming, gray forecast, and multiple regression are helpful tools for predicting IRI distress.

According to Hoang et al. [15, 16], seven methods of ML were used to investigate pavement distress, Naïve Bayesian Classifier (NBC), ANN, Random Forest (RF), Support Vector Machine (SVM), Radial Basis Function Neural Network (RBFNN), and Classification Tree (CT). Nabipour et al. [17] applied Genetic Expression Programming (GEP) and SVM techniques to predict the remaining service life pavement.

Ceylan et al. [18] published a study on ANN in pavement engineering. According to Terzi [19], predicting PSI with pavement distress is possible. Kirbasß and Karasßahin [20] studied ANN and MLR for calculating pavement performance models based on PCI. Platei et al. [21] used ANNs to assess pavement structural conditions based on FWD data. Other significant approaches to the study of compaction factors have been proposed in infrastructure engineering [22], evaluating drivers’ perception of conditions [23], assessing maintenance costs and prioritizing maintenance [24], and analyzing asphalt binders' ageing characteristics [25].

Amin MSR and Jiménez [26] published a case study of the Montreal road network. They considered simulated traffic over five decades. They also factored in the uncertainty of pavement performance in their calculations. They also used the backpropagation neural network (BPN) with the generalized delta rule learning algorithm, where uncertainties were ignored. Annual Average Daily Traffic (AADT), Equivalent Single Axle Loads (ESAL), and the pavement condition index were used to calculate the estimated PCI values. Marcelino et al. [27] applied the RF technique to predict IRI.

Vyas et al. [28] attempted to develop correlations using a variety of environmental structural, subgrade soil attributes, and functional inputs to estimate two different deflection basin parameters. The authors investigated a 124-km-long pavement network. They trained various ANN models with multiple architectures, each with one or more hidden layers. They compared the results of the ANN to those of MLR models and other intelligent techniques, and the ANN performed better than the other techniques.

Nitsche et al. [29] attempted to predict Weighted Longitudinal Profile (WLP) indices. Their goal was to assess the effectiveness of these techniques in estimating range and standard deviation.

Younos et al. [30] developed a pavement condition index prediction model that considers climatic parameters, pavement thickness, weighted plasticity index, and traffic loading. Linear regression and neural networks were the two methods utilized (ANN). The \({R}^{2}\) for the regression model was 0.80, while the \({R}^{2}\) for the ANN model was 0.88. Inkoom et al. [31] used various ML techniques, including bootstrap forest, gradient-boosted trees, K nearest neighbours, nave Bayes, and multiple linear regression.

Researchers proposed using a Multi-Stage Hyper-Heuristic (CMS-HH) approach to address specific combinatorial optimization issues, and they applied a self-learning discrete Jaya algorithm (SD-Jaya) to address the energy-efficient distributed no-idle Flow-shop Scheduling Problem (FSP) [32, 33].

A neural network is an interconnected group of artificial neurons that uses a connectionist approach to computation to process information. ANN is an adaptive system that changes its structure in response to external or internal information flowing through the network during the learning phase. Modern neural networks are tools for nonlinear statistical data modelling. They are used to model complex relationships between inputs and outputs and discover patterns in data. A series of layers typically represent them. Figure 1 shows that these layers usually consist of an input layer, several hidden layers, and an output layer. As shown in Fig. 1, each neuron in the ANN functions as a processing unit, the inputs are received, and the output is turned over to the next layer [34]. In each layer, neurons are connected to neurons in the next layer. Ann’s nonlinear relationship between input and output layers requires a function to make a correlation between the neurons. Three transfer functions provide computation between neurons of different layers: Log-Sigmoid, Tan-Sigmoid, and Linear [35]. The following equations describe these functions:

Fig. 1
figure 1

Schematic representation of ANNs

$$\mathrm{logsig}\left(x\right)=\frac{1}{1+{e}^{-x}},$$
(2)
$$\mathrm{tansig}\left(x\right)=\frac{2}{1+{e}^{-2x}}-1,$$
(3)
$$\mathrm{purelin}\left(x\right)=x.$$
(4)

The processing of each neuron is simply a weighted summation that is transferred via the activation function, as shown in Eq. (4).

$${Y}_{j}=f\sum_{i=1}^{n}{x}_{i}{w}_{ij},$$
(5)

where \({Y}_{j}\) is the output of j the neuron, f is the activation function, n is the total number of inputs in this layer, \({x}_{i}\) is i the input, \(\mathrm{and}\) \({w}_{ij}\) is the connection weight between ith input and the neuron.

2.3 Fuzzy Inference System (FIS)

The term “fuzzy” refers to a lack of clarity, and this fuzziness is caused by modelling the most similar human inference using a complex mathematical pattern. A fuzzy system converts human knowledge into mathematical formulas.

This critical activity uses linguistic variables, “if–then” fuzzy rules, and a mapping system (fuzzy engine). Knowledge and rules underpin fuzzy systems [36]. A fuzzy system’s heart is a knowledge base comprised of fuzzy “if–then” rules. The first step in creating a fuzzy system is collecting a set of fuzzy “if–then” rules from expert knowledge or studying literature in the related field. The following step combines these rules into mathematical forms [37].

Fuzzy logic is one of the methods used in handling the uncertainties in the model or the data. FIS are based on fuzzy rules called fuzzy “if–then” rules. In some resources, instead of FIS, terms such as fuzzy model, fuzzy associative memory, and fuzzy logic controller are also used [38]. Fuzzy systems employ fuzzy sets to convert input variables to output variables [39]. These systems are especially useful for incorporating human experiences and behavioural data into the model. The model’s variables are expressed in this way using fuzzy subsets. Fuzzy set operations are used for the inference under consideration. By generalizing classical set operations, these operations are obtained. A general fuzzy system has five layers, as shown in Fig. 2. The FIS structure has five layers, as follows:

Fig. 2
figure 2

Schematic representation of a fuzzy system

2.3.1 Fuzzification Layer

This layer consists of defined membership functions of the input parameters. The result is a degree of membership value calculated using a Gaussian membership function [40].

$${\mu }_{Ai}\left(x\right)=\mathrm{exp}\left\lfloor -{\left(\frac{x-{c}_{i}}{2{\delta }_{i}^{2}}\right)}^{2}\right\rfloor ,$$
(6)

where \({c}_{i},{\delta }_{i}\) are parameters of a membership function.

2.3.2 Rule Layer

This layer applies fuzzy AND to the previous part of the fuzzy rules.

$${\omega }_{i}={\mu }_{Ai}\left(x\right)* {\mu }_{Bi}\left(x\right).$$
(7)

2.3.3 Normalized Layer

This layer normalized the membership functions.

$$\overline{{\omega }_{i}}=\frac{{\omega }_{i}}{\sum_{i}{\omega }_{i}}.$$
(8)

2.3.4 Defuzzification Layer

In this layer, the outcome of the fuzzy rules is applied.

$$\overline{{\omega }_{i}}{f}_{i}=\overline{{\omega }_{i}}\left({p}_{i}x+{q}_{i}y+{r}_{i}\right).$$
(9)

\({p}_{i},{q}_{i}, and {r}_{i .}\mathrm{are linear variables}\).

2.3.5 Output Layer

This layer is calculated by summing up the outputs of previous layers.

$$\sum_{i}\overline{{\omega }_{i}}{f}_{i}=\frac{\sum_{i}\overline{{\omega }_{i}}{f}_{i}}{\sum_{i}{\omega }_{i}}.$$
(10)

2.4 Assessment of Developed Models

To examine the strengths and weaknesses of the developed models for three techniques, MLR, ANNs, and FIS., the predicted IRI values were compared with the observed values concerning the determination coefficient of (\({R}^{2}\)), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE). Thus, a higher \({R }^{2}\) value was considered a better fit of the development data set. The mathematical representation of the three implemented measures is shown in the following equations:

$${{\varvec{R}}}^{2}=1-\frac{\sum_{{\varvec{i}}}{{({\varvec{t}}}_{{\varvec{i}}}-{{\varvec{o}}}_{{\varvec{i}}})}^{2}}{\sum\nolimits_{{\varvec{i}}}({{{\varvec{o}}}_{{\varvec{i}}})}^{2}},$$
(11)
$$MAE=\frac{1}{{\varvec{n}}}\sum _{{\varvec{i}}}^{{\varvec{n}}}\left|{{\varvec{t}}}_{{\varvec{i}}}-{{\varvec{o}}}_{{\varvec{i}}}\right|,$$
(12)
$$RMSE=\sqrt{\frac{\sum_{{\varvec{i}}}({{\varvec{t}}}_{{\varvec{i}}}-{{\varvec{o}}}_{{\varvec{i}}}{)}^{2}}{{\varvec{n}}}},$$
(13)

where: \({o}_{i}\) is the actual value observation I, \({t}_{i}\) is the predicted value of observation I, n is the number of observations.

2.5 Research Goal and Methodology

This study focuses on the modelling and evaluation of roughness in flexible pavements based on pavement distress. This study aims to develop a new technique for replacing the conventional methods of determining IRI values. These include determining the IRI of flexible pavement and establishing credible prediction models for St. John’s roads based on data gathered over the last few years. The IRI values were measured using a smartphone application called “TotalPave.”

In this paper, the authors propose techniques based on MLR, ANN for predicting the IRI values based on pavement distress. Models named FIS were used to compare with MLR and ANN models. In detail, with data obtained from the field survey, the authors have divided the work into five phases, as follows:

  • Collection of the pavement distress parameters for 19 roads (37 sections),

  • Analysis of data by using the MLR technique,

  • Analysis of data by using the ANN technique,

  • Analysis of data by using the FIS technique, and

  • Comparison and validation of the FIS, MLR, and ANNs models.

Eight pavement distress variables were studied: rutting, fatigue cracking, block cracking, longitudinal cracking, transverse cracking, potholes, patching, and delamination.

3 Study Area Location and Data Collection

The city of St. John's has a road network of more than 1000 kms of paved road. These roads are critical for the quick and safe flow of products and people in and out of St. John's.

The data collected on St. John's roads used IRI to assess pavement performance based on pavement distress. The information gathered aids in the development of models that anticipate pavement conditions. To develop flexible pavement performance and meet the goals of this research, researchers conducted a complete field investigation of pavement conditions for different roadways. The surveys covered 19 roads (37 sections) in St. John's (wet freeze climate), with pavement conditions ranging from very poor to very good in the selected sections. The survey was conducted for asphalt concrete pavement types. In this research, approximately 58 km of road length was studied, including nine significant roads (36.2 km), eight minor roads (13.5 km), and one highway (8.6 km). A smartphone application called “TotalPave” was used to collect IRI data. This program can detect vertical movement caused by the road’s rough surface and calculate the IRI value. The key motive for using this programme in the study was its property of requiring no pre-or post-processing for the pavement distress data gathered. Furthermore, TotalPave is simple to use and has a reasonable cost. The TotalPave app was installed on a smartphone and then put on the vehicle's windshield using a mobile phone holder to collect IRI data. The phone was confirmed to have experienced some bumping and vibration. The vehicle was driven at a speed of 20–80 km/h throughout data collection, according to TotalPave user instructions. Table 2 provides a descriptive summary of the St. John’s road network chosen for this study for 2018 and 2021. Table 3 illustrates sample data of eight distress parameters (Fig. 3).

Table 2 Details of study section
Table 3 Sample data of pavement distress
Fig. 3
figure 3

Map of the road network of the St. John’s

Ali et al. performed a distress survey on some road sections in St. John's, which was published at the 2021 Journal of Transportation Engineering, Part B: Pavements. They also studied some roads other than the sections considered in the current analysis, and the survey was presented at the 2018 Conference of the Canadian Society for Civil Engineering (CSCE) ([41, 42]. Figure 4 shows example photographs of some of the city’s roads.

Fig. 4
figure 4

Representative photo showing different distress types in pavement sections (Image by Abdualmtalab Ali)

3.1 MLR Model Development

A MLR was used to determine the relationship between pavement distress parameters and the asphalt pavement performance index (IRI). This study developed two prediction models using the MLR technique from the collected data. IBM's SPSS Statistics package (IBM 27) was used to analyze the data of 19 roads (37 sections) from the field survey data. Equation (14) shows a basic formula for the prediction models to find the influence of pavement distress parameters for IRI.

$$\mathrm{IRI}=\mathrm{C}+{a}_{1}{X}_{1}+{a}_{2}{X}_{2}+{a}_{3}{X}_{3}+{a}_{4}{X}_{4}+{a}_{5}{X}_{5}+{a}_{6}{X}_{6}+{a}_{7}{X}_{7}+{a}_{8}{X}_{8},$$
(14)

where IRI is the International Roughness Index, C is the constant, X1 is the rutting, X2 is the fatigue, \({X }_{3}\) is the block cracking, \({X }_{4}\) is the longitudinal cracking, \({X }_{5}\) is the transverse cracking, \({X }_{6}\) is the patching, \({X }_{7}\) is the potholes, \({X }_{8}\) is the delamination, and \({a}_{1},{a}_{2},{a}_{3}\dots \dots \dots \dots {a}_{9}\) are the coefficients.

The IRI regression models are presented in Eqs. (15) and (16), which consider eight surface pavement distresses: rutting, fatigue cracking, block cracking, longitudinal cracking, transverse cracking, potholes, patching, and delamination, as input variables and IRI output variables.

  • MLR (2018)

Table 4 shows two IRI regression models developed based on surface pavement distress data. Rutting, fatigue cracking, block cracking, transverse cracking, and potholes have been adversely correlated with the \({\mathbf{I}\mathbf{R}\mathbf{I}}_{2018}\) model. Longitudinal cracking, patching, and delamination have been positively correlated with the \({\mathbf{I}\mathbf{R}\mathbf{I}}_{2018}\) model. The association between IRI and surface pavement distress was described as follows in Eq. (15):

Table 4 IRI models based on surface pavement distress data
$${{\varvec{I}}{\varvec{R}}{\varvec{I}}}_{2018}=3.58-0.06{\times }_{1}-0.12{\times }_{2}-0.03{\times }_{3}+0.03{\times }_{4}-0.02{\times }_{5}+0.01{\times }_{6}-0.01{\times }_{7}+0.08{\times }_{8}.$$
(15)

The correlation coefficient (\({R}^{2}\)) of this relationship is 39%.

  • MLR (2021)

\({{\varvec{I}}{\varvec{R}}{\varvec{I}}\boldsymbol{ }}_{2021}\) model has been negatively correlated with rutting, block cracking, longitudinal cracking, patching, and potholes. Fatigue cracking, transverse cracking, and delamination have been positively is associated with the \({{\varvec{I}}{\varvec{R}}{\varvec{I}}\boldsymbol{ }}_{2021}\) model. The association between IRI and surface pavement distress was illustrated as follows in Eq. (16):

$${{\varvec{I}}{\varvec{R}}{\varvec{I}}}_{2021}=4.006-0.078{\times }_{1}+0.194{\times }_{2}-0.222{\times }_{3}-0.067{\times }_{4}+0.081{\times }_{5}-0.004{\times }_{6}-0.014{\times }_{7}+0.015{\times }_{8}.$$
(16)

The correlation coefficient (\({R}^{2}\)) of this relationship is 53.2%.

3.2 Cross-validation Evaluating Model Performance

The cross-validation technique evaluates the model’s accuracy across various data. 80% of the data samples for each subgroup (as shown in Table 2) are chosen at random to create deterioration models. The remaining 20% of data samples are used to assess the precision of prediction models.

After the validation test, Table 5 illustrates the reduction in \({R}^{2}\), RMSE, and MAE values for all sections. Figures 5 and 6 present the errors and linear relations for (2018 and 2021).

Table 5 Validation results of prediction models
Fig. 5
figure 5

Accuracy of the prediction IRI values (2018)

Fig. 6
figure 6

Accuracy of the prediction IRI values (2021)

The following conclusions can be drawn from Table 5, Figs. 5 and 6:

  • IRI (2018): The results showed that the reduction in \({R}^{2}\), RMSE and MAE values is insignificant. The reductions in accuracy are 8.21%, 16.52%, and 20.17%, respectively. As a result, the MLR technique has a good ability to predict IRI models reliably.

  • IRI (2021): The results showed that the reduction in \({R}^{2}\), RMSE and MAE values are insignificant. The reductions in accuracy are 12.6%, 6.36%, and 1.31%, respectively. As a result, the MLR technique has a good ability to predict IRI models reliably.

3.3 ANN Model Development

The primary goal of model development is to predict pavement performance. The models were developed using MATLAB version R2021b. To acquire the best results, three ANN architectures were investigated: A (8-8-8-8-1-1), B (8-10-10-10-1-1), and C (8-20-10-10-1-1). Each network architecture is composed of eight inputs, three hidden layers, and one output layer. The data were divided into three categories, 70% of the data as training, 15% of the data as testing, and 15% of the data as validation. The ANNs architecture (A), (B), and (C) are shown in Figs. 7, 8, and 9; the best results were obtained with ANN architecture (B), and the results showed a good ability of the pavement distress models. Statistical metrics such as maximum \({R}^{2}\) values and minimum values of (RMSE) and (MAE) contributed to selecting the best ANN network. The findings of the ANNs approach are summarised in Table 6. The highest \({R}^{2}\) values were 99.2% and 99.5%, and the lowest RMSE and MAE values were (0.276), (0.234), (0.16), and (0.16) for 2018 and 2021, respectively. Figures 10 and 11 present the ANNs prediction results for IRI.

Fig. 7
figure 7

Architecture of ANNs model (Model A)

Fig. 8
figure 8

Architecture of ANNs model (Model B)

Fig. 9
figure 9

Architecture of ANNs model (Model C)

Table 6 Summary of IRI models of ANNs developed
Fig. 10
figure 10

ANNs model goodness-of-fit results for IRI values (2018)

Fig. 11
figure 11

ANNs model goodness-of-fit results for IRI values (2021)

3.4 FIS Model Development

The primary goal of using the FIS technique is to predict pavement performance. The models were developed using MATLAB version R2021b. The FIS technique was used on 19 roads (37 sections) in St. John’s. The fuzzy model employed the severity of the degradation as the input variable to generate IRI models, while IRI was the output variable. The acquired data from the field survey were used for this. The FIS technique has three main steps, the Fuzzification, Normalisation, and Defuzzification modules. Figure 12 demonstrates the main steps of the FIS technique. In the fuzzy modelling study, eight independent variables and one dependent (IRI) variable were considered inputs and outputs.

Fig. 12
figure 12

Diagram of a pavement classification on FIS

3.5 Data Pre-processing and Membership Functions

The fuzzy model was constructed with eight independent parameters based on pavement distress and one output (IRI) after collecting and revising data for 19 roads in St. John’s.

All input variables’ membership functions are classified as Minimal, Moderate, or Severe. The IRI membership functions for the output variables are Poor, Mediocre, Fair, Good, and Very Good. Table 7 shows pavement distress types and the number of membership functions to evaluate IRI.

Table 7 Distress types and number of membership functions to evaluate IRI

3.6 Fuzzy Rule Generation

The main challenge of FIS during the second phase is developing the rules. The generation rules for the classification model described in this paper are difficult and complex, because they have eight inputs and one output. Table 8 Rule base was created for FIS and IRI.

Table 8 Fuzzy rules for IRI by 19 road (37 sections)

3.7 The Results of Pavement Section Classification

The system evaluated two section datasets, each with 19 roads (37 sections), and this technique generated membership functions and rules by measuring the efficiency of fuzzy pavement classification. To display the level of agreement of the IRI values, four defuzzified methods (Centroid, Bisector, SOM, and Lom) were used to find the \({R}^{2}\), RMSE, and MAE. Table 9 shows the level of agreement of the IRI values using four defuzzified methods. Figures 13 and 14 depict the relationship between observed and fuzzified IRI for 37 sections.

Table 9 Assessment of various fuzzy inference systems' configurations for 19 roads (37 sections)
Fig. 13
figure 13

Performance of a fuzzy inference system (2018)

Fig. 14
figure 14

Performance of a fuzzy inference system (2021)

The FIS technique developed a section classification model for flexible pavement. An IRI was considered a FIS output, and the eight pavement distresses were considered inputs. The method was then developed in two phases for two data groups. The first phase involved creating fuzzy partitions for inputs and outputs applied a clustering algorithm. The second phase involved using four defuzzified methods to generate fuzzy rules from numerical data. The results presented that the fuzzy pavement classification has good accuracy compared to IRI calculated by MLR.

The following conclusions can be drawn from the goodness of fit statistics of the 19 road (37 sections) in Table 9:

  • Centroid Method: The statistical indicators of \({R}^{2}\), RMSE, and MAE for the 2021 sections improved by 0.226%, 5.30%, and 8.30%, respectively, when compared to 2018.

  • Bisector Method: The statistical indicators of RMSE and MAE for the 2021 sections improved by 20% and 21.41%, respectively, compared to 2018, while \({R}^{2}\) is decreased by 1.02% for 2021 compared to 2018.

  • Lom Method: The statistical indicators of RMSE and MAE for the 2021 sections improved by 1.34% and 2.88%, respectively, compared to 2018, while \({R}^{2}\) decreased by 1.93% for 2021, compared to 2018.

  • Som Method: The statistical indicators of \({R}^{2}\), RMSE, and MAE for the 2021 sections improved by 1.03%, 35.52%, and 45.92%, respectively, compared to 2018.

  • According to the results, the centroid method is more accurate for 2018 and 2021 (\({\mathrm{R}}^{2}\)= 88.3% and 88.5%, RMSE = 0.567% and 0.537%, and MAE = 0.446% and 0.411%) than other methods.

3.8 Comparison and Validation of MLR, ANNs, and FIS Models

To validate the models developed in this study, all models were compared using MLR, FIS, and ANNs techniques. Table 10 shows that the models' performance was measured and compared using \({R}^{2}\), RMSE, and MAE values. Figures 15 and 16 show the comparison among the three techniques.

Table 10 Comparison among MLR, FIS, and ANNs models
Fig. 15
figure 15

Fitness of MLR, FIS, and ANNs models to IRI prediction (2018)

Fig. 16
figure 16

Fitness of MLR and ANNs models to IRI prediction (2021)

Several conclusions can be drawn from Table 10, and Figs. 15 and 16:

  • IRI (2018):

  • According to the statistics, the \({R}^{2}\) value for the ANNs model was higher than the \({R}^{2}\) values of the FIS and the MLR models by 10.98% and 60.7%, respectively.

  • The RMSE value of the ANNs model was less than the RMSE values of the FIS and MLR models by 51.32% and 73.6%, respectively.

  • The MAE value of the ANNs model was less than the MAE values of the FIS and MLR models by 47.5% and 71.7%, respectively.

  • IRI (2021):

  • According to the statistics, the \({R}^{2}\) value of the ANNs model was higher than the \({R}^{2}\) values of the FIS and the MLR models by 46.5% and 11.01%, respectively.

  • The RMSE value of the ANNs model was less than the RMSE values of the FIS and MLR models by 70.4% and 78.7%, respectively.

  • The MAE value of the ANNs model was less than the MAE values of the FIS and MLR models by 61.1% and 73.6%, respectively.

3.9 Model Sensitivity Analysis

A sensitivity analysis was performed to investigate the effect of input variables on the efficiency of prediction IRI models. A sensitivity analysis was conducted by creating prediction models that considered each individual input while ignoring the effect of other factors. The results of the sensitivity analysis presented in Table 11, and Fig. 17 were as follows:

Table 11 Sensitivity analysis of prediction models
Fig. 17
figure 17

Results of the sensitivity analysis of MLR (2018 and 2021)

IRI (2018): Rutting has the most effect on the prediction model when compared to other variables. Fatigue cracking, longitudinal cracking, patching, and delamination have some influences on the IRI model, whereas block cracking and potholes have minor effects.

IRI (2021): Longitudinal and fatigue cracking had the most effect on the prediction model compared to other variables. Rutting, block cracking, patching, and delamination have some influences on the IRI model, while transverse cracking and potholes have minor effects.

3.10 Chi-Square Goodness of Fit Test

A chi-square goodness of fit test was performed for St. John’s 19 roads (37 sections). The goodness of fit measures summarises the difference between measured and predicted values. A goodness of fit between observed and predicted frequencies is based on the quantity presented in the Eq. (17):

$${X}^{2}=\sum_{i=1}^{k}\frac{{({o}_{i}-{e}_{i})}^{2}}{{e}_{i}},$$
(17)

where \({X}^{2}\) is a random value of a variable whose sampling distribution is approximated very closely by the chi-squared distribution with \(\nu\) = k − 1 degree of freedom; and \({o}_{i}\) and \({e}_{i}\) are the observed and predicted frequencies, respectively.

A higher chi-square value indicates that the measured frequencies differ significantly from the predicted frequencies and that the fit is poor. If the observed and predicted frequencies are low, the chi-square value will be small, and the fit will be good. A good fit leads to the null hypothesis (\({H}_{o}\)) being accepted, while a poor fit leads to its rejection. Table 12 summarises the goodness of fit test analysis; the null hypothesis is accepted, because \({X}^{2}\) is less than its critical value with a 95% confidence level for 19 roads (37 sections).

Table 12 Chi-Square fit goodness test for St. John’s

4 Conclusions

The soft computing techniques help demonstrate the correlation between surface pavement distress and IRI. This paper aimed to evaluate the performance of MLR, ANNs, and FIS techniques predicting pavement performance. This case study investigated the relationship between the (IRI) indicator in asphalt pavements and eight independent variables, using 19 roads (37 sections) in 2018 and 2021 in St. John’s, Newfoundland, Canada. With performance indicator data (IRI), eight surface pavement distress types were collected, including rutting, fatigue block cracking, longitudinal cracking, transverse cracking, potholes, patching, and delamination, with performance indicator data (IRI). The notable findings of this research are summarized below:

  • Table 2 indicated that 81% and 84% of road sections were classified as poor in (2018) and (2021), respectively.

  • The ANNs technique predicted the pavement roughness with more accuracy and the lowest errors considerably compared to the FIS techniques (2018 and 2021); a coefficient of determination indicated that the correlation was increased by 60.7% and 46.5%, respectively, while RMSE was reduced by 51.32% and 70.4%, respectively, MAE was decreased by 47.5% and 61.1%, respectively.

  • The FIS technique predicted the pavement roughness with more accuracy and the lowest errors considerably compared to the MLR techniques (2018 and 2021); a coefficient of determination indicated that the correlation was increased by 55.83% and 39.89%, respectively, while RMSE was reduced by 45.80% and 28.1%, respectively, MAE was decreased by 46.07% and 32.06%, respectively.

  • From the sensitivity analysis, it is concluded that rutting, fatigue cracking, longitudinal cracking, patching, and delamination are having highest influences on the prediction model for (2018), longitudinal, fatigue cracking, rutting, block cracking, patching, and delamination are effects on the prediction model (2021).