1 Introduction

The road network in India comprises different classes of roads as follows: National Highways (1.94%), State Highways (2.97%), District Roads (9.94%), Rural Roads (70.65%), Urban Roads (8.93%), Project Roads (5.58%). Road transport accounts for 64.5 percent of all products carried in India, and 90 percent of all passenger travel [1, 2]. The data show that the share of rural roads (70.65%) is the highest, followed by district roads and urban roads. In the Union Budget 2020–21, the Indian government has set out USD 12.24 billion for the Ministry of Road Transport and Highways (MoRTH). The Indian government has allocated USD 148 billion for the National Infrastructure Pipeline in the fiscal year 2019–25. The roads sector is expected to contribute 18 percent of capital spending in the 2019–25 fiscal year. In addition, in the Union Budget 2020–21, the Indian government has set out USD 2.6 billion for Pradhan Mantri Gram Sadak Yojana (PMGSY). The Pradhan Mantri Gram Sadak Yojana (PMGSY) is a nationwide plan in India to provide good all-weather road connectivity to unconnected villages [3, 4]. According to the 11th five-year plan working report on rural roads, the current replacement cost of the existing rural road network in India is projected to be USD 26.67 billion. The subsequent loss in value of road assets due to non-maintenance could be as high as USD 1.33 billion per year, equating to 50,000 km of the road being eroded every year [5]. As a result, the road assets created must be maintained during their design life. The guidelines for deciding the interval and type of maintenance activities for rural roads as per Indian Roads Congress (IRC) SP-20: Rural Roads Manual (2002) are on a subjective basis and mainly based on the experience of field engineers, and without any economic analysis being done [6]. This caused them to seek applicable solutions for road maintenance and rehabilitation problems, keeping resource constraints in mind. A step in this approach is the creation of a Pavement Management System (PMS) for the various conditions that exist in the nation.

Most pavement management systems (PMS) contain a section for pavement maintenance (PMMS), which may include models for determining the most cost-effective treatment. The features of the surface, the type of pavement, and other key variables generally influence them. The right maintenance treatment must be done at the appropriate time for the pavement to operate as intended and for the maintenance program to be cost-effective [7]. Due to advancements in information technologies, Pavement Management Systems (PMS) and Pavement Maintenance Management Systems (PMMS) have significantly improved over the last twenty years. Highway agencies today have a range of tools and systems at their disposal to assist them to make the most of the highway construction, maintenance, and rehabilitation services available. The Highway Development and Management (HDM-4) tool is one of the analytical tools available [8].

1.1 Highway Development and Management (HDM-4)

The World Bank has made it essential for its clients to have their highway projects assessed using the HDM-4 before receiving a loan. As a result, the HDM-4 program has become the legally recognized world standard decision support system for pavement management [9]. The Ministry of Shipping, Road Transport and Highways (MoSRTH) (currently MoRTH) of the Government of India has also called for the use of HDM-4 for a comprehensive evaluation of pavement deterioration and economic returns on highway investments to ensure the best possible use of funds for India's National Highway Network and the National Highway Authority of India (NHAI) is designing a Road Information Management System (RIMS) for the country's National Highway (NH) network using HDM-4 as well [10]. Relationships for modelling pavement deterioration and road works effects are included in HDM-4. These are used to forecast annual road conditions and analyze highway construction and maintenance strategies. The HDM-4 road deterioration models aim to simulate the dynamic relationship among vehicles, the atmosphere, pavement design, and surface characteristics [11] Five distinct distress models, given below, are used to forecast road deterioration in this study. Initiation and progression are two stages that describe the first three distress modes. The initiation phase is the period between the onset of surfacing distress of a particular mode or severity and the appearance of surfacing distress. The progression phase is defined as the period where the extent and magnitude of the distress grows.

1.2 Pavement Deterioration Models

Predictions of structural and functional pavement performance, as well as other inputs, aid in the creation of reasonable maintenance budgets and the selection of the best maintenance strategies. In a Pavement Maintenance and Management System (PMMS), a pavement deterioration model, which forecasts pavement performance, is a significant input. For several years, administrators and transportation engineers all over the world have focused on pavement performance models built mostly in Europe and North America for pavement design, construction, and reconstruction. These models were created using the results of a few groundbreaking research projects that were undertaken because of the continents' comprehensive transportation growth and paved highway network. However, since these models are developed from data for local environments, they have certain inherent drawbacks and are not often explicitly transferable for global use, unless their appropriate validation has been reported [12]. At present, no study has been carried out which manifests the contrast between urban and rural scenarios and effectively models the pavement deterioration in these conditions using primary (field studies) and secondary data [official records of concerned Public Works Department (PWDs), In-charge of maintenance of these roads, Toll booths, Police interceptors and other Urban local bodies (ULBs)]. A comparison of the efficacy of HDM-4 and the locally developed pavement deterioration models was not addressed in any of the preceding research. In this study, pavement deterioration models were developed for predicting the following distresses: Cracking percentage; Ravelling percentage; No. of Potholes; Edge break (only for rural sections), and Rut depth (only for urban sections).

These models for urban and rural roads were developed in MATLAB based on the large volume of field data collected viz., automated as well as the manual method of field evaluation for use in identifying and prioritizing resurfacing needs. The appropriate organizations and departments provided detailed information on the pavement layer composition, design, and traffic data for these sections. For the surface condition data of the pavement, these sections were constantly monitored for 5 years (from 2017 to the end of 2021). All these data were also utilized to calibrate and validate HDM-4 distress models. The flowchart given in Fig. 1 outlines the methodology proposed for the study.

Fig. 1
figure 1

Flowchart for study methodology

2 Literature Review

Most countries around the world are working on various pavement performance models for use in preparing pavement maintenance plans as part of various pavement performance studies. For roads in India, there are no historical long-term pavement performance data or pavement deterioration models to estimate pavement performance. As a result, it is essential to develop pavement deterioration models for urban and rural roads, which would help recommend appropriate maintenance solutions for the long-term protection of this type of road network. This scientifically based methodology would also assist a developed nation like India in making the best use of its limited maintenance budget [13, 14]. Most of the studies on HDM-4 are associated with the calibration of its distress models according to local conditions and the development of PMMS using calibrated HDM-4 models [8, 9, 15,16,17,18,19,20,21,22,23,24,25,26,27,28]. Many researchers have also exploited the advantages of other soft computing techniques in PMMS. Chandran et al. (2007) have prioritized low-volume pavement sections for maintenance using Fuzzy Logic [29]. Alsugair et. al. (1998), Mathew et. al. (2008), Shekharan et. al. (2000), and Ashraf and George (2000) employed ANN to evaluate the appropriate maintenance and repair choice for road segments in a network [30,31,32,33].

Ministry of Surface Transport (MOST) (currently MoRTH) of the Indian government, funded the Pavement Performance Study (PPS) at the Central Road Research Institute (CRRI) in New Delhi in 1986 to estimate data for a transportation cost model in Indian circumstances. These models were the first of their type and were based on long-term pavement performance data in Indian settings [34]. The details of the study by CRRI on Existing pavement sections (EPS) and the models developed are described in the study by Sood et al. (1994). The models in this study are based on typical traffic, environmental, and pavement conditions seen on India's major roads [35]. Reddy et al. (2002) investigated the cost allocation of flexible pavement maintenance and strengthening options based on performance. Pavement performance models were built utilizing data collected for government-sponsored Pavement Deterioration Models (PDM) research programs [36]. The structural and roughness deterioration models were created by Jain et al. (1994) to determine the necessity for flexible pavement maintenance management. The data from additional test sections were then used to calibrate and validate these models [37]. Kumar and Patel (2009) used deflection and roughness data on 18 Pradhan Mantri Gram Sadak Yojana (PMGSY) sections in Uttarakhand and Uttar Pradesh to develop pavement deterioration models for rural roads in India [38]. Rastogi et al. (2011) created pavement performance models for low-volume pavements in the Indian states of Uttarakhand and Uttar Pradesh. For the selected 18 portions, structural and functional responses were constantly assessed for 2 years [39]. Sandra and Sarkar (2012) created a model to estimate the association between roughness and typical road deteriorations such as rutting, ravelling, potholes, cracking, and patching on Indian roads [40].

In the Kenyan study by Hodges et al. (1975) and Patterson (1987), 46 test sections were chosen for monitoring and analysis to link deterioration to maintenance, traffic, environment, and material type [41, 42]. Similarly in the United States, Luhr and McCullough (1983) used a pavement management system called the 'Pavement Design and Management System’ (PDMS) to conduct an economic analysis comparing three surfacing types (aggregate, surface treatment, and asphalt) at six different traffic levels for low-volume roads (5–200 vehicles/day in single design lane) and calculating total costs for each case [43]. Data for the formulation of empirical models were collected by monitoring the state of existing in-service roads in the Brazil-UNDP road costs research by Geipot (1982), and Patterson (1987). The factorial foundation for choosing the forty-eight sections was identical to that used in the Kenyan study, but there were a few key modifications targeted at improving the data interpretation [42, 44]. Giummarra et al. (2004) undertook a study to construct deterioration models for local roads in Australia [45]. By merging experimental and field data, Prozzi and Madanat (2004) showed the generation of pavement performance models. When compared to approaches like conventional least squares, the methodology described in this work makes the best use of available data and produces models with better statistical features [46]. Chen et al. (2011) examined the application of IRI-based pavement deterioration prediction models, which included four deterministic pavement performance models (i.e., the NCHRP model, Dubai model, AI-Omari Darter model, and the NMDOT model used by the New Mexico Department of Transportation) [47]. Sun and Gu (2011) combined the advantages of the Analytic Hierarchy Process (AHP) and Fuzzy Logic theory to produce a new technique for pavement condition prediction and project prioritization [48]. Hassan et al. (2017) described the technique and results of deterioration modelling for five different types of bituminous pavement surfaces, which may be used to detect and prioritize resurfacing needs. Overall, the data show that for most surface types, the three modelling methodologies produce similar forecasts and deterioration rates [49].

3 Study Sections and Time-Series Data Collection

To develop a pavement performance model and for calibration of HDM-4 models, it is necessary to possess a group of distress data that serves to represent the real performance curve; preferably the data representing a relatively long time. Different researchers have used the Test Section or "Film" technique and the “Window" method to develop pavement performance models [41, 50, 51]. In the test section methodology, which has been adopted in this study, it is necessary to continue measuring pavement performance data for an extended time for each selected section for obtaining reliable predictive data. The study area in the present study is confined to the rural and urban roads situated in Jaipur district having plain terrain in the Rajasthan State of India.

3.1 Road Network Details

The identified urban and rural road sections span 79 km of plain terrain in Rajasthan, India. It consists of 20 functional road sections which are in plain terrain regions. These pavement sections were a part of 7 road networks spread over the rural and urban areas of the Jaipur district. Eventually, only these seven road networks were subjected to HDM-4 strategy analysis. Table 1 shows section-wise data on the number of in-service urban and rural road sections in the study region. Twelve in-service rural road sections and eight urban road sections were eventually chosen for model development. For simple identification in the road network, each of the selected pavement sections has been given a unique "Section ID" and a "Section Name" as shown in the table. “Study Section (SS)-Type of section- Number issued to pavement section” is the series used to denote Section ID. The selected pavement section was dispersed throughout a wide variety of experimental design characteristics, including kinds of soils, geometry, and traffic factors, that are often seen on urban and rural roads in India's diverse geographical areas. Hence, the identified urban and rural road networks can be considered representative of rural and urban road networks in the plain terrains of Rajasthan.

Table 1 Pavement sections selected for the study (2017 data)

Figure 2 depicts the location of the selected urban and rural road network in the broader context of the Jaipur district. The data are collected for each selected section by carrying out field studies (primary data) as well as from secondary sources (secondary data) such as official records of concerned Public Works Department (PWDs), In-charge of maintenance of these roads, Toll booths, Police interceptors, and other Urban local bodies (ULBs). The data-collecting procedure has been divided into the following areas: Road Network data; Pavement distress data for five years; Vehicle fleet data; Traffic Volume data; Road User Cost data.

Fig. 2
figure 2

Map showing the location of all study sections

The collected data in detail as well as the data collection approach and tools that were utilized have been presented in Appendix A (Supplementary Materials).

3.2 Evaluation of Pavement Materials

In the laboratory, subgrade soil samples and Hot mix asphalt (HMA) samples taken from the field were evaluated in detail following Indian standards and guidelines [52,53,54,55]. The test results are presented in Table 2. These experiments were conducted at Manipal University Jaipur research facilities. Representative subgrade soil samples were obtained from the test pits (0.5 m × 0.5 m) for each rural and urban pavement section for laboratory characterization of subgrade soil materials.

Table 2 Laboratory Evaluation of Pavement materials data (2017 data)

The Modified Structural Number of Pavement (MSN) is an indicator of the strength and structural adequacy of a pavement. It is calculated using the Structural Number of Pavement (SNP) value and by use of Eq. 1 given below as follows:

$${\text{MSN}} = {\text{SNP}} + 3.51\left( {\log_{10} {\text{CBR}}_{s} } \right) - 0.85\left( {\log_{10} {\text{CBR}}_{s} } \right)^{2} - 1.43,$$
(1)

where, SNP is the Structural Number of Pavement, MSN = Modified Structural Number, CBRs is the in-situ CBR value of the subgrade, in %

The pavement Structural Number is calculated by multiplying the layer material type specific coefficient by the layer thickness and then adding them together. The layer coefficients for different types of pavement layers are proposed by the American Association of State Highway and Transportation Officials (AASHTO) Guide, 1993 [56]. In HDM-4 the construction defects are input through two indicators as follows:

  • CDS—Construction defects indicator for bituminous surfacing

  • CDB—Construction defects indicator for the base

CDS is a parameter that indicates the general amount of binder content and stiffness compared to the best material design for a certain bituminous combination. It is used to determine if a bituminous surfacing is prone to cracking and ravelling (low CDS value) or rutting due to plastic deformation (high value of CDS). CDS is a continuous variable with a range of 0.5–1.5 as indicated in Table 3. The base construction defects indicator (CDB) is used for potholing. CDB is a continuous variable that ranges from 0 (no construction defects) to 1.5 (several defects).

Table 3 Construction defect indicator (CDS) for bituminous surfacing

4 Calibration of HDM-4 Models

HDM 4 road deterioration models include 'calibration factors,' which are linear multipliers for adjusting projections to fit local conditions, to help with calibration and local adaption. Each of these calibration factors in HDM-4 has been set to unity as the default value [22, 24, 25]. In this study, the "Window" approach was utilized to calibrate pavement distress models. The "Window" concept defines homogenous rural and urban road sections based on their most representative factors, such as structure, traffic, geometry, and climate. Each of these sections is, therefore, called a window in a pavement's performance curve and indicates a specific pavement's performance when combined with other windows (similar individual sections).

The selected in-service rural and urban road sections were divided into five cells (cells 1 to 5) and HDM-4 pavement deterioration models were calibrated for each cell. The calibration procedure entails calculating the adjustment variables (Ki) that will result in the best agreement between the model's predictions and field data [20, 57]. Section 4.1 contains the details of the calibration process used in this investigation.

For calibration of these deterioration models, three calibration levels (1—Basic application, 2-Primary relationships, and 3-Adaption) have been established in the HDM-4 tool, each of which requires a specific amount of work and resources depending on the application. In this study, Level 2 calibration was carried out for rural and urban pavement sections. For Level 2 initiation calibration, the observed values of the deteriorations for several years in these sections were compared with the results obtained from un-calibrated HDM-4 models. For distress progression factors, the final calibration factor corresponding to the maximum R2, minimum Root Mean Square Error (RMSE), and Average Absolute Error (AAE) value has been suggested among the different trial calibration factors [19]. Based on these methodologies, the initiation and progression calibration factors for all 20 sections were determined for Cracking, Ravelling, Potholes, Rut Depth, and Edge break. Subsequently, the calibration factors obtained from Level 2 calibration, as indicated in Table 6, were adopted for further studies.

4.1 Level-2 Calibration Methodology

4.1.1 Distress Initiation Calibration Factors

The coefficient (calibration factor) between the observed year of occurrence of the distress and the year of occurrence predicted by the un-calibrated models was determined using Eq. 2, for the calibration of surface distress initiation factors as follows:

$$K_{ci} = \frac{{\text{mean OTCI}}}{{\text{mean PTCI}}},$$
(2)

where, Kci is the crack initiation calibration factor, OTCI is the observed time of crack initiation, PTCI is the predicted time of crack initiation.

Based on this methodology, the initiation calibration factors for all five cells comprising 20 pavement sections were determined for the following distresses:

  • Cracking Initiation (Kci)

  • Ravelling Initiation (Kri)

  • Pothole Initiation (Kpi)

4.1.2 Distress Progression Factors

The following is the procedure used to determine the calibration factors for distress progression [12, 22]:

  • With the road network and vehicle fleet input data, HDM-4 was run for each cell containing a homogenous set of study sections. HDM-4 suggests a range of calibration factors ranging from 0 to 20, with a default value of 1. According to several previous research, the calibration factors in underdeveloped countries, such as India, range from zero to five [9, 22, 25]. As a result, HDM-4 was run in the first stage with calibration factors ranging from 0 to 5 with a 0.10 increment.

  • The calibration factors were calculated using the results of the first run, which corresponded to the lowest values of statistical parameters reflecting the difference between anticipated and observed distresses in time series data. The HDM-4 was run for the second stage after acquiring the calibration factors from the first run by taking the calibration factors within the narrower range of factors with an increment of 0.01, as determined from the first run, for further improvement of the calibration factors. The statistical parameters used in the study for calibration are given in further detail.

Based on this methodology, the progression calibration factors for all five cells comprising 20 pavement sections were determined for the following distresses:

  • Cracking Progression (Kcp)

  • Ravelling Progression (Krp)

  • Pothole Progression (Kpp)

  • Edge Break Progression (Keb)

Tables 4 and 5 demonstrate the procedure of getting the final value of calibration factors for the initiation and progression of cracking for homogeneous Cell-2 with the most sections (6 sections). The tables show the trials involving the closest values to the final distress progression factor.

Table 4 Level-2 calibration process of HDM-4 cracking initiation model (Cell 2)
Table 5 Level-2 calibration process of HDM-4 cracking progression model (Cell 2)

Other distress/cell calibration factors were calculated using the same method, and the results are reported in Table 6. Appendix B (supplementary materials) contains thorough calculations and data for each section/cell. To arrive at the final values of calibration factors for all the distress progression models, the best values of the statistical indicators RMSE, AAE, and R2 were considered [22, 25].

Table 6 Level-2 calibration factors for HDM-4 deterioration models

The average of the Initiation/Progression Calibration factors for relevant cells was then used to generate the Global Calibration Factors (GCF) for urban and rural roads. Table 7 shows the details of the GCF calculation.

Table 7 Global calibration factors (GCF) for urban and rural pavement sections

4.2 Analysis of Calibration Data

Scrutiny of the above calibration factors indicates that:

  • In the case of pavement sections in Cell-1, consisting of 4 sections, initiation of all types of distress starts on the pavement surface later than predicted by un-calibrated HDM-4 models. The crack initiation will begin 1.91 times slower, the ravelling initiation 1.7 times slower, and potholes will appear 3.27 times slower than calculated by uncalibrated HDM-4 models. The progression of these distresses is comparatively slower as compared to those predicted by the corresponding HDM-4 model as apparent from the calibration factors obtained for all distresses. The rate of distress progression is slower by 46%, 58%, 82%, and 7% for Cracking, Ravelling, Potholes, and Edge break, respectively.

  • In the case of pavement sections in Cell-2, consisting of 6 sections, initiation of all types of distress starts on the pavement surface later than predicted by un-calibrated HDM-4 models. The crack initiation will begin 1.57 times slower, the ravelling initiation 1.51 times slower, and potholes will appear 3.2 times slower than calculated by uncalibrated HDM-4 models. The progression of these distresses is comparatively slower as compared to those predicted by the corresponding HDM-4 model as apparent from the calibration factors obtained for all distresses. The rate of distress progression is slower by 40%, 44%, 73%, and 3% for Cracking, Ravelling, Potholes, and Edge break, respectively.

  • In the case of pavement sections in Cell-3, consisting of 2 sections, initiation of all types of distress starts on the pavement surface later than predicted by un-calibrated HDM-4 models. The crack initiation will begin 1.71 times slower, the ravelling initiation 2.39 times slower, and potholes will appear 2.92 times slower than calculated by uncalibrated HDM-4 models. The progression of these distresses is comparatively slower as compared to those predicted by the corresponding HDM-4 model as apparent from the calibration factors obtained for all distresses. The rate of distress progression is slower by 55%, 69%, 84%, and 9% for Cracking, Ravelling, Potholes, and Edge break, respectively.

  • In the case of pavement sections in Cell-4, consisting of 4 sections, initiation of all types of distress starts on the pavement surface later than predicted by un-calibrated HDM-4 models. The crack initiation will begin 1.2 times slower, the ravelling initiation 1.37 times slower, and potholes will appear 1.68 times slower than calculated by uncalibrated HDM-4 models. The progression of these distresses is comparatively slower as compared to those predicted by the corresponding HDM-4 model as apparent from the calibration factors obtained for all distresses. The rate of distress progression is slower by 11%, 21%, and 9% for Cracking, Ravelling, and Potholes, respectively.

  • In the case of pavement sections in Cell-5, consisting of 4 sections, initiation of all types of distress starts on the pavement surface later than predicted by un-calibrated HDM-4 models. The crack initiation will begin 1.12 times slower, the ravelling initiation 1.15 times slower, and potholes will appear 1.53 times slower than calculated by uncalibrated HDM-4 models. The progression of these distresses is comparatively slower as compared to those predicted by the corresponding HDM-4 model as apparent from the calibration factors obtained for all distresses. The rate of distress progression is slower by 6%, 4%, and 6% for Cracking, Ravelling, and Potholes, respectively.

  • According to the global calibration factors (GCF) obtained, the variability between observed distress and uncalibrated HDM-4 values was found to be greater for rural pavement sections than for urban pavement sections, even though they follow the same pattern for distress initiation/progression as described above for individual cells.

5 Development of Pavement Deterioration Models

5.1 Sensitivity Analysis of Distresses

Sensitivity analyses are statistical studies that assess a dependent variable's sensitivity to changes in independent variables (also known as explanatory variables) over a suitable range of values. In this study, the analysis was conducted using the Spearman rank correlation coefficient method, Fisher-Snedecor’s F-test, and the student’s t-test of ranked variables. The selection of final variables, to be used in pavement deterioration models, was done thereafter.

Spearman’s correlation coefficient is a statistical measure of the strength of a monotonic relationship between paired data. In a sample it is denoted by rs and is by design constrained as follows:

Its interpretation is like that of Pearson's coefficient (r), e.g., the closer rs is to ± 1, the stronger the monotonic

$$- 1 \, \le r_{{\text{s}}} \le \, 1$$

relationship. It can be calculated using the following formula:

$$r_{{\text{s}}} = 1 - \frac{{6\sum d^{2} }}{{N\left( {N^{2} - 1} \right)}},$$
(3)

where, d is the difference between the two ranks of each observation, N is the number of observations.

Further, hypothesis testing was used to determine the utility of the regression model and the relationship between the dependent and independent variables. The hypothesis is based on some assumptions regarding the problem. We made the following assumptions while utilizing the t-test procedure; Both samples are random samples chosen from separate populations that can be represented by a normal distribution, and both populations' standard deviations and variances are identical. The followings are the null (Ho) and alternative hypotheses (H1):

$$\begin{gathered} H_{{\text{o}}} : \, \beta_{j} = \, 0 \hfill \\ H_{1} : \, \beta_{j} \ne \, 0 \hfill \\ \end{gathered}$$

Rejection Rule: Reject Ho if, tstat > tcritical or tstat < –tcritical.

If the Ho is not rejected, then the independent variable associated with that regression coefficient is not significant in explaining the variability of the dependent variable in the model. tstat is calculated using Eq. 4 below as follows:

$$t_{{{\text{stat}}}} = \frac{{\widehat{{\beta_{j} }}}}{{SE\left( {\widehat{{\beta_{j} }}} \right)}},$$
(4)

where βj is the regression coefficients, and SE shows standard error of the regression coefficient.

The calculated 't' values (tstat) for pavement deterioration family models for the rural and urban roads in plain terrain have been compared with critical 't' values for the level of significance (α) of 5% (tcritical) and degree of freedom as (N-2), and the details of the same are given in Appendix C (Supplementary materials). If the observed value of the test statistics, tstat falls into the rejection region, the Ho hypothesis is rejected it can be concluded that there is a significant relationship between the independent and dependent variables of the model.

The P-value technique (F-test) is also used to evaluate individual regression coefficients, much like it is for determining the significance of the regression. The P-value is defined as the smallest level of significance that would lead to rejection of the null hypothesis Ho. The hypothesis Ho is rejected if the P-value for tstat is less than the level of significance (α). The details of the F-test of variables with their corresponding P-values are indicated in Appendix C (Supplementary materials).

Eventually, it was found that pavement age, traffic volume, MSN, and CDS were all significant factors in the cause of cracking, ravelling, and potholes, according to the overall study of the sensitivity of variables to the corresponding model (detailed calculations and graphs presented in Appendix C of supplementary materials). Shoulder width, as well as pavement age, MSN, and CDS, was identified to be significant factors in the Edge Break model, rather than traffic volume. Pavement Age, MSN, and CDS were found to be important parameters in the Rut Depth model. After sensitivity analysis, the parameters were chosen and exclusively utilized for the creation of the pavement deterioration model.

5.2 Testing of Assumptions and Box-Cox Transformation

Certain assumptions must be made when using a t-test or an ANOVA. In other words, a statistical test cannot be employed indiscriminately; it must meet a specified set of criteria to be considered acceptable and useful. These conditions are known as model assumptions. The model assumptions for t-test or ANOVA include independence, normality, and homogeneity of variances. Running a test without evaluating its assumptions might result in significant (but incorrect) findings [58]. If just the normalcy assumption failed, the data should be checked first for anomalous observations that are driving the non-normality. Normality can be achieved by deleting or replacing discovered cases in the data or through a transformation such as the Box-Cox transformation.

The Box-Cox transformation is used to alter the distributional shape of a set of data so that analyses and confidence limits that require normality may be applied correctly. This approach may not be able to effectively normalize data with outliers [58]. For the Box-Cox transformation, a λ value of 1 is equivalent to using the original data. Therefore, if the confidence interval for the optimal λ includes 1, then no transformation is necessary. Appendix C also presents the statistical details of the normality test of variables, their probability plots, and the Box-Cox transformation plots of all variables which were used further for the development of deterioration models.

5.3 Model Development Using MATLAB

Pavement deterioration models may be developed using a variety of methods [59, 60]. Straight-line extrapolation, regression, mechanistic-empirical, polynomial restricted least square, S-shaped curve, probability distribution, and the Markovian technique are a few examples.

This study aims to create non-linear regression-based pavement deterioration models for a network of rural and urban roads in the study area's plain topography. Regression analysis is used to suggest deterioration models for each section, using pavement distresses as the ‘dependent’ variable and pavement age, traffic volume, CDS, MSN, and shoulder width as the ‘independent’ variables. The details of deterioration models (rural and urban) along with some statistical parameters such as R2, Adjusted R2, and RMSE values are presented in Table 8. The MATLAB codes for respective models along with their graphical output are provided in Appendix D.

Table 8 Model description with respective statistical parameters

Constraints of the Model These models will only apply to the Rajasthan State of India. It will only be applicable for bituminous (virgin and modified asphalt) pavements located in the plain terrain of the region. The ravelling phenomena on pavements were difficult to identify; thus the models presented should be utilized with discretion.

6 Validation of HDM-4 Calibration Factors and Regression-Based Models

Before using the calibrated HDM-4 pavement deterioration models and regression-based models, it is essential to validate them to ensure that the calibration and developed models are correct [61, 62]. On a selected urban and rural road network, the validity of these models was evaluated, and the competence of these models was assessed. The validity of the calibrated and regression-based pavement deterioration models is evaluated in this study by comparing observed vs. model-predicted distresses on 8 rural and urban road sections (different from model developments) in the study region. The details of selected 8 rural and urban road sections (2 rural and 2 urban sections in each of Ajmer and Kota) for model validation in various districts of the study region are given in Table 9, with locational data shown in Fig. 3. For simple identification in the road network, each of the selected pavement sections has been given a unique "Section ID" and a "Section Name" as shown in the table. “Validation Section (VS)-Type of section- Number issued to pavement section” is the series used to denote Section ID. The details regarding average annual daily traffic (motorized and non-motorized), subgrade CBR, pavement inventory and construction history details, etc. for rural and urban roads selected for validations are collected from field study as well as from the official records of the PWD divisions, in charge of the maintenance of these roads. Measurement of various pavement distresses like total cracking, raveling, pothole, edge break, and rut depth are carried out on each validation rural and urban road section in the years 2021 and 2022. HDM-4 analyses are carried out for each validation rural road section by using the suggested calibration factors as given in Table 7 (using Global Calibration Factors), to determine the predicted distresses corresponding to the suggested calibration factors.

Table 9 Details of Validation pavement sections
Fig. 3
figure 3

Map showing the location of study and validation sections

6.1 Validation of Calibrated HDM-4 Models (cellwise)

For validation of calibration factors calculated for individual cells, the time series pavement condition data for all selected 20 test sections of the rural and urban areas of Rajasthan were collected consecutively for the years 2017–2021. The time series condition data collected in these 5 years for all the test sections, classified into five homogeneous cells, were used for calibration of the deterioration models, and the data collected in the year 2022 were used for validating these models. For cellwise validation, the calibration factors reported in Table 6 were utilized to perform the HDM-4 project analysis and the deteriorations in the study sections were predicted for the year 2022. In the following sections, the validation details for the cracking, ravelling, potholes, and edge break model of a representative cell (Cell 2) are illustrated. Cell 2 consists of 6 pavement sections situated in the rural areas of the study region.

6.1.1 Validation of Cracking Model

The observed vs predicted values for Cell-2 (consisting of 6 sections) are given in Table 10. These values are plotted on a graph with respect to the Line of Equality as shown in Fig. 4 below.

Table 10 Validation of Calibrated HDM-4 models (cellwise)
Fig. 4
figure 4

Validation of Calibrated HDM-4 models (cellwise)

The negative values of residuals obtained for most of the sections indicate the model’s overprediction of cracking. As a result, the observed cracking percentage is lower than what calibrated HDM-4 models suggest. Only for pavement sections in Cell 5, do the positive residuals indicate the model’s underprediction of cracking %. This might be due to the movement of overloaded multi-axle trucks on these urban pavement sections (even though regulations against overloading are in place). Because their frequency was very low, these vehicles were not taken into consideration when calculating traffic volume. So, the urban road sections were not originally designed for these heavy loads and get cracked at a faster rate than predicted by HDM-4 models.

For each cell, the root mean square error (RMSE) between observed and predicted values was found to be less than 0.5, which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 4, it can be observed that the plotted values are quite close to the Line of Equality, indicating a good agreement between the predicted and observed values of the cracks. The slope value (1.04) as indicated in the figure too, being closer to 1, indicates that the predictions are reasonably accurate.

6.1.2 Validation of Ravelling model

The observed vs predicted values for Cell-2 are given in Table 10. These values are plotted on a graph with respect to the Line of Equality as shown in Fig. 4.

The positive values of residuals obtained for most of the sections indicate the model’s underprediction of ravelling. As a result, the observed ravelling percentage is higher than what calibrated HDM-4 models suggest. Only for pavement sections in Cell 3, do the negative residuals indicate the model’s overprediction of ravelling %.

For each cell, the root mean square error (RMSE) between observed and predicted values were found to be less than 0.5, which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 4, it can be observed that the plotted values are quite close to the Line of Equality, indicating a good agreement between the predicted and observed values of the ravelling. The slope value (0.96) as indicated in the figure too, being closer to 1, indicates that the predictions are reasonably accurate.

6.1.3 Validation of Pothole Model

The observed vs predicted values for Cell-2 are given in Table 10. These values are plotted on a graph with respect to the Line of Equality as shown in Fig. 4. The negative values of residuals obtained for almost all sections indicate the model’s overprediction of potholes. As a result, the observed no. of potholes/km is lower than what calibrated HDM-4 models suggest.

For each cell, the root mean square error (RMSE) between observed and predicted values were found to be less than 0.5, which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 4, it can be observed that the plotted values are quite close to the Line of Equality, indicating a good agreement between the predicted and observed values of the potholes. The slope value (0.99) as indicated in the figure too, being closer to 1, indicates that the predictions are reasonably accurate.

6.1.4 Validation of Edge Break Model

The observed vs predicted values for Cell-2 are given in Table 10. These values are plotted on a graph with respect to the Line of Equality as shown in Fig. 4. The positive values of residuals obtained for all sections indicate the model’s underprediction of edge break. As a result, the observed no. of edge break(m2/km) is higher than what calibrated HDM-4 models suggest.

For each cell, the root mean square error (RMSE) between observed and predicted values were found to be less than 0.5, which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 4, it can be observed that the plotted values are quite close to the Line of Equality, indicating a good agreement between the predicted and observed values of the potholes although the slope value of 0.8,8 as indicated in the figure too, indicates that the predictions are somewhat accurate.

6.2 Validation of Calibrated HDM-4 Models (using GCF rural)

For validation of HDM-4 global calibration factors (GCF) for rural regions, four different pavement sections (2 each in rural areas of Ajmer and Kota) were selected. HDM-4 project analysis was performed using the GCF, as indicated in Table 7 and the pavement deterioration on these sections in the years 2021 and 2022 was predicted.

In rural areas, the predicted values were found to be somewhat higher than the observed values for every deterioration model in HDM-4. The root mean square error (RMSE) between observed and predicted values for each distress was found to be larger than 0.5 (Range 0.52–0.91), indicating that model predictions are reasonably close to real data. The slope values on the scatter plots between observed and predicted values range from 0.85 to 1.09, showing that the predictions are relatively accurate.

The observed vs predicted values along with RMSE, for cracking, ravelling, potholes, and edge break on rural validation sections are shown in Table 11. Scatter plot details between observed vs predicted distress for validation of GCF (rural) are shown in Fig. 5.

Table 11 Validation of Calibrated HDM-4 models using GCF-rural
Fig. 5
figure 5

Validation of Calibrated HDM-4 models (using GCF rural)

6.3 Validation of Calibrated HDM-4 Models (using GCF urban)

In urban areas, the predicted values were found to be somewhat lower than the observed values for the year 2021 but this trend reverses in the year 2022 for every deterioration model in HDM-4, which indicates that the rate of distress progression in HDM-4, calibrated using GCF (urban), is higher than the actual observations. The root mean square error (RMSE) between observed and predicted values of cracking, raveling,and potholes indicates that model predictions are reasonably close to real data. The slope values on the scatter plots between observed and predicted values range from 0.89 to 1.17, showing that the predictions are relatively accurate.

The observed vs predicted values along with RMSE, for cracking, ravelling, and potholes, on urban validation sections are shown in Table 12. Scatter plot details between observed vs predicted distress for validation of GCF (urban) are shown in Fig. 6.

Table 12 Validation of Calibrated HDM-4 models using GCF-urban
Fig. 6
figure 6

Validation of Calibrated HDM-4 models (using GCF urban)

6.4 Validation of Regression-Based Deterioration Models

For validation of regression-based pavement deterioration models, 8 different pavement Sects. (4 each in rural and urban areas of Ajmer and Kota) were selected. These pavement sections were located in distinct parts of the study region with their details as presented in Table 9. Using the non-linear pavement deterioration models suggested in Table 8, deterioration in these sections in the years 2021 and 2022 was predicted. The predicted values were then compared to the observed deterioration in these pavements, with the findings reported as follows.

6.4.1 Validation of Regression-Based Deterioration Models (rural)

The root mean square error (RMSE) between observed and predicted values, as indicated in Table 13, was found to be in the range of 0.29–0.65 which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 7, it can be observed that the plotted values are quite close to the Line of Equality (LoE), indicating a reasonable agreement between the predicted and observed values of the distress in rural sections. The slope value ranging from 0.94 to 1.07, as indicated in the graphs too, being closer to 1, indicates that the predictions are quite accurate.

Table 13 Validation of Regression based deterioration models-rural
Fig. 7
figure 7

Validation of Regression based deterioration models-rural

6.4.2 Validation of Regression-Based Deterioration Models (urban)

The root mean square error (RMSE) between observed and predicted values, as indicated in Table 14, was found to be in the range of 0.14–0.30 which is quite reasonable and indicates that the model predictions lie close to actual observations. From Fig. 8, it can be observed that the plotted values are quite close to the Line of Equality (LoE), indicating a reasonable agreement between the predicted and observed values of the distress in urban sections. The slope value ranging from 0.83 to 1.01, as indicated in the graphs too, being closer to 1, indicates that the predictions are quite accurate, except for the pothole model (0.83). It can also be deduced from the statistical analysis that the predictions in the urban scenario have a better correlation with observed values than those in the rural regions.

Table 14 Validation of Regression based deterioration models-urban
Fig. 8
figure 8

Validation of Regression based deterioration models-urban

With negligible bias and prediction errors well within the statistical limits, regression-based pavement deterioration models are thus well validated on these observed vs predicted data.

7 Conclusions

The major conclusions of the current study can be summarized as follows:

  • The Level 2 calibration factors for deterioration models in HDM-4 for every road network were calculated and the values are presented in Table 6 (cellwise) and Table 7 (GCF). These calibration factors can be used for future estimation of deterioration and other analyses to be conducted through HDM-4. These models can also be used to forecast road conditions for bituminous road surfacing and to plan maintenance strategies in the future.

  • According to the global calibration factors (GCF) obtained, the variability between observed distress and calibrated HDM-4 values was found to be greater for rural pavement sections than for urban pavement sections, even though they follow the same pattern for distress initiation/progression.

  • The results obtained from sensitivity analysis of pavement deteriorations indicate that the variables/parameters that have the greatest influence on pavement performance are Pavement age, MSN, Shoulder Width, CDS, and Traffic Volume, in decreasing order of impact elasticity and Spearmans’ correlation rank. It's worth noting that the traffic volume variable in the study belongs to the sensitivity class "negligible," which means it's unlikely to have much of an impact on edge break and rutting progressions. Instead of traffic volume, the variable "shoulder width" is then introduced to the edge break model. Based on the result of the sensitivity analysis of distresses, the parameters to be used further in the pavement deterioration modelling were identified.

  • Based on the time-series data for 5 years, pavement deterioration models of most observed distress on these sections such as Cracking, Ravelling, Potholes, Rut depth, and Edge break were developed using MATLAB, and the model description is presented in Table 8 with MATLAB codes in Appendix D. A few specific observations related to pavement deterioration modelling are as follows:

    • The adjusted goodness of fit (R2 value) is above 0.75 for all the distress models developed, which indicates a good relationship between the independent and dependent variables adopted for the model development.

    • It was also observed that the percentage of distress on pavement sections having low CDS and MSN values was higher as compared to the contrary.

  • During the validation of calibrated HDM-4 distress models, the percentage variability found between observed and anticipated distresses was often relatively low. When compared to the global calibration factors (GCF), cellwise calibration of HDM-4 deterioration models yielded better predictions.

  • Furthermore, statistical inferences related to non-linear pavement deterioration model validation produced the greatest results in terms of output proximity to observed deterioration on validation sections (other than model development). As a result, the locally developed non-linear regression-based pavement deterioration models should be used to estimate the future deterioration of any pavement section in the study region.