Introduction

Time headway between successive vehicles is a critical microscopic traffic-flow parameter. It helps to perform a wide range of analyses for road traffic, including road safety studies, car following-lane changing behaviour, capacity and level-of-services [1]. Thus, researching the characteristics of vehicle headways and then developing appropriate distribution models have been an essential look-out to the traffic analysts while increasing roadway capacity and reducing vehicle delays. Besides, suitable headway distribution models must be applied in microscopic simulation models to generate possible vehicle events. While using the negative exponential function to describe the empirical distribution of headway data is quite common, several researchers reported alternatives that can clearly illustrate such distribution, especially when the prevalent traffic is mixed in character and exhibits frequent vehicle-following interactions.

Vehicle-following interaction is reasonably frequent on two-lane roads, which aggravates further if the prevailing traffic is mixed and composed of various vehicle types in terms of their static and dynamic characteristics. Since both directions of traffic use a single carriageway on such roads, a range of vehicles, including non-motorized ones, use the same road space and also, the flow of traffic in one direction affects flow in the other direction. Faster vehicles overtake slower ones using the adjacent opposing lane through the gap acceptance process. Hence, under such traffic operations, headway events range from shorter during platoon movements to longer when vehicles move freely, and their speed is not restricted. Headways are sometimes even close to zero values when drivers shift on the adjacent lane during overtaking operation [2]. Further, there are instances when a driver finds it tricky to evaluate the acceptable gap under such traffic and hesitates to perform overtaking operations even if the overtaking vehicle is on the adjacent lane. As a result, an oscillating movement between initial and adjacent lanes occurs, resulting in zero values of headways.

Statistically, the exponential model can describe the headway data if the co-efficient of variation is one [2, 3]. However, based on empirical investigations, many researchers reported that the heterogeneity effect of traffic stream results in a deviation of co-efficient of variation; this makes the exponential model inappropriate in describing headways. The hypo-exponential model is suitable when the coefficient of variation is less than one; whereas, a hyper-exponential model is considered statistically valid when it exceeds unity. Erlang distribution function exhibits similar applicability like hypo-exponential distribution in a stochastic process. Though a single distribution model is simple and easy to apply at a low flow level, the performance of such model forms is unsatisfactory, particularly at moderate and heavy flow because of their limited capabilities in approximating smaller headways [4]. The study describes the concepts of using shifted distributions while improving the accuracy of single distribution models when the coefficient of variation is less than one. The study also proposes the combined distribution function as an alternative of mixture distribution for describing headways, respectively, at moderate and heavy flow. Combined distribution contemplates modeling of shorter and longer headways separately based on a simplistic approach. For grouping the data into shorter and longer headways, the current study used partitional clustering and found K-means algorithm with Euclidean distance measure to exhibit aptness in this regard. The appropriate models for headway data were selected using a method based on the K-S test and the examination of probability plots.

In India, interurban highways are, in general, two/ multi-lanes. Moreover, most of the country’s national and state highways and a significant portion of major district roads are of two-lane status. Thus, there is a pressing need to develop capacity standards for such highways, especially when the prevailing traffic is mixed and has a wide variety of vehicles in its composition. Since vehicle-following behaviour varies considerably across vehicle types, most distribution models proposed so far have been reported to exhibit incompatibility in such a mixed-mode movement of traffic, especially on two-lane roads where interaction occurs in both directions. Therefore, the present study aimed to develop appropriate headway distribution models based on microscopic analysis of the time headway of vehicles considering such heterogeneity in the traffic mix. In doing so, it intended to investigate identifying following and free vehicles in the mixed traffic stream and, thereby, developing an appropriate approach to modelling time headways considering following and free vehicles. Eventually, the study developed a comprehensive understanding of distributions of headways which is fundamental in traffic flow study and its’ simulation issues.

Research Background

Many studies reported investigations on vehicle time headways and proposed several distribution models to describe headways. A variety of factors, including traffic flow [5], composition [2], and roadway condition affect distribution of headways and put restrictions on the use of those models. In an attempt to identify the boundaries in the traffic flows where the distribution of headways changes, a study found that at low flow, when interactions among the vehicles are insignificant negative exponential distribution fits the observed headways aptly. However, at medium and heavy flow with a frequent car-following interaction, Shifted Exponential/Gamma and Erlang distribution provided an excellent fit to the empirical distributions [3].

There have been several studies that reported a range of compatible distributions like generalized extreme value [6], hyperlang [7] and negative exponential [2] for urban roads. Interestingly a study found the distribution of headways for urban roads different even across lanes. It indicated Lognormal and Gamma distributions with a shift of 0.24 s and 0.69 s appropriate in passing and middle lanes, respectively [8]. Notably, the applications of Lognormal and Gamma distributions for headways in urban roads are widespread. For instance, modeling preferred time headway and drivers' time headway [9] and driver’s desired time gap and time headway in a steady car-following state [10].

A handful of studies indicated mixed distributions like gamma-generalized queuing model (gamma-GQM) and double displaced negative exponential distribution (DDNED) for modeling headways. Gamma-GQM exhibits aptness for a wide range of time headways [11], considering traffic flow and composition as endogenous and weather and geometric features as exogenous factors [12]. A study aimed at assessing the performance of various headway models using freeways data observed DDNED to provide a decent fit to data points. However, the study found shifted lognormal distribution to fit such data well for other types of roads [4]. Some recent studies reported Log-Pearson 3, Lognormal, and Log-logistic, Pearson 5 and Pearson 6 distributions suitable for a range of flow levels by examining headway data of two-lane roads with mixed traffic conditions [13, 14].

A spectrum of studies focused on analyzing vehicle-type-specific headways for a variety of roads and found that more often than not, headways are largest for truck-truck combinations and smallest in the case of car-car. Hence, the distribution of headways varies with the headway type. For instance, Erlang, lognormal, and inverse Gaussian models are appropriate for car-truck headways, composite and lognormal for car-car headways, shifted negative exponential, inverse Gaussian and lognormal models for truck-car and truck-truck headways [1, 15, 16]. Based on traffic data of two-lane highways, a study indicated Pearson-III to be suitable for a realistic replication of headways over a range of vehicle combinations and time of day [17]. However, empirical distributions of vehicle-type-specific headways for such highways vary considerably in the event of mixed traffic due to the risk-taking behaviour of the driver population. A couple of studies in this context highlighted this fact and proposed predictive models for understanding car-following behaviour [13, 18]. Another recent study in context to such traffic investigated the applicability of the copula approach for different vehicle-pair combinations and found Frank copula suitable for almost all leader-follower vehicle pairs [19].

While several studies have addressed the distribution of headways, a handful attempted to describe their characteristics under mixed traffic situations. Recent experiences with such traffic revealed that headways of lead-lag vehicles largely depend on the length of the lead vehicle [20], and the interactions cease beyond six seconds headway [21] at vehicle-following situations. Since the desired headway also varies across drivers [22], the current study aimed to systematically investigate time headways of vehicles under mixed traffic and propose a representative distribution of such headways.

Study Motivations

Mixed traffic with a range of vehicle types that include two/ three-wheelers and non-motorized modes like paddle tricycle, etc. is symptomatically different from the traffic that exhibits more or less homogeneous vehicle types. More often than not, mixed traffic, as considered in the current study, exhibits vehicles that are considerably different in their statics and dynamics. They do not follow lane discipline while sharing the same road space and occupy any available lateral position. At times, smaller vehicles occupy the gaps between larger vehicles in the traffic stream, resulting in a significant increase in shorter headways even at moderate and low flow. Thus, headway consists of ‘following’ and ‘free’ components, and a composite distribution can model them for such traffic. For years, several studies derived quite a few important composite models according to this concept; however, calibration and estimation of parameters were not easy for those distributions due to the inherent complexities in analyzing such traffic and the complicated structure of those model forms. Moreover, vehicles' arrival patterns change noticeably with traffic flow, resulting in different distributions to fit better at different flow levels. Thus, it is critical to discern the precise distribution of time headways for a road with a given flow condition.

Data and Methods

Field Study

Design of field study aimed at capturing time headways, covering a range of flow levels. Eight study sites (road segments) were selected for collecting traffic data on a stretch of a national highway (NH-8; two-lane highway) in India. The road stretch provides access to a city at its western end, and the study section extends about twenty kilometres towards the east of the city limit. Therefore, the paper describes traffic heading towards the urban area as westbound and opposing traffic as eastbound. Since the effect of intersections, curvatures, and ribbon developments were outside the scope of this paper; the current study made efforts to select road segments so that they are free from these effects. All the chosen segments have a reasonably long straight stretch with good and uniform pavement conditions (see Fig. 1). Field investigations also included a pilot study for validation and selected two highway segments (SH-13 and NH-11) that exhibit more or less similar roadway and traffic characteristics.

Fig. 1
figure 1

A view of the study section (image by author)

The current study used video cameras to capture traffic data to achieve adequate precision and accuracy in the dataset. The site layout plan for data collection includes a reference line across the pavement and two camera installation points on either side of it to capture both directional movements. The study was conducted during daylight hours and continued for several days covering peak and off-peak hours (each for about two hours) to capture an extensive range of traffic flow levels. For video data extraction, the study utilized computers with a large display screen. It noted the necessary readings, i.e., vehicle type, axle configurations and time when a vehicle crosses the reference line. Then the study computed the time headway of vehicles using the extracted traffic data. Since, it is necessary to determine the class interval for the observed data to obtain a smooth histogram, the study applied Struge’s rule [23, 24] for such assessment.

Figure 2a–b shows the composition of traffic consisting of two/ three-wheelers, bus, truck, car, and non-motorized ones, including paddle tri-cycle. A look into the figure reveals that the share of two-wheeler and car is significant, ranging from about 20–35 and 20–40 percent, respectively. Also, city-bound traffic consists of a substantial proportion of non-motorized vehicles; in a way that they share around 20 percent of total traffic. The study attempted to be acquainted with the reason behind such paradox through an informal opinion poll. Responses from about 50 road users indicate that lack of adequate public transport facilities compel commuters to use para-transit modes like paddle tri-cycle or personalized transportation like cars, motorized two-wheelers, etc.

Fig. 2
figure 2

Vehicle composition at the study sites: a westbound and b eastbound traffic [NMV non-motorized vehicle, Th-W three-wheeler, T-W two-wheeler, v/c ratio–volume to capacity ratio]

Further, traffic flow levels were grouped into a low, moderate and heavy flow based on traffic platooning and expressed in v/c ratio: derived from a capacity value of 2300 pc/h [25]. As moderate flow relates to stable flow, it would be rational to assume that the average number of headways inside and outside platoons under such flow is approximately equal. The current study found that the flow levels corresponding to a v/c ratio of 0.5–0.7 represent the moderate flow. Accordingly, flow levels at v/c ratio of 0.4 or less and 0.8 or more were, respectively, considered as low and heavy flow (see Table 1). Table 1 provides the sample size and also, details of headway frequencies at different flow levels.

Table 1 Details of traffic condition and observed headway data

Modeling Techniques

A suitable headway distribution model development calls for an approach to examine the statistical models with the observed headways (see Fig. 3). The use of negative exponential distribution to describe headways is quite widespread. However, several researchers have proposed using alternate models to illustrate the headway distribution pattern explicitly. Notably, these models are Erlang and Gamma distributions that provide a decent fit for an extensive range of flows and lognormal distribution, which has a theoretical connection to the car-following models [3]. The flexibility and compatibility of Gamma distribution make it suitable for extensive use in modelling headway data [4]. By the same token, Erlang distribution also has wide applicability owing to its relation to the exponential and Gamma distributions.

Fig. 3
figure 3

A schematic representation of the proposed framework for probabilistic modelling of time headway of vehicles with varying sizes and speeds on two-lane highways

The exponential distribution function is considered statistically valid if the headway data’s co-efficient variation equals one [2, 3]. The hypo-exponential model is considered suitable for distributions of such data with a coefficient of variation of less than one. A hyper-exponential model is statistically valid for those with a coefficient of variation of more than one [26]. Typically, Erlang distribution represents a hypo-exponential distribution function [26], and hyper-exponential distribution can be well represented by a mixed distribution function [27, 28]. As the study schemed to develop headway distribution models, data corresponding to an extensive range of traffic flow were fitted to different distributions that meet the required traits (see Fig. 4). The following sections provide a detailed insight into it.

Fig. 4
figure 4

A schematic representation of the combining headway distributions by adjusting the predicted probabilities at the estimated headway threshold ‘h’: a Positive adjustment and b Negative adjustment

An Application of Shifted Headway Distribution Model

Hypo-exponential distribution appears to be appropriate if the observed data's coefficient of variation is less than one. Erlang distribution is found to have similar applicability to hypo-exponential distribution in the stochastic process [26]. Applying single distribution models is straightforward; however, they have limited capabilities in approximating shorter headways. Therefore, the concept of shifted single distribution models is applied to improve the accuracy of single distribution models [4]. Equations 1 and 2 demonstrate the cumulative density functions of shifted negative exponential and shifted Erlang distributions, indicating a \(\uptau\) seconds shift to the right. Accordingly, investigations were made with these functions with a range of shifts (0 to 3 s), keeping a step of 0.015 s to determine a shift against which the function gives the best result.

Shifted negative exponential distribution function is given by,

$$F\left( h \right) = 1 - {\text{exp}}\left( { - \lambda \left( {h - \tau } \right)} \right)$$
(1)

Shifted Erlang distribution function is given by,

$$F\left( h \right) = \left. {\left( {\frac{{\Gamma \left( {h - \tau } \right)/_{\beta } \left( m \right)}}{{\Gamma \left( m \right)}}} \right.} \right)$$
(2)

where: \(\mathrm{h}\) = headways, λ = continuous inverse scale parameter (λ > 0), m = shape parameter (positive integer), β = continuous scale parameter (Erlang distribution) (β > 0).

An Approach of Combined Headway Distribution Model

Overtaking opportunities and proportion of vehicles spaced widely start declining on two-lane roads when flow rate upturns. The situation further aggravates in the event of heterogeneity and a sizable proportion of slower vehicles in the traffic mix, resulting in a significant amount of shorter headways, particularly at moderate and heavy flow. There have been several researchers who proposed the concept of mixed distribution models for describing headways in such situations: these are ‘double displaced negative exponential distribution’ [29], ‘generalized queuing model’[30], ‘Cowan M3 and M4 model’ [4]. Equations 35 demonstrate these mixed distribution functions:

Double Displaced Negative Exponential Distribution (DDNED) function is given by,

$$f\left( h \right) = \left\{ {\begin{array}{*{20}c} {\phi *\Upsilon _{1} *\exp \left( { - \Upsilon _{1} *\left( {h - d} \right)} \right) + \left( {1 - \Phi } \right)*\Upsilon _{2} *\exp \left( { - \Upsilon _{2} *\left( {h - d} \right)} \right)~} & {~~h \ge ~d} \\ 0 & {h < ~d} \\ \end{array} } \right.$$
(3)

Generalized Queuing Model function is given by,

$$f\left(h\right)=\theta *g\left(h\right)+\left(1-\theta \right)*\Upsilon*\mathrm{exp}\left(-\Upsilon*h\right)*{\int }_{0}^{h}g\left(x\right)*\mathrm{exp}\left(\Upsilon*x\right)dx$$
(4)

Cowan M3 distribution function is given by,

$$f\left(h\right)=\left(1-\Phi \right)\delta \left(h-\tau \right)+\Phi \lambda {e}^{-\lambda (h-\tau )}u(h-\tau )$$
(5)

where: f (h) is the probability density function; \(\upphi\) is a weighting factor constrained by \(0<\phi \le 1\); \({\Upsilon }_{1}\) and \({\Upsilon}_{2}\) are constants associated with the flow status, and d is a displaced parameter; \(\uptheta\) = proportion of vehicles tracking at the minimum headway; = \(\updelta (h-\tau )\)unit impulse (Dirac delta) function; x = proportion of the following headway; γ = exponential decay function.

The concept's premise is that headway comprises ‘following’ and ‘free’ components. Accordingly, mixed headway distribution models are supposed to capture headway dynamics better. However, calibration and parameter estimation of such distribution models are tricky because of their complicated structure [4]. This is particularly true for mixed traffic situations where prevailing vehicles exhibit a wide variety of static and dynamic characteristics.

Accordingly, the combined distribution function was considered an alternative of mixture distribution while describing moderate and heavy flow headways to overcome this difficulty. The simplistic analytical approach considers modelling shorter (lower fraction) and longer (upper fraction) headways separately using appropriate distribution functions. Figure 4 demonstrates the approach of combining headway distributions by adjusting the predicted probabilities at the estimated headway threshold ‘h’. An adjustment \({\mathrm{\Delta {\rm P}}}_{\mathrm{h}}\) was made on the proportional contributions of the lower and upper fractions, i.e., ΔΡ1 and ΔΡ2 to ensure that the total probability is one. Partitional clustering technique exhibits its aptness in identifying the headway threshold. Lognormal and Gamma distributions seem appropriate at car-following situations because of their flexibility and compatibility [4], and the negative exponential model describes the widely spaced data well. Therefore, Lognormal and Gamma distributions were applied on headway data for describing shorter headways, and they all were applied for longer headways. Thus, the combined distribution represents ‘following’ and ‘free’ components of headways separately based on the estimated headway threshold ‘h’ as displayed in Eq. 69.

Combined lognormal and Gamma distribution function is given by,

$$F(h) = \int_o^h {\Phi \left( {\frac{\ln h - \mu }{\sigma }} \right)} dh + \int_h^\alpha {\frac{\Gamma h/_\beta \left( \alpha \right)}{{\Gamma \left( \alpha \right) }}dh\Delta {\rm P}_h }$$
(6)

\({\Delta {\rm P}}_{h}\)= Difference in probability between lognormal and Gamma distribution.

Combined Gamma and lognormal distribution function is given by,

$$F\left( h \right) = \mathop \int \limits_0^h \frac{{\Gamma {\text{h}}/_\beta { }\left( \alpha \right)}}{{\Gamma \left( \alpha \right){ }}}dh + \mathop \int \limits_h^\infty \phi \left( {\frac{\ln h - \mu }{\sigma }} \right)dh \pm \Delta {\rm P}_h$$
(7)

\({\Delta {\rm P}}_{h}\)= Difference in probability between Gamma and lognormal distribution.

Combined Gamma and exponential distribution function is given by,

$$F\left( h \right) = \int_0^h {\frac{{\Gamma {\text{h}}/_\beta { }\left( \alpha \right)}}{{\Gamma \left( \alpha \right){ }}}dh\, + \,\int_h^\alpha {\left( {1 - exp\left( { - \lambda h} \right)} \right)dh \pm \Delta {\rm P}_h } }$$
(8)

\({\Delta {\rm P}}_{h}\)= Difference in probability between Gamma and exponential distribution.

Combined lognormal and exponential distribution function is given by,

$$F\left( h \right) = \int_0^h {\phi \left( {\frac{\ln h - \mu }{\sigma }} \right)dh + } \int_h^\alpha {\left( {1 - exp\left( { - \lambda h} \right)} \right)dh \pm \Delta {\rm P}_h }$$
(9)

\({\Delta {\rm P}}_{h}\)= Difference in probability between lognormal and exponential distribution. Where: \(\mu\) = continuous location parameter, \(\sigma\) = continuous scale parameter (Lognormal distribution), α = continuous shape parameter  > 0).

Evaluation of Goodness-of-Fit

Selection of appropriate models that exhibit compatibility to the observed data entails an appropriate method based on goodness-of-fit (GOF) tests. The use of Chi-square and Kolmogorov–Smirnov (K–S) tests for extracting GOF test statistics are quite common in traffic analysis. However, the K–S test provides specific benefits over the chi-square test, like (a) it can use data with a continuous distribution and (b) there is no minimum frequency per test interval [1]. Accordingly, the paper applied the K–S test on the way to identify compatible models. The GOF test statistic, ‘D’ is the discrepancy between the cumulative percentage of the measured and expected frequency: the largest discrepancy over the entire measurement indicates the value [21, 31]. Also, quantile–quantile plots help visualize the goodness-of-fit graphically by comparing their observed and estimated quantiles.

Analysis and Results

Descriptive Statistics of Headway Data

The possible capacity of the highway section was observed to be around 2300 pc/h [25]. Headway data were collected considering directional segments separately. A wide range of traffic volume (expressed in terms of pc/h; calculated based on passenger car equivalents as suggested by IRC 64: 1990 [32]) corresponding to volume to capacity ratio of 0.2–1.0 was covered. Table 2 displays the descriptive statistics of the headway data. A careful examination of Table 2 indicates that the median is less than the mean and the standard deviation decreases with the flow rate at all flow scopes. It signifies the concentration of shorter headways and is attributed to the high risk-ability of driver populations, which eventually lessens traffic safety. Besides, the mean and standard deviation should result in a 45° plot, i.e., coefficient of variation should be one in case of a negative exponential distribution. However, an empirical investigation indicates a deviation, exhibiting the negative exponential model's inappropriateness in describing headways. Moreover, the coefficient of variation was less than one upto a flow level that corresponds to a volume to capacity ratio of 0.3 for both the directions and exceeds the value at low to moderate, moderate and heavy flow (volume to the capacity ratio: 0.4–1.0).

Table 2 Descriptive statistics of headways at different flow levels

Headway Distribution at Low Flow Level

Erlang and negative exponential distributions with shifts ranging from 0 to 3 s (with a step of 0.015 s) were used in modeling headways at low flow. A comparison of the K–S test statistic obtained with 5 percent level-of-significance for the selected models, was made to examine the extent of fit to the observed data. The test statistic decreased with the increase in shifts, which started increasing beyond an optimal value. Accordingly, the trend discerns an optimal shift for the fitted models; Erlang distribution model with shifts of 0.675, 1.41 and 2.76 s and exponential distribution with 0.495 s shift were found to have acceptable statistical validity in terms of K–S test statistic. Table 3 provides the details of goodness-of-fit and estimated parameters.

Table 3 Goodness-of-fit Details and Estimated Parameters of the Fitted Distributions

Figure 5a displays the cumulative density function of the selected models. It indicates that the proportion of shorter headways at a volume to capacity ratio of 0.2 is less for city-bound (westbound) traffic, increasing with the flow. Eastbound traffic points to a diametric characteristic wherein shorter headways were significant at such flow level. This is because city-bound traffic was quite less during the morning, and it starts increasing as the day progresses. In contrast, trips to the city outskirts were sizable during the morning, and, thenceforth, it decreases. Figure 5b indicates that the data points are clustered in about a 45° plot signifying satisfactory agreement between the theoretical and observed data distribution.

Fig. 5
figure 5

Shifted headway distributions at low flow: a Cumulative density functions b Q–Q plot

Headway Distribution at Moderate and Heavy Flow

The proportion of shorter headways was observed to be considerably high at the moderate and high flow levels. It was, therefore, essential to describe shorter and longer headways separately. Accordingly, the field data, collected at different flow scopes, was grouped into clusters or subsets such that the data in each subset are similar to each other [33]. The data set similarity is defined by a distance measure; this plays a vital role in obtaining correct clusters [33]. However, the selection of distance measures depends on the data type. The present study’s observed data set is of interval type as it contains a range of continuous values [34]. Therefore, the Euclidean distance measure was adopted in the present study as it corresponds to the interval data type. The number of clusters was selected, given shorter and longer headways, and then the partitioning algorithm was used for identifying the boundaries. The mean of each cluster was used to determine its centroid. Accordingly, the k-means algorithm was applied, adopting Euclidean distance measure as an effective heuristic method of partitional clustering [35]. The algorithm appeared efficacious since it provides a faster computational platform and helps interpret findings through easy visualization of clusters. The study performed 100 iterations with a random set of initial clusters, calculating the distance between data points and the cluster centre while arriving at the optimized cluster. The limiting value of shorter headways was found to be 4 s; a similar value was also obtained by a study conducted in India [36].

The study considered lognormal and gamma distribution models to describe the shorter headways and lognormal, Gamma and negative exponential distribution models for the larger headways. Table 4 elucidates the goodness-of-fit details of the selected models and the calibrated model parameters. Since the limiting value of shorter headway was 4 s, shorter headways were modelled considering headway data up to 4 s, and the remaining headways were modelled as longer headways separately. The selection of best-fitted models used a method based on K–S test results (see Table 4).

Table 4 Goodness-of-fit details and estimated parameters of the selected distributions

Selected best-fitted distributions for shorter (following component) and longer (free component) headways were accordingly combined (see Eq. 69) and applied for describing the data set. Figure 6 illustrates the cumulative density function of combined models and indicates that the shorter headway’s probability is relatively higher than longer headways. This attributes to the fact that platoon formation is frequent and car following interaction is quite high at moderate and high flow. Figure 7 displays the quantile plots of the combined models and indicates that the data points are close to the straight line signifying satisfactory agreement between the theoretical model and the distribution of the observed data.

Fig. 6
figure 6

Cumulative distributions of combined models at different volume-to-capacity ratio: a 0.4; b 0.5; c 0.6; d 0.7; e 0.8; f 0.9; g 1.0

Fig. 7
figure 7

Quantile–Quantile plots of combined models at different volume-to-capacity ratio: a 0.4; b 0.5; c 0.6; d 0.7; e 0.8; f 0.9; g 1.0

Validation of the Proposed Model

The current study further attempted to test the validity of the model outcomes to ensure that they represent the existing traffic system with a good amount of accuracy. Two different highway sections that exhibit similar roadway and traffic characteristics were selected for the pilot study, and traffic data were collected, respectively, at the low and moderate flow levels. A comparison between the outcomes of the shifted Erlang/ shifted negative exponential models and empirical headways obtained at low flow indicated the variability of predictions. The study expressed them in terms of the standard error of the estimate (SEE) and found the values ranging between 0.04 and 0.06. (see Fig. 8a). The empirical probability of headways measured at moderate flow was compared with the outcomes of combined distribution models, and the values were found to be ranging between 0.02 and 0.06 (see Fig. 8b). These indicate a good agreement between expected and empirical probabilities, and the slight deviation is possibly due to variations in traffic composition and driver’s behaviour across study sites.

Fig. 8
figure 8

Agreement between the probabilities of headways obtained from the graph and pilot field studies: a Shifted and b Combined model

Discussions

The current study investigates headway distribution models when the prevailing traffic is mixed. Based on field measurements, the study attempted to characterize vehicle headways using distribution functions statistically. The investigations indicated shifted Erlang and shifted negative exponential distributions appropriate at low flow and ‘combined distribution’ at moderate and heavy traffic flow. Most drivers move with widely spaced headways at low flow when they have enough freedom to manoeuvre. Frequent interactions at moderate and heavy flow result in the formation of platoons. Vehicles entrapped inside platoons start moving in following with shorter headways and overtake through the gap acceptance process. Table 5 demonstrates a steady increase in ‘probability of headway less than ‘t’ sec’ at moderate flow (upto a v/c ratio of 0.4–0.6), which, however, starts decreasing at the higher flow level. A further investigation clarifies this fact and indicates that most faster vehicles initiate overtaking at a moderate flow level and move with shorter headways in such operations. However, limited overtaking opportunities at heavy flow restrict them from taking such initiatives and accordingly, they start moving in following keeping adequate spacing.

Table 5 Comparison of headway probabilities: combined (current study) and single [21] distribution models

The current study compared the outcomes of combined distribution models with those obtained from single distribution models [21]. It was found that a single distribution model results in a lower probability at moderate flow (v/c ratio: 0.4–0.6) than the combined one. It is, however, quite the opposite in the case of heavy flow when a single distribution model results in a higher probability (see Table 5). The current study, thus, makes it evident that a combined distribution model exhibits its aptness in approximating headway data at moderate and heavy flow under mixed traffic.

Conclusion, Limitation and Scope for Further Study

The paper presents an investigation on time headways of two-lane highways under mixed traffic situations. The study encompassed extensive traffic flow levels ranging from low to heavy for analyzing the driver's behaviour in choosing headways. Observations indicate that a part of the driver population follows impeding vehicles keeping unsafe headways at the car-following situation, i.e., less than 2 s [37]. Statistical parameters were computed, and the mean was observed to be larger than the median at all flow levels, signifying the concentration of shorter headways; this attributes to the high risk-taking behaviour of the driver population, eventually resulting in safety reduction [38]. Further, empirical investigation reveals that the heterogeneity effect results in a deviation of the coefficient of variation and makes the exponential model inappropriate for describing headways. The study found that the hypo-exponential model is suitable when the coefficient of variation is less than one. In contrast, the hyper-exponential model is statistically valid when it exceeds unity. The following sections provide detailed insights into the research findings indicating limitations and scope for future work.

Conclusion

The present study used shifted-Erlang/negative exponential distributions as hypo-exponential and combined distribution as hyper-exponential. It found Erlang distribution with shifts of 0.675, 1.41 and 2.76 s and exponential distribution with a 0.495-s shift to exhibit acceptable statistical validity. The combined distribution function appeared as a suitable alternative to mixture distribution while describing headway data at moderate and heavy flow. A simplistic approach modelled shorter and longer headways separately using (a) Lognormal and Gamma distribution models in describing the shorter headways and (b) Lognormal, Gamma and negative exponential models for the longer ones. Lognormal distribution can describe shorter and longer headways up to moderate flow levels. However, Gamma distribution often best fits the data obtained in high and congested flow due to its flexibility and compatibility.

Headway models, established for describing low-flow states, indicate an insignificant proportion of shorter headways. In contrast, combined models confirm that the probability of shorter headways at moderate and heavy flow is relatively high compared to longer ones. The study observed that frequent vehicle following interaction at such flow levels results in platooning and entrapment of faster vehicles inside those platoons, compelling them to move with shorter headways. Further, empirical examinations substantiate that arrival pattern changes considerably with the change in flow and traffic composition resulting in inter-arrival time or time headways following different distributions at different flow levels.

Future Scope and Limitations

The models have been developed based on field data collected from a straight stretch of roads that pass through plain terrain. The study did not experiment with the impacts of varying geometric conditions, such as vertical and horizontal curvature and pavement conditions. Further, the scope of the paper considers endogenous conditions, such as (a) traffic flow rate and composition, (b) effects of non-motorized vehicles and (c) lead-lag vehicle pair combinations. It does not explain the impacts of exogenous conditions like weather and night visibility on headways. Thus, the study demonstrates the need for introducing a robust time headway modelling technique based on comprehensive field data.