Introduction

Metropolitan planning organizations (MPOs) coordinate transportation investments from federal, state, and local sources to ensure that regional transportation plans meet performance criteria such as air quality and congestion management. One of the essential ways MPOs determine how to allocate funds is the forecasting of future travel demands. Forecasts are ordinarily made using what is known as the four-step travel demand model.

Some MPOs are beginning to abandon the traditional four-step travel model in favor of activity/tour-based travel modeling (ABT). As of 2015, in the US, ABT modeling was still in its formative stages and not standard practice (Travel Forecasting Resource 2015). Atlanta Regional Commission, San Diego Association of Government, and New York Metropolitan Transportation Council are some of the pioneering MPOs using this approach. Notwithstanding nearly 30 years of promotion of activity-based modeling (ABM) in the travel modeling literature, we believe that enhancements of the conventional four-step model are still relevant and desirable. As presented in this paper, our survey of MPOs, both big and small, shows that all still use the conventional four-step model, and 20 of 25 MPOs surveyed still use the gravity model for trip distribution. The conventional model and gravity model are still near-universal among small and medium-sized MPOs. As a representative of our local MPO said, when it comes to modeling, MPOs need to be “met where they are at.” Meeting MPOs at the current state of the practice and providing an incremental advancement to that practice are the goal of our suggested approach. Our method is meant to be simple and used in connection with the gravity model.

In the simplest terms, the four-step model proceeds from trip generation, to trip distribution, to mode choice, and finally to route assignment. Trip generation tells us the number of trips generated (produced or attracted) in each traffic analysis zone (TAZ). Trip distribution tells us where the trips go, matching trip productions to trip attractions by considering the spatial distribution of productions and attractions as well as the impedance (time or cost) of connections. Mode choice tells us which mode of travel is used for these trips, factoring trip tables to reflect the relative shares of different modes. Route assignment tells us what routes are taken, assigning trips to networks that are specific to each mode. The model’s behaviors are estimated based on travel patterns distilled from surveyed household trips. The model is calibrated and validated by comparing the predicted trips in the base year to actual travel survey data. The four-step modeling process is visualized below in Fig. 1.

Fig. 1
figure 1

Four step travel demand model (Adapted from McNally 2007)

A major weakness of conventional travel-demand models is that they tend to predict intrazonal trips with poor accuracy. Trips are classified as intrazonal if their origin and destination are contained within the same TAZ. Intrazonal trips are a minor consideration in the four-step travel demand modeling process, despite the fact that they typically amount to 10% or more of all trips in household travel surveys. They are treated like any other zonal interchange in the trip distribution step. Trip productions and attractions are modeled as occurring at a single point in the four-step model, the zone centroid, and the entire local street network on which intrazonal trips occur is reduced to one or more centroid connectors to the external street network. This means that intrazonal trips must be modeled differently than interzonal trips. To quote an unnamed reviewer of this paper, “The limitations of the gravity model are well known and it cannot be expected to deal with trips that travel what is really an unknown average distance. Practitioners have tried to overcome this limitation mostly with heuristic approaches to select an average travel distance for intrazonal trips. The main reason for this is that intrazonal trips are not particularly interesting in themselves but their number affects all the other interzonal trips estimated by destination choice models.”

This paper presents a new method for modeling intrazonal trips that addresses the major identified shortcomings of traditional approaches to intrazonal trip modeling in two ways. First, we employ a novel dataset with disaggregated travel survey data coupled with TAZ-specific built environmental measurements. This rich dataset allows us to account for differences in important built environment measures like activity density, street connectivity, and mixed land uses and how they impact intrazonal trip making. The second significant improvement over standard intrazonal modeling efforts is the use of discrete choice modeling. Where traditional methods employ the gravity model which merely measures the attraction potential of a destination less its impedance from an origin on a uniform, aggregated network, discrete choice modeling actually integrates elements of behavior and utility maximization. We use binomial logistic regression, which models the decision of whether to stay within the zone or to leave, as a discrete choice dependent on built environment characteristics within the traffic analysis zone. This method more accurately represents the behavioral aspects inherent in individual travel decision making.

Our paper proceeds as follows. First, we discuss the most common method in use for trip distribution within and across TAZs, namely the gravity model, and known limitations of the method. Then we present results from a survey of 25 MPOs of different sizes across the US, determining their method-in-use for distributing trips. Then we describe our new method, developed as a substitute and improvement upon the commonly used approach. Finally, we present results using our method, validate the models, and conclude its implementation.

Limitations of the gravity model

Various methods have been developed for forecasting intrazonal trips as a component of conventional four-step modeling. However, limitations of the methods raise concerns about the ability of conventional travel demand modeling to adequately account for intrazonal trips. This section considers some methods in common use and their limitations.

One of the most glaring issues with travel demand modeling and the gravity model is that it is done at a relatively aggregate level. Hamilton (1989) was one of the first to point out this issue, stating that as data become more aggregated, the model’s assumptions become more and more compromised. Varying sizes of TAZs could lead to differing likelihoods that trips will be intrazonal (Hamilton 1989; Moeckel and Donnelly 2015; Okrah 2016).

Cervero (2006) provides a critique of the conventional approach to four-step modeling that makes a similar point, while also emphasizing the importance of considering localized information on built environment characteristics. He asserts that in the conventional four-step process, “fine-grained land use mixes, local street connectivity, and pedestrian amenities, do not influence intrazonal trip estimates.” This is a general criticism of four-step models, but is particularly apropos to the modeling of intrazonal trips. The failure to consider local land use and street network patterns potentially leads to an underprediction of intrazonal trip rates in densely developed areas.

Research investigating intrazonal travel empirically in relation to characteristics of the local built environment is scant, but some findings are pertinent to this discussion. Modeling intrazonal travel in Gainesville, Florida, Ewing and Tilbury (2002) found that built environment variables (the D variables of development density, land use diversity, street network design, destination accessibility, and distance to transit) rival or sometimes exceed the explanatory power of the gravity formula used to estimate intrazonal trips in a conventional four-step model. This finding has two implications: first, that conventional models are ill-suited to predict intrazonal trips, and second, that sketch planning models that account for these other variables can correct the problem to a degree. One land-use variable, an entropy measure, appeared consistently significant in their models of intrazonal travel for different trip purposes. This variable, derived from Property Appraisers’ parcel-level data using GIS, captured the following mix of land uses: pedestrian-oriented retail uses; finance, insurance, and real estate offices; general office buildings; and commercial lodging. Also, highly significant in the authors’ models was the presence of a grocery store (for home-based shopping and non-home-based trips) and a public school (for home-based social-recreational and other trips).

Examining intrazonal trip characteristics, Greenwald (2006) found that mode choice for these trips is affected by urban form. The choice of mode, in turn, then affects trip distribution, as non-motorized trips are more likely to stay close to their origin. However, as Greenwald cautions, there is a threshold effect in the ability of the built environment to affect travel behavior; at some point, changes to the economic diversity of a TAZ start showing decreasing impacts on mode choice.

Although research is limited on intrazonal travel measured empirically in relation to D variables, there has been more work on methods for forecasting intrazonal travel as a component of the four-step model. The trip distribution step in the conventional four-step model relies on measuring trip impedance, essentially a measure of the time it will take to travel from a trip origin to a destination. The most common method for capturing impedance is to employ a gravity model, but the standard gravity model disregards local land use and street network patterns. Facile approaches to intrazonal trip distribution are common, including the use of uniform intrazonal trip rates derived from travel surveys as well as simple runs of a gravity model. In the latter case, impedances must be estimated based on intrazonal travel times. Impedances for intrazonal trips are technically zero in the four-step model, since both origins and destinations are located at the same point in space, the zone centroid (Horner and Murray 2002; Bhatta and Larsen 2011). Therefore, intrazonal travel times must be crudely approximated, usually by factoring the size of a TAZ or travel time to adjacent zones.

The traditional four-step model treats intrazonal trips exactly like all trips within the trip distribution step. The basic approach is to use a gravity model to determine the number and proportion of trips being made from a specific origin zone to a specific destination zone. The gravity model works under the assumption that the trips produced at an origin and attracted to a destination are directly proportional to the number of trip productions at the origin and the trip attractions at the destination, and inversely proportional to the travel time impedance between the original and destination. The standard form of the gravity model is depicted below:

$$T_{ij} = \frac{{A_{j} F_{ij} K_{ij} }}{{\sum\nolimits_{allzones} {A_{x} } F_{ij} K_{ix} }} \times P_{i}$$

where Tij is trips produced at i and attracted at j; Pi is total trip production at i; Aj is a total trip attraction at j; Fij is the travel impedance between i and j; Kij is the socioeconomic adjustment factor for interchange ij (Anas 1985).

A relatively large body of literature has been published on techniques for estimating intrazonal impedances in the gravity model, in other words for estimating the Fij values in the above formula. Early methods were based on assumptions that vastly simplified the problem, such as one advanced by Batty (1976). In this method, Batty assumed a constant population density over an evenly spread circular zone. His equation for estimating intrazonal travel cost was as follows:

$$C_{ii} = \frac{{r_{i} }}{\sqrt 2 }$$

where cii is travel cost and ri is the radius of the zone.

Venigalla et al. (1999) suggest a relatively simple method in which intrazonal trip impedance is calculated by merely dividing the trip length and time to the nearest zone centroid in half, sometimes referred to as the nearest neighbor approximation. Others have assumed that intrazonal travel time is two-thirds the time to the nearest neighboring zone, or equal to a set fraction of the average travel time to two or more adjacent zones.

These methods have obvious shortcomings, such as the necessity to make assumptions that zones are circular in shape and demonstrate homogeneous population densities. A marginal improvement to this method was made by Dowling (2005), who divided each zone into 13 concentric squares. The authors then determined mean distance by averaging the distances from the zone centroid to the perimeter of each of the squares. Finally, they used a table of speeds by area type and time of day to compute travel time from the intrazonal distances.

In some regions, the method of calculating intrazonal impedance is based on the zone’s total area as well as the average travel speed of the zone. This approach is one of the earliest to be developed (Lamb et al. 1970). The average intrazonal trip distance is approximated by one half of the square root of the zone’s area, and the conversion to time in minutes is made with the intrazonal speed in miles per hour and the constant 60 to convert hours into minutes (Martin and Mcguckin 1998).

$$Intrazonal\;time = \frac{{0.5\times\sqrt {\left( {{\text{zonal}}\;{\text{area}}} \right)} \times 60}}{{{\text{Intrazonal}}\;{\text{speed}}\;\left( {{\text{area}}\;{\text{type}}} \right)}}$$

Whatever approximation is used, the result flies in the face of findings from our empirical research. Using the gravity model, the larger the zone area is, the greater the impedance is and the smaller the proportion of intrazonal trips becomes. In fact, however, we determined empirically that all else being equal, larger zones capture a higher proportion of total trips generated within the zone. We discuss our research findings on this topic in more detail below.

State-of-the-practice in intrazonal travel modeling

To understand the gap between academic research and practical implementation, we conducted a survey of current intrazonal travel-modeling practices at 25 MPOs in the U.S. in May 2018. We selected MPOs with various population size: three MPOs with a service area population of less than 300,000, nine MPOs between 300,000 and 1 million, and 13 MPOs with more than 1 million population. We focused mostly on large regions because we assume that their MPOs are leaders in using new travel modeling techniques.

We examined the MPOs’ travel modeling documents if available and contacted MPO travel modelers to confirm their methods (Capital District Transportation Committee 2010; CMAP & CATS 2014; Community Planning Association of Southwest Idaho 2017; Fresno Council of Governments 2014; H-GAC 2014; Kimley-Horn and Associates, Inc. 2013, 2014; LSA Associates, Inc. 2007, 2011; METROPLAN Orlando 2016; National Capital Region Transportation Planning Board, Metropolitan Washington Council of Governments 2012; PBS&J 2010ab; RVAMPO 2011; StanCOG 2010; The Association of Monterey Bay Area Governments 2015; The Des Moines Area MPO 2006). The survey findings are presented in Table 1 with their population size, trip distribution model, and intrazonal trip forecast method. The results of our survey show that the four-step travel demand modeling process is still being widely used for regional travel modeling. All surveyed MPOs use the conventional four-step model.

Table 1 The summary of MPOs methods for calculating trip distribution and intrazonal trips (as of March 2018; sorted by population size)

The model that is used most commonly for estimating trip distribution is the gravity model. Out of 25, 20 MPOs use the gravity model for trip distribution—both intrazonal and interzonal. The next most widely used method is the destination choice model, a type of trip distribution or spatial interaction model, typically employing logit. The destination choice model can be thought of as a generalization of the gravity model. In the gravity model, most MPOs use nearest neighbor approximations for calculating the intrazonal travel time, while the number of adjacent zones included in the equation varies from one (the nearest zone; e.g., COMPASS, StanCOG) to four (e.g., ARTS, CHCNGTPO, Memphis, Brunswick).

Basically, the MPOs treat intrazonal trips just like interzonal trips, and the only zone-specific attributes accounted for are trip productions at the zone centroid, trip attractions at the zone centroid, and a crude estimate of intrazonal travel time to create separation between the two—except for CMAP which is not based on the travel time (see Table 1). It is worth mentioning that six of them (FresnoCOG, NCTCOG, SEMCOG, OKI, NJTPA and CMAP) are working on activity-based modeling which is the state-of-the-art in travel modeling. While some of them are almost done with this process, they have not completely switched to ABM yet, as of May 2018.

Our methodology

Data

For 31 regions in the U.S. (Table 2), household travel surveys were collected from MPOs. The surveys were conducted between 2006 and 2012. While conducted by individual regional organizations such as metropolitan planning organizations (MPOs) or State Departments of Transportation, the regional household travel surveys have quite similar structure and questions, akin to U.S. DOT’s National Household Travel Survey (NHTS). To gather comprehensive data on travel and transportation patterns, the survey data consistently includes, but is not limited to, household demographic information, vehicle information, and data about one-way trips taken during a designated 24-hour period on a weekday, including travel time, mode of transportation, and purpose of trip information. The survey data have exact XY coordinates so we could geocode the precise locations of households and the precise origins and destinations of trips. The regional survey data were acquired from individual MPOs or state DOTs with confidentiality agreements. The pooled data set consists of 843,287 trips produced by 89,768 households within 25,469 traffic analysis zones (TAZs) in 31 regions.

Table 2 Percentage of Intrazonal travel by trip purpose from travel surveys

The 843,287 trips were classified as either intrazonal (produced and attracted within the same TAZ) or interzonal trips (produced in one TAZ and attracted to another). On average, intrazonal trips account for 10.7% of total trips. This is a significant share of total trips. We computed intrazonal trip shares by trip purpose from the regional household travel surveys. The result is presented in Table 2. The shares vary from region to region. For example, intrazonal home-based work trips make up 2.9% of all home-based work trips on average, ranging from 1.3% in Eugene to 5.9% in Madison. Intrazonal home-based other trips (excluding work and and shopping-related ones) make up 14.4% of all home-based other trips on average, ranging from 7.4% in Eugene to 26.0% in Palm Beach. This large variance may reflect differences in zone size, land use and street network patterns, or even socio-demographics. The need to model intrazonal travel, in terms of these variables, is evident. In this paper, we show results from modeling intrazonal travel in relation to the D variables for the 31 regions, based on the regional household travel surveys.

Also, we collected land use data at the parcel level with detailed land use classifications, so we could study land use intensity and mix down to the parcel level for the same year as the household travel survey. We also gathered GIS data layers for streets, population and employment for TAZs, and travel times between zones by different modes, again for the same years as the household travel survey. Built environmental variables were computed for each TAZ and assigned to households within the TAZ.

Variables

In this study, the D variables of the built environment were measured and used to predict the intrazonal travel. The measurement of the D variables and their expected effect on travel behavior are summarized in Table 3. Some dimensions capture closely related qualities (e.g., diversity and destination accessibility). Still, it is a useful framework used to organize the empirical literature and provide order-of-magnitude insights (Ewing and Cervero 2010). The dependent and independent variables used in this study are defined in Table 4. Sample sizes and descriptive statistics are also provided.

Table 3 The D variables (Ewing et al. 2015)
Table 4 Descriptive statistics for our variables

For home-based trip (home-based-work, home-based-shopping, and home-based-other) models, the D variables of the TAZ where the home is located were used to characterize the built environment of the TAZ. For the non-home-based-work trip model, the D variables of the TAZ where the workplace is located were used to characterize the built environment of the TAZ. For the non-home-based-non-work trip model, the D variables of the TAZ where the trip origin is located were used to characterize the built environment of the TAZ.

Analysis methods

We treated intrazonal/interzonal travel as a binary choice, and hence modeled it with multilevel binomial logistic regression. We modeled intrazonal travel for the 31 regions. A binomial logistic regression predicts the probability that an observation falls into one of two categories of a dichotomous dependent variable (intrazonal or interzonal travel, in this case) based on multiple independent variables (in our case, the TAZ-level D variables).

A three-level model was required to represent the nested nature of the dataset, with multiple trips nested within TAZs and TAZs nested within regions. Multilevel modeling accounts for dependence among observations. All trips within a given TAZ share TAZ characteristics and all TAZs within a given region share regional characteristics. This dependence violates the independence assumption of standard regression. Standard errors of regression coefficients will consequently be underestimated. Moreover, coefficient estimates will be inefficient. Multilevel models overcome these limitations, producing more accurate coefficient and standard error estimates (Raudenbush and Bryk 2002). The three-level model used in this study partitions variance among the trip level (Level 1), the TAZ level (Level 2), and the regional level (Level 3) and uses level-specific variables to explain the variance at each level.

A multi-level model can be interpreted the same way as a single-level model; values of the independent variables are substituted for the variables in equations, multiplied by coefficients, and summed to get the log odds. Then, by exponentiating the log-odds, we can compute the odds of an intrazonal trip and the probability of an intrazonal trip, which is equal to [odds of intrazonal trips/(1 + odds of intrazonal trips)].

The final models were chosen based on three considerations—(1) whether the sign of a coefficient is expected or not (for example, total population of a TAZ is expected to have a positive relationship with the share of intrazonal trips. If not, we drop that variable), (2) statistical significance of the explanatory variable, and (3) the overall model fit based on the pseudo-R2.

Model validation

To test how well the intrazonal models are able to predict intrazonal travel, we evaluated the predictive performance of our five models—one for each trip purpose—by running k-fold cross-validation on our datasets (Fielding and Bell 1997; Hair et al. 1998). Using the same data to estimate parameters and to test predictive accuracy may overestimate model validity. In k-fold cross-validation, the data are divided into k equal partitions. One partition is withheld, and the model is fitted with the remaining data. As Borra and Di Ciaccio (2010) suggest, data were randomly divided into tenfolds: 90% of the data (training data) used for model fitting and 10% of the data withheld for model validation in each iteration.

The receiver operating characteristic (ROC) curves and the areas under ROC curves (AUC) are appropriate measures to evaluate prediction capability of logistic regression models (Greiner et al. 2000; Hanley and McNeil 1982; Meng 2014; Zweig and Campbell 1993). For the ROC curves, the rate of true-positives is plotted on the vertical axis and the rate of false-positives is plotted on the horizontal axis. Then the ROC statistics, AUC, provide the predictive accuracy of the logistic models, with values from 0.5 (no predictive power) to 1.0 (perfect prediction). In this study, the ROC curves were first used to visualize prediction capability of our models using only the left-out partition that was not used in model fitting. Predictive accuracy is then assessed by calculating the areas under ROC curves (AUC). This procedure is repeated for each of the k partitions, and the AUC values are averaged to obtain the mean AUC value.

In addition to the k-fold validation, we also validated our models against a conventional practice—the gravity model. How much more accurate is our model than the gravity model? Instead of modeling it, there are a few regions using a constant value, a region-wide proportion of intrazonal trips by trip purpose, to estimate intrazonal trip distribution. Is our model better than that simplest approach?

To prove the validity of our model, we compare our model with two other models—a gravity model and a constant model (using a region-wide average proportion of intrazonal trips by trip purpose) using data from two regional MPOs—Wasatch Front Regional Council (WFRC) and Mountainland Association of Governments (MAG). Two regions were selected because we could obtain intrazonal proportions by TAZ from their gravity models. Thus, our unit of analysis is the TAZ. The modeled values were compared against the actual proportion of intrazonal trips by trip purpose by TAZ from the 2012 Utah Household Travel Survey.

The problem with this approach is that many TAZs have no or only a few trips. This raises sampling error issues, meaning that the small number of trips in the survey cannot represent all trips occurring in that TAZ. For example, if a TAZ has only one trip (which is internal) from the survey, it gets 100% intrazonal trip probability. Thus, we tried different values in the minimum number of trips in a TAZ to minimize the sampling error and determined 20 as a threshold for model validation purposes.

Root mean square error (RMSE) is an appropriate measure of model prediction quality between two continuous variables (in this case, the proportion of intrazonal trips in the survey vs. a model). RMSE is a frequently used measure of the differences between values predicted by a model and the values actually observed. RMSE is a measure of accuracy, to compare forecasting errors of different models for a particular dataset. The smaller the RMSE, the more accurate the model (and the better the predictive power).

Results

Intrazonal trip share models

Tables 5, 6, 7, 8 and 9 show the results of multilevel binomial logistic regressions for intrazonal trips by trip purpose. The intercept in the tables is the constant of the models, which is the expected mean value of log-odds of Y (intrazonal trip share) when all independent variables are zero. The coefficients are log-odds of a trip being intrazonal not interzonal for a one-unit change in the specific independent variable. By exponentiating the log-odds, we can compute the odds of intrazonal trip and the probability of intrazonal trip, which is equal to [odds of intrazonal trips/(1 + odds of intrazonal trips)].

Table 5 Home-based-work models
Table 6 Home-based-shopping models
Table 7 Home-based-other models
Table 8 Non-home-based-work models
Table 9 Non-home-based-non-work models

Different D variables are shown to be significant predictors of intrazonal trips for different trip purposes. All relationships are as expected. To summarize, total employment (demographic variable) is positively associated with the share of intrazonal trips for all five trip purposes. Total population (demographic variable) is positively associated with the share of intrazonal trips for home-based-shopping, home-based-other, and non-home-based-non-work purposes. Area size has a positive association with the intrazonal trip likelihood for home-based-work, home-based-shopping, home-based-other, and non-home-based-non-work trips. Activity density is only included in non-home-based-work model. A land use diversity variable, job-population balance, is positively related to the share of intrazonal trips for all home-related trip purposes but home-based-work trips. Destination accessibility—the percentage of jobs available within 10-min, 20-min, or 30-min by car or 30-min by transit—is negatively associated with the share of intrazonal trips for all five trip purposes. This implies that the more jobs immediately outside of the given TAZ, the more likely a trip crosses the zone boundary for specific trip types. A measure of street network design—the percentage of four-way intersections—is positively associated with intrazonal trip likelihood only for home-based-shopping and non-home-based-work trips. Lastly, regional variables—total population, total employment, and population density—are not statistically significant in any models, and so were dropped.

Model validation results

After fitting the models with the full data, we assessed the predictive power of the five intrazonal models using tenfold cross-validation. Travel data were randomly split into ten equal-sized groups. The validation data set, 10% of the data, was used to validate the model which was fitted using the other 90% of the data through multilevel logistic regression.

As a result of the tenfold cross-validation, we obtained average AUCs by trip purpose. The average AUCs range from 0.671 for the non-home-based-non-work model to 0.887 for the home-based-work model (Fig. 2). The AUC provides the predictive accuracy of the logistic models, with values from 0.5 (no predictive power) to 1.0 (perfect prediction). Following Swets (1988) and Manel et al. (2001), models with an AUC value ranging between 0.7 and 0.9 are treated as ‘useful applications’ and those with values greater than 0.9 as being of ‘high accuracy.’ Thus, most models can be considered useful applications. The non-home-based-non-work is lower than the threshold of 0.7, implying a need for a different, more advanced modeling approach such as generalized additive model (Hastie and Tibshirani 1990).

Fig. 2
figure 2

Model validation (1): receiver operating characteristic (ROC) curves and the area under the ROC (AUC) statistics for measuring predictive power of the models

In addition to the k-fold validation, we validated our models against a conventional practice—the gravity model. We compare our model with two other models—a gravity model and a constant model (using a region-wide average proportion of intrazonal trips by trip purpose) using travel survey data from the 2012 Utah Household Travel Survey.

Table 10 shows that our model outperforms other models for all five trip purposes. The error rate of gravity model is significantly higher than that of our model (more than tenfold in most models), and even higher than the constant model using an identical region-wide value of intrazonal proportion for each trip purpose.

Table 10 Model validation (2): root mean square error (RMSE)

Conclusions

Conventional four-step models, used by virtually all metropolitan planning organizations (MPOs), state departments of transportation, and local transportation planning agencies to forecast future travel patterns, are the basis for long-range transportation planning in the United States. Trip distribution is one of the critical steps in travel demand forecasting. In the model structure, it consists of two categories—intrazonal trips and interzonal trips. As Bhatta and Larsen (2011) explained, intrazonal trips cannot be ignored, due to the impact they have on important aspects of transportation, such as congestion and pollution. For modeling intrazonal trips, there are two important components: (1) predicting whether a trip will be intrazonal and (2) determining the impedance of intrazonal trips. Little attention has been given to the former component, and in this study, we developed an approach to enhance the conventional gravity model for predicting intrazonal trips by including more built environment D variables and using a more robust modeling method.

In the first step, we surveyed 25 MPOs about how they model intrazonal travel. The finding shows the dominance of the gravity model with nearest neighbor assumptions, while a few regions are currently in the process of shifting to activity-based modeling. However, the current model involves validation errors, probably due to differences in zone size, land use, and street network patterns, none of which should be overlooked. The need to model intrazonal travel in terms of the built environment variables is evident. Thus, by using multilevel binomial logistic regression models and regional household travel survey data from 31 U.S. regions, we proved that different D variables are significant predictors of intrazonal trips for different trip purposes. Model validation results confirm that our models are useful for prediction purposes.

There is broad interest in the planning and policy communities in developing accurate tools to predict the consequences of land use and transportation strategies on travel demands. State, regional, and local organizations such as state departments of transportation and MPOs, public health organizations, transit agencies, and city and county planning commissions are also eager to have a reliable means of evaluating growth scenarios and planning alternatives. To this end, the results of this study could be used in travel demand modeling practice, especially in the hundreds of medium- and small-sized MPOs. It is worthwhile to note that two regional MPOs, Wasatch Front Regional Council (WFRC) and Mountainland Association of Governments (MAG), are incorporating our models into their four-step models in the transportation modeling software, Cube, to improve the accuracy of travel forecasts. Because we estimated models based on 31 region database, the models have external validity, and are generalizable for future changes on land use and transport toward more compact, mixed-use, and transit-supportive developments.

A limitation to this study is the fact that we are proposing a novel approach to the less than novel practice of four-step travel demand modeling. A travel modeler must understand the limitations of our modeling approach—advanced intrazonal trip models with built environment variables. For one thing, there can be a trade-off between a unified destination choice model and a gravity model separating intrazonal and interzonal trips. This separation of trips clearly affects the accuracy of a calibrated distribution model. However, we contend that intrazonal travel is qualitatively different than interzonal travel, even to a nearby zone, because intrazonal travel time is only crudely approximated by the gravity model. Furthermore, the decision to stay within a zone is highly affected by the built environment characteristics of the zone. Our use of D variables to model intrazonal trip shares is essentially a refinement of the intrazonal travel time estimate for vehicle trips and an add on for intrazonal travel by non-motorized modes. In a subsequent study, one could add D variables as zonal attributes in a more sophisticated utility function for a destination choice model and compare the results.

In addition, as we described in the introduction, the state-of-the-art is activity-based modeling (ABM). Many of the shortcomings of the trip-based approach to travel modeling such as the inability to consider the potential sequencing of trips, are rectified by the application of ABM. However, while ABM is the state-of-the-art in travel demand modeling, trip-based modeling is still the state-of-the-practice for small to medium-sized MPOs and many large MPOs surveyed. While we acknowledge that our small sample of large MPOs seems to have some bias toward those which are still using the four-step model, our survey indicates that some of the largest MPOs with the highest capacities continue to use the four-step model. An incremental improvement to the tool that is currently the most ubiquitous among travel modelers is a valuable contribution to practice.