Introduction

This paper assesses several methods for estimating the mean length of stay in prison. Techniques used to process data on prison populations are highly diffuse and their efficacy usually depends upon the validity of a particular set of assumptions. Most of the measures used assume the existence of a stationary population—a population in which the number of entrances equals the number of exits and duration-specific attrition rates are constant. Clearly, this assumption is not valid for the United States’ prison population, which has been growing rapidly since the 1970s (Ruth and Reitz 2003). Violation of the assumption has led to flawed and inconsistent information in the criminological literature concerning the length of stay in prison.

The inadequacy of many present measurement approaches has been recognized by others. Lynch (1993) employed three methods to estimate the mean length of time served in prison and demonstrated that the two indirect methods based on stationary assumptions give values that are biased relative to those yielded by a life table, which is a direct embodiment of exit rates. Biderman (unpublished) reviewed several studies that examine mean length of stay in prison between 1988 and 1994. His paper critiques the methods used and advocates the use of life table methods employed by demographers and actuaries. Lynch and Sabol (1997) compare life table results to those using the most common indirect measure, the mean duration of time served by those leaving prison in any particular year. They show that this latter method provides estimates of the expected length of time to be served for someone incarcerated for murder that are too low by at least 4 years.

This paper provides a systematic analysis of existing measures and develops two new methods for estimating mean length of stay in prison. By this phrase, we refer to the mean length of time an individual will remain in prison from time of entrance to time of exit. We believe that the paper will contribute to the criminological literature by helping to foster accuracy and uniformity in measurement and more precise estimators of the impact of various programs and policies. Improved measures should prove especially useful in the deterrence literature, especially specific deterrence, where length of time spent in prison is often a key indicator (Tittle 1969; Chiricos and Waldo 1970; Logan 1975; Beck and Hoffman 1976; Orsagh and Chen 1988; Gendreau et al. 1999; Kleck et al. 2005).

The remainder of the paper proceeds as follows. First, we describe the best method for estimating mean length of stay, the life table, and then describe three other estimation methods, each of which relies upon the assumption of a stationary population. We show what biases result in these measures when a population is not stationary and we develop a method for correcting the biases in two of the three cases. Secondly, we demonstrate the performance of the four existing methods and two improved methods in populations that are not stationary. One set of such populations are termed “stable” populations in demographic parlance. Next, we use population simulations, allowing us to impose shocks stemming from growth or decline in prison entrances and in changes in the length of stay inside prison. Both of these approaches permit us to assess the sensitivity of the estimates to violation of assumptions and to error in data. We conclude with recommendations about the measures to be used for future examination of the length of stay inside prison.

Estimating Length of Stay in Prison

Life Tables

Methods for studying survival processes have been developed by demographers, biostatisticians, and actuaries. The principal device used to characterize survival, attrition, and length of stay is a life table (Keyfitz 1968; Preston et al. 2001). A life table presents many functions that describe particular features of the dying out of a cohort. A period life table, which is the most common variety, exposes a hypothetical cohort to a set of attrition rates recorded for that period. A period attrition rate has the number of exits during the period in the numerator and the number of person-years exposed to the risk of attrition in the denominator. These attrition rates are arrayed by duration in the state; if there is no duration-dependence, there is no reason to construct a life table. The cohort enters a state at duration zero and is then exposed to the set of duration-specific attrition rates until all members have exited. The classic life table was developed with respect to actual length of life, so that duration in the state (in this case, being alive) was simply one’s age.

Table 1 presents a life table for all prison stays during 1997 in a group of 29 states.Footnote 1 Column 1 shows the duration of stay in prison at the beginning of a particular duration interval. The interval extends to the next value shown in the duration column. Column 2 presents the duration-specific attrition rates (i.e., exit rates) for various intervals.Footnote 2 Column 3 converts the attrition rates to probabilities of attrition during the interval for someone who stays in prison until the beginning of the interval.Footnote 3 Column 4 uses the values in column 3 to calculate the number of persons that remain in prison at the beginning of each duration interval. Column 5 presents the years expected to be spent by an arriving cohort in a particular duration category. Finally, column 6 shows the number of additional years expected to be spent in prison for someone who stays to a particular duration. Upon arrival, the expected length of stay based upon these attrition rates is 2.26 years. Table 2 presents the same information for the offense of murder, where the expected stay upon arrival is 20.01 years.

Table 1 Life table for all prison stays, selected states, 1997
Table 2 Life table for murder stays, selected states, 1997

Stationary Population Measures

The format and calculations presented in Tables 1 and 2 are the most satisfactory means available for measuring the mean duration of stay in prison (the analog to life expectancy at birth in a classic life table). However, they require data on attrition rates by duration of stay, data that are often not available, in addition to requiring meticulous calculations. In the absence of appropriate data, several other means have been used to estimate mean duration of stay. The most common is the computation of the mean length of stay by people exiting prison in any particular periodFootnote 4 (Biderman unpublished; Lynch and Sabol 1997; Lynch 1993). A second common method is to compute the ratio of the number of people in prison at any point in time to the annual number of entrances (Blumstein and Beck 1999; Lynch 1993). A third obvious method that has been suggested but not, to our knowledge, been employed is to compute the ratio of the number of people in prison at any point in time to the annual number of exits (Butts and Adams 2001). All of these surrogate measures rely, implicitly or explicitly, on the assumption that the population of prisoners is “stationary”.

A stationary population is created when the annual number of entrances to a state has been constant for a long period and the set of duration-specific attrition rates from that state have also been constant over time. When these conditions prevail in a prison population, the number of prisoners at a particular duration will be constant and equal to the annual number of annual entrants times the probability of surviving to that duration. Accordingly, the total size of the prison population will be constant; the growth rate will be zero, and the number of annual entrances will equal the number of annual exits.

In a stationary population, the expected length of time to be spent in prison for a new arrival will equal the size of the total prison population divided by the number of annual arrivals. Thus, the expected length of time to be served is the reciprocal of the entrance rate. Since the entrance rate is equal to the exit rate, it is also the reciprocal of the exit rate. The expected length of stay for a new arrival will also equal the mean duration of time spent in prison by those exiting in any particular year, since the duration distribution of exits is constant over time. Thus, the exit distribution in any particular period will replicate the distribution of exits for any entering cohort.

So, a stationary population provides three handy indirect measures of the expected length of stay in a state:

  • the ratio of the size of the population in the state at a moment in time to the annual number of entrances to that state (or, what is equivalent, the reciprocal of the entry rate);

  • the reciprocal of the exit rate;

  • the mean duration in the state of people exiting during any particular year or period.

While these measures are convenient, they can be misleading if the population under study is not stationary. For example, if the annual number of entrances has been growing and attrition constant, the prison population will be “younger” (in duration of time served) than the stationary population consistent with its attrition rates. The ratio of the number incarcerated for 2 years to the number incarcerated for 10 years will be greater than in a stationary population because it comprehends not only attrition between durations two and ten but also growth in the number of entrants during the period 2–10 years ago. Since the population is “younger” than the stationary population, the mean duration of time served by people exiting in any year will be younger than in the stationary population and will provide a downwardly biased estimate of the mean length of stay for a newly incarcerated person. Likewise, the entrance rate will be higher than in a stationary population having the same attrition rates. Accordingly, the reciprocal of the entrance rate will be too low as an estimator of expected length of time served.

In order to gain a general sense of the likely amount of error in such estimates, we take advantage of another demographic model, that of a stable population. The stable model, like the stationary model, also assumes that duration-specific attrition rates are constant. But, rather than assuming a constant annual number of entrances, it assumes that the annual number of entrances grows or declines exponentially. The stationary model is a special case of the stable model, one for which the growth rate in the annual number of entrances is zero.

If attrition rates are constant and the growth rate of entrances has been constant for an extended period, then the population will become “stable”: its entrance rate, exit rate, growth rate, and duration-composition will all be constant over time (Coale 1972; Preston et al. 2001). The basic equations representing the relations among variables are:

$$ b=\frac{1}{\int {\hbox{e}^{-ra}p(a)\hbox{d}a} } $$
(1)
$$ c(a)=b\hbox{e}^{-ra}p(a) $$
(2)

where c(a) = proportion of the population at duration a, p(a) = probability of surviving from entrance to duration a, b = “entrance rate” of the population; annual number of entrances divided by person-years lived in that year and r = growth rate of the population.

In order to evaluate how the estimates that assume stationary conditions fare when the population is growing or declining, we compute the value of the three indirect measures cited above in stable populations. We continue to assume that attrition rates are constant but we allow systematic growth or decline in the annual number of entrances and thus in the total size of the prison population. Our growth rates vary from  −0.05 to 0.2, which mirror the growth rates that occurred during the last two decades of the twentieth century. Using a spreadsheet published by the Bureau of Justice Statistics (Rice and Harrison 2000), we calculated that the annual growth rate of the prison population between 1980 and 1989 varied from  −0.02 to +0.19; between 1990 and 1998 the range was  −0.03 to +0.08.

Table 3 and Fig. 1 show the performance of the indirect measures for “all causes of incarceration” under various growth scenarios. Of course, when the growth rate is zero and the population is stationary, all of the indirect measures are completely accurate. As conditions diverge from stationarity, the measures become more and more biased. At an annual growth rate of .05, the entrance rate estimator is 25% too low; at a growth rate of  −0.05, it is 68% too high.

Table 3 Various estimators of the length of stay in prison in stable populations (in years), all offenses, e0 = 2.262
Fig. 1
figure 1

Various estimators of length of stay inside prison, all offenses

One important result is that the estimated mean length of stay using the reciprocal of the exit rate is always closer to the correct mean length of stay than the other two estimates. This result seems ironic in view of the fact that it is the only one of the indirect measures that has not been employed in the literature. The mean duration of time served by those exiting in any particular year, the most commonly employed measure, is by far the worst estimator in all circumstances. Thus, over a broad range of conditions, the reciprocal of the exit rate is the best estimator of the mean duration of stay in prison for a new entrant.Footnote 5 Below, we show that this result has broad applicability.

To test the performance of these measures under a quite different pattern of exit rates, we examine those imprisoned for murder, an offense with much longer sentences. Table 4 presents the indirect estimates of length of stay for those convicted of murder. In this case, the entrance rate estimator performs even more poorly than in Table 3. At a growth rate of .05, the entrance rate estimator is less than half of its true value. At a growth rate of  −.05, it is more than three times the correct value. The entrance rate performance is worse for murder than for “all causes” because there is a longer set of durations over which the growth-bias can manifest itself. The same increase in error is apparent for the mean duration of time served at exit. On the other hand, in the wide range of growth rates from  −.025 to +.05, the exit rate estimator is within 10% of the correct value. So when nothing else is known, the size of the prison population divided by the annual number of exits provides the best estimate of the mean length of stay in prison among the three methods (Fig. 2).

Table 4 Various estimators of the length of stay in prison in stable populations (in years), murder, e0 = 20.01
Fig. 2
figure 2

Various estimators of length of stay inside prison, murder

Improved Indirect Measures

Even without complete information on attrition rates, analysts can often do a better job of estimating the mean length of stay in prison than by using the indirect measures that we have described. Improvements are possible because, as shown in Tables 3 and 4, the biases in these measures are related to the growth rate of the prison population, a readily observable datum. In order to provide better indirect measures of mean length of stay, we offer two equations that are derived in Appendix B. One is based upon the observed entry rate (3) and the other upon the observed exit rate (4):

$$ \hbox{e}_0^0 \approx \frac{1}{b[1-rA_P ]} $$
(3)
$$ \hbox{e}_0^0 \approx \frac{1}{d[\hbox{e}^{-r(A_D -A_P )}]}, $$
(4)

where d is the exit rate of the stable population and b is the entrance rate of the stable population. There are two expressions rather than three because the indirect procedure involving the mean duration of time served at exit does not lend itself to a simple expression but rather involves variances whose values are infrequently observable.

The formulas involving corrections of the entrance rate and exit rate are straightforward. To estimate the entrance rate in a stationary population—the reciprocal of life expectancy at entrance—it is necessary to perform a growth correction on the observed entrance rate. This correction involves the rate of growth and the mean age of the population. The expression shows that the reciprocal of the entrance rate will underestimate life expectancy in a growing population, and overestimate it in a declining one. This pattern is clearly revealed in Tables 3 and 4. Likewise, to estimate the exit rate in a stationary population (also the reciprocal of life expectancy at entrance) it is necessary to perform a growth-correction on the observed exit rate. This correction involves the rate of growth and the difference between the mean age at exit and the mean age of the population.

These expressions are relevant to the common practice of assuming stationary conditions and using the reciprocal of either the entrance rate or exit rate to estimate years spent in prison. They indicate why the error made by using the entrance rate will nearly always be larger than the error made by using the exit rate, as shown in Tables 3 and 4. The reason is that the difference between the mean age and exit and the mean age of the population, A D  − A P , will almost never exceed the value of the mean age of the population itself. If exit rates are constant with age, then A D will equal A P and there will be no bias whatever in the exit rate estimator. At the other extreme, everyone exits at the same duration. In this case, A P is approximately equal to A D /2 and the biases in the exit rate estimator and entrance rate estimators will be equal to one another. In between these extremes, where most populations reside, A P will be greater than A D /2 and less than A D , so that the bias in the exit rate estimator will be smaller than that in the entrance rate estimator. This result is also relevant to the sensitivity of estimates to errors in the estimated value of r. Because the multiplier of r is almost always larger in the estimator using b than in the estimator using d, the estimator using d will be less sensitive to an error in r than will the estimator using b.

Tables 3 and 4 show that these new expressions perform much better than the uncorrected indirect estimators. In contrast to the uncorrected estimates, there is now little to choose between entrance-based and exit-based estimates. Both give estimates of mean length of stay that are well within 10% of its true value in a range of r from  −.05 to  + .05 for “all offenses”. For murder, the range of below-10% error extends from r =  −.025 to r = .05; both are also within 10% at a value of r as high as 20% per year.

Simulations

The preceding analysis assumed that the prison population was stable and compared two sets of estimators under these somewhat artificial conditions. We now relax the assumption of stability and permit random variability in

  • the time sequence of entrances;

  • the set of attrition rates;

  • both the time sequence of entrances and the set of attrition rates.

To introduce variability into the number of entrances, we assume that “entrances” are expected to grow at an exponential rate, but that the rate is subject to random variation:

$$ r(t)=r[1+v(t)], $$
(5)

where v(t) is a normally distributed random variable with mean of zero and standard deviation of .05. We conducted 100 simulations at each of the expected growth rates, using as expectations the same set of growth rates employed in the stable population analysis.

To introduce a stochastic element into the attrition rates, we assume that

$$ m(x,t)=m(x)[1+w(t)], $$
(6)

where w(t) is a normally distributed random variable with a mean of zero and a standard deviation of 0.10.

Table 5 shows how the “all offenses” estimators perform under these conditions.

Table 5 Various estimators of the mean length of stay (in years) in prison in destabilized populations, all offenses (100 simulations), e0 = 2.262

Table 6 presents the same information for murder. With one exception out of 24 comparisons, the uncorrected exit rate always outperforms the uncorrected entrance rate as a predictor of mean duration of stay. The mean duration of time served at exit is almost always the worst performer. With two exceptions out of 32 realizations, corrected entrance rates and exit rates give mean values of estimated mean stay that are within 10% of the actual values in a range of growth rates from  −.025 to .05. This accuracy applies regardless of whether the source of variation is entrances or exit rates.

Table 6 Various estimators of the mean length of stay (in years) in prison in destabilized populations, murder (100 simulations), e0 = 20.01

Although the mean of corrected entrance rate and exit rate estimates are satisfactory and similar to one another within this range of growth rates, the standard deviation of corrected exit rate estimates is smaller than that of corrected entrance rate estimates 75% of the time (12 of 16 comparisons). Thus, the estimate based on the corrected exit rate is less likely to be seriously wrong. This result holds despite the fact that we assumed twice as high a standard deviation in the annual exit series as in the annual entrance series. The estimates based on the corrected entrance rate are particularly erratic (i.e., have a high standard deviation) when the growth rate in entrances is at least 10%/year and variability is introduced into the entrance sequence. When the growth rate is highly negative (r  =   −.05), all “corrected” estimators work well for the short-lived process and poorly for murder.

Finally, we combine variability in the entrance sequence with variability in exit rates, using the same parameters previously described. The results in Table 7 indicate that uncorrected estimates based on exits always have a lower standard deviation than those based on entrances. And with one exception, uncorrected estimates based on exits have a mean value closer to the true mean than uncorrected estimates based on entrances (ignoring cases where r = 0, where the estimates are essentially indistinguishable). The mean duration at exit is consistently the worst performer. The corrected exit rate estimator has a lower standard deviation than the corrected entrance rate estimator in all but two circumstances. When mean values for the two estimators differ by more than .03, the mean value of the exit rate estimator is closer to the true value in 12 cases out of 13.

Table 7 Various estimators of the mean length of stay (in years) in prison in destabilized populations, stochastic births and stochastic survival rates (100 simulations)

Thus, the simulations show the same pattern of results as the stable population analysis. When no growth-correction is possible, the best estimator is the reciprocal of the exit rate and the worst estimator is the mean length of time served by those exiting in a particular year. When a correction for growth can be made, exit rate and entrance rate estimators are more similar in performance but the exit rate estimator has a lower standard deviation, is less sensitive to errors in data, and is usually closer to the correct value.

Conclusion

This paper compares various estimates of the mean length of stay in prison for someone newly admitted. One estimate is provided by the life table, which yields precise information about survival when duration-specific attrition rates are available. We also considered three indirect indicators—the mean number of years served at exit for people exiting in a particular year, the ratio of the total population to the number of entrances, and the ratio of the total population to the number of exits. These measures are accurate when the population is stationary, i.e., constant in size with constant attrition and constant annual numbers of entrances and exits. Unfortunately, the prison population is not stationary, and we demonstrate the biases that result in these measures from implementing the stationary assumption in a non-stationary population. We also develop two new formulas that provide a growth-adjustment to the reciprocal of the exit or entrance rate so that they give values that are much closer to the true value contained in the life table.

Our findings indicate a hierarchy in the success of the estimates. Estimates based on the reciprocal of the attrition rate perform best both when a growth-correction can be made and when it cannot. When no growth-correction can be made, it is the only one of the three indirect measures that provides satisfactory results over a substantial range of growth rates. When a growth correction can be made, estimates using the exit rate are generally closer to the mark than those based on the entrance rate, and they show substantially less variability in most circumstances. We recommend the use of this measure, which to date has not been exploited in the criminological literature.

Uncorrected entrance rate-based estimates are subject to considerable error even when prison populations exhibit only mild deviations from stationary conditions. However, the corrected measure of the entrance rate works well in stable populations and in population with simulated variability in entrances and in exit rates. Nevertheless, in comparison to corrected exit-based estimates, those based on the corrected entrance rate are more sensitive to errors in the growth rate and have higher standard deviations.

The measure that performed the worst in all applications was the mean duration of time served among those exiting in a particular year, which is also the most commonly used measure in the literature.

Given our findings, every effort should be made to improve estimates of length of stay. Existing measures are often quite misleading and can produce bad policies or flawed policy evaluations. For example, in 1999 a New York Times article summarizing a report from the Bureau of Justice Statistics stated that: “the average time served in state prison by violent criminals rose to 49 months in 1997 from 43 months in 1993” (Butterfield 1999). The article went on to conclude that legislation imposing longer sentences was working. However, the observation could also be a reflection of the observed decline in the growth rate of the prison population between these years, a factor unrelated to sentencing practices. Without a proper analysis, no policy conclusion is warranted. Because these policies affect both liberty and safety, we cannot afford to be cavalier in our calculations.