Introduction

Evaluations of place-based criminal justice interventions have been limited in their ability to assess program outcomes in both space and time simultaneously. Prior scholarship in this area tends to consider only one of those elements at a time. For example, Koper (1995) employs event history analysis to examine whether police presence at crime hot spots creates residual deterrence. While useful to identify the optimal duration of time an officer should spend at a hot spot to achieve maximum deterrence (the Koper curve), this aspatial analysis does not provide information on how far deterrent effects extend over a given distance or whether crime in this particular instance is spatially displaced. Alternatively, a study may examine program effects over space and not time. For instance, Weisburd et al. (2006) examines whether concentrating policing at crime hot spots leads to spatial displacement of crime to nearby areas (a two-block radius surrounding the target area) and concludes that intensive police presence does not displace crime outside of the targeted areas. However, accommodating the temporal element of their data would yield a more approximate estimation of true treatment effects since program effects are averaged across the entire study.

The ability of analyses to detect space–time interactions is an important aspect of program evaluation. A space–time interaction implies “more than just spatial clustering (same places experience more burglaries than other regardless of time) or just temporal clustering (the overall risk of burglary varies by season)” (Fotheringham et al. 2000, p. 245), and arises when the location and time of occurrence for an event is dependent upon the location and time of occurrence of a separate event. An example of a space–time interaction is when motor vehicle theft may become more frequent in coastal communities during the summer when there is an increase in the number of vacationers (Levine 2004). In recent years, there have been developments in modeling to examine the problem of space–time interactions in the field of epidemiology. Even though the underlying areas of study differ greatly, methodological advancements in epidemiology can be used to inform and advance the field of criminology.

The aim of this study is to demonstrate the use a bivariate spatiotemporal K-function, which is an innovative method for identifying space–time interactions (Diggle et al. 1995). So far, this statistical technique has been neglected in the field of criminology, but it is increasingly used by epidemiologists to examine the interaction between two point patterns in space and time. To illustrate this approach, the present study employs the method to assess the influence of police stop-question-frisk practices (SQFs) on crime at a microgeographic level. Specifically, this study uses crime incident data to examine the effect of SQFs conducted by the New York City Police Department (NYPD) in a selected subset of cases in the Bronx, NY over a 150-day period in 2006. This research focuses on how the statistical technique can be a useful tool to examine place-based criminal justice interventions and discuss its advantages over the Knox test (a more commonly used space–time approach), as well as traditional, non-spatial evaluation methods.Footnote 1

Background

Description of Analytical Technique

A number of measures to quantify space–time clustering have been developed in geographical epidemiology as a means of identifying outbreaks via the spatial distribution of infectious disease in a given timeframe. One of the simplest and most common techniques is the Knox test, which was the first quantitative test of space–time clustering (Knox 1963). This technique was applied by Knox (1964) to identify unusual clustering of childhood leukemia in Northeast, England. Knox (1964) deemed leukemia cases as adjacent in time if two events occurred in less than 60 days and adjacent in space if they occurred within a distance of 1 km. A 2 × 2 contingency table may be plotted with this information, where X denotes the number of times leukemia cases occurred close in both time and space, N1S denotes the number of cases adjacent in space, and N1T denotes the number of cases adjacent in time (see Table 1). N represents the total number of, n, cases that can be joined in all possible n(n − 1)/2 points pairs. From this process, Knox (1964) concluded that the number of leukemia cases close in time and space exceeded what should be expected based on chance, indicating a disease transmission.

Table 1 Knox 2 × 2 contingency table

The original Knox test only allows point pairings to be placed into two space and two time categories, but an adapted Knox test has been developed to accommodate a range of spatial and temporal scales (Johnson et al. 2007a; Townsley et al. 2003; Wells et al. 2012; Wyant et al. 2012; Zhang et al. 2015). Since the Knox test includes the corresponding space–time categories for each point pairings, the Mantel (1967) test has been employed to test for statistical significance because it accommodates the dependence in observations (David and Barton 1966). The Mantel test is given by the following:

$$\mathop \sum \limits_{i} \mathop \sum \limits_{j \ne i} s_{ij} t_{ij}$$

where s ij is the distance in space and t ij is the distance in time between i and j events. A z test is then employed to determine if the observed values are significantly different from expected, and a separate z score may be computed for each cell (Johnson and Bowers 2004a, b). The Mantel test is equivalent to the Knox test when s ij and t ij take the value 1 when the points i and j are close in space and time. While the Knox test is a useful and powerful tool to study space–time relationships, the spatiotemporal K-function overcomes several of its shortcomings. In particular, (1) it is less computationally intensive, (2) does not have arbitrary cut offs for selecting space and time thresholds, (3) corrects for edge effects, and (4) is more robust under a number of conditions. A description of the spatiotemporal K-function, along with how it differs from the Knox test, is discussed below.

The spatiotemporal K-function offers a similar, but more flexible approach to studying space–time clustering. Since the technique computes estimates over a range of pre-defined spatial and temporal buffers simultaneously, it overcomes the problem of the Knox test that arises due to the selection of arbitrary cut offs (McNally et al. 2012; Picado et al. 2007). While the adapted Knox test improves upon this limitation as well, estimates for each scale must be computed separately through multiple tests (Ye et al. 2015). This is in contrast with the spatiotemporal K-function, which is less computationally intensive since it treats space and time continuously in a single test. As Diggle et al. (1995) describe the computation of the spatiotemporal K-function:

[T]he estimate of the second-order properties of the process is closely related to Knox’s statistic X, but interpreted as a function of spatial and temporal separation rather than for a fixed pair of threshold values [for space] and [time]. This interpretation provides a natural way of combining information from different spatial and temporal scales, whether for estimation purposes or to construct an omnibus test of space–time clustering. (p. 125)

It expands upon Ripley’s K-function, a technique that is traditionally employed to examine the degree of clustering or dispersion among a univariate, purely spatial point pattern (Ripley 1976).

The estimation of the bivariate spatiotemporal K-function involves three components:

  • K(s) = \(\frac{1}{\lambda }\) E[number of type j events within distance, s, of an arbitrary type i event]

  • K(t) = \(\frac{1}{\lambda }\) E[number of type j events within time, t, of an arbitrary type i event]

  • K(s, t) = \(\frac{1}{\lambda }\) E[number of type j events within distance, s, and time, t, of an arbitrary type i event]

where λ is the expected number of events per unit space per unit time, s is the user-specified distance from point i event to point j event, and t is the duration of time after event i. The number of events (N) and the area (A) may be used under the assumptions of homogeneity to estimate the intensity of the point pattern λ = N/A.

Figure 1 presents elements of K(s). The estimation of K(s) for the example of SQFs described below would be calculated only for SQFs falling within the bounding window around the area of interest, A. The spatial clustering of crimes (indicated by points j in Fig. 1) around SQFs (indicated by points i) is then examined within a given radial distance, s, of where the SQF occurred. The estimation of K(s) is only computed using information of where these events occurred irrespective of when they occurred. The elements in Fig. 1 are similar to K(s, t), but time is incorporated by having points i and j that are used in the estimation drawn from different time periods with a given lag time, t. For instance, a 2-day time lag would mean that only crime events occurring 2 days after an SQF occurrence and within the given radius would be used in the computation.

Fig. 1
figure 1

Elements of Ripley’s K-function estimation

\(\hat{K}(s)\), \(\hat{K}(t)\), and \(\hat{K}(s,t)\) are given by

$$\hat{K}(s) = \left| { A } \right|\{ n\left( {n - 1} \right)\}^{ - 1} \mathop \sum \limits_{j \ne i} w_{ij} I\left( {d_{ij} \le s} \right),$$
$$\hat{K}(t) = T\{ n\left( {n - 1} \right)\}^{ - 1} \mathop \sum \limits_{j \ne i} v_{ij} I\left( {u_{ij} \le t} \right),$$

and

$$\hat{K}(s,t) = \left| { A } \right|T\{ n\left( {n - 1} \right)\}^{ - 1} \mathop \sum \limits_{j \ne i} w_{ij} v_{ij} I\left( {d_{ij} \le s} \right)I\left( {u_{ij} \le t} \right).$$

If these point patterns are independent, \(\hat{K}(s,t)\) = \(\hat{K} (s)\hat{K} (t)\). Spatial and temporal edge effects are calculated separately. The weights, \(w_{ij}\), represent a correction for edge effects in estimating second-order properties, and the weights, \(v_{ij}\), serve the same role for the estimation of temporal properties (Lotwick and Silverman 1982; Ripley 1982).

As noted, a benefit of the spatiotemporal K-function over the Knox and Mantel tests is its ability to account for edge effects, which can affect statistical inference (Lynch 2006; Picado et al. 2007). Censoring bias (due to the inability of observing the full extent of an object that lies partially within the sampling window) and sampling bias (due to the likelihood of observing an object being dependent upon its size or shape) are two of these validity threats (Baddeley 1998). There are a number of edge corrections for estimating K. The isotropic correction originally proposed by Ripley is perhaps the most common, where the weight function, \(w_{ij}\), provides the edge correction. If the search circle with radius s falls completely within bounding window A, then \(w_{ij}\) has a value of 1. If part of the search circle falls outside the bounding window, then \(w_{ij}\) is proportional to the circumference of the part of the circle that falls within the bounding window (Lee et al. 2013). As with the univariate K-function, this correction can be employed to obtain K-estimates that are unbiased by edge effects (Pélissier and Goreaud 2015).Footnote 2 Due to the border correction, \(\hat{K}_{ij} (s,t)\) and \(\hat{K}_{ji} (s,t)\) are highly correlated but not identical (Arbia et al. 2008).

Instead of analyzing cross-correlations, the spatial and temporal dependence between the point patterns is assessed using the spatiotemporal K-function with the derived function:

$$\hat{D}_{0} (s,t) = \frac{{\hat{K}(s,t) - \hat{K}(s)\hat{K}(t)}}{{\hat{K}(s)\hat{K}(t)}}.$$

\(\hat{D}_{0} (s,t)\) represents the portion of the space–time correlation that remains after correction for purely spatial and purely temporal correlations, and it is a global measure that defines the scale and nature of the dependence between type j and i events. A \(\hat{D}_{0} (s,t)\) value of 1 is considered a strong space–time interaction or a doubling of the risk of a point j type event following a type i event (Diggle, personal communication, February 11, 2014; Lynch and Moorcroft 2008; Poljak et al. 2010; Sanchez et al. 2005).

The results between the spatiotemporal K-function and adapted Knox test may be similar (Ye et al. 2015), but are different under certain conditions. The Knox and Mantel tests tend to be more similar to the K-function when the possibility for edge effects is deemed low (such as with short distances, d) (Townsley et al. 2003) or when adequate statistical controls can be employed to overcome boundary problems (Johnson and Bowers 2004a). A study that compared the results of the spatiotemporal K-function and adapted Knox found that larger differences between the two techniques are expected when “the space and time spans go beyond half size of the study area and half of the study time period […] in order to generate large enough distortion for different boundary definitions to make a difference” (Ye et al. 2015, p. 213).

Dissimilar results may also arise under circumstances where the spatiotemporal K-function is more robust. First, the adapted Knox test is sensitive to seasonality, and requires the user to manually control for effects of weather (Johnson et al. 2007b). In contrast, \(\hat{D}_{0} (s,t)\) automatically eliminates correlations between the point patterns attributed to both temporal (e.g. weather) and spatial (e.g., elevation) covariates (Lynch and Moorcroft 2008). Second, Knox tests that use the Chi squared (Knox 1963) or Poisson (Knox 1964) tests to identify space–time interactions are at risk of overestimating statistical significance; the interaction between space and time are compounded with these tests as sample size increases (Levine 2004). The consequences of violating the assumption of independence exacerbates this issue because the unit of analysis in the Knox test is the point pair (not a single event), with each point contributing to n − 1 of the pairs considered (Grubesic and Mack 2008; Johnson et al. 2007a, b). Third, unlike the Knox and Mantel tests (Malizia 2012), another benefit of the spatiotemporal K-function is that it “is likely to be robust [to missing data] unless the underreporting [of said point patterns] is severe and confined to specific spatial locations and times” (French et al. 2006, p. 3).

Application of Space–Time Clustering Analyses

While limited, the use of an adapted Knox test to diagnose space–time clustering in criminology is increasing. It is most frequently implemented to test the near-repeat victimization hypothesis (Johnson and Bowers 2004a, b; Townsley et al. 2003; Zhang et al. 2015), particularly with residential burglary risk. Johnson and Bowers (2004a), for example, show that repeat burglaries tend to cluster within 2 months and 300–400 m of a given burglary. Grubesic and Mack (2008) compare the degree of spatial–temporal clustering among different crime types. They find strong spatial–temporal hot spots of burglaries (<650 m within 14 days), assaults (<161 m within 9 days), and robberies (>600 and <2500 m within 1 and 8 thru 14 days) within time and space. Wells et al. (2012) find that near-repeat gun assaults cluster in small spatial distances within a 28-day period. Lastly, Wyant et al. (2012) use the adapted Knox test to demonstrate police firearm suppression effects up to two blocks away from a firearm arrest (<6 days).

Applications of the spatiotemporal K-function are most commonly found in geographic epidemiology to study the transmission of disease across space and time. For example, French et al. (2005) use the approach to determine if spatial and temporal patterns interact to produce space–time clustering of equine grass sickness cases, whereas McNally et al. (2009) test the hypothesis that environmental factors contribute to primary biliary cirrhosis disease cases. Lynch and Moorcroft (2008) employ the space–time K-function to determine if western spruce budworm infestations affect the likelihood of a forest fire. Other studies apply the technique to explore the spread of foot-and-mouth disease (Picado et al. 2007; Wilesmith et al. 2003), outbreaks of bursal disease in broiler chicken farms (Sanchez et al. 2005), patterns of occurrence that would suggest infectious origin of type 1 diabetes (Zhao et al. 2002), and the transmission of porcine circovirus associated disease (Poljak et al. 2010) and Legionaries’ disease cases (Bhopal et al. 1992). A review of the literature did not find any applications of a bivariate spatiotemporal K-function in criminology.

Hypothesis Testing

Since approaches for testing null hypothesis correspond to different frameworks, it is important to select a relevant null hypothesis as it can change the interpretation of spatial relationships (Goreaud and Pélissier 2003). Two common methods for testing a null hypothesis with the bivariate extension of Ripley’s K are random labeling and independence (Diggle 1983; Dixon 2002). Marcon et al. (Forthcoming) describe the differences in assumptions (see Goreaud and Pélissier 2003 for further discussion):

The random labeling hypothesis considers that points preexist and their marks are the result of a process to test (e.g. are dead trees independently distributed in a forest?). The population independence one considers that points belong to two different populations with their own spatial structure and wants to test whether they are independent from each other. (p. 3)

Random labeling is induced by the random permutation of times, t i = 1, …, n, from the observed distribution (x and y coordinates held constant). For instance, random labeling assumes that different processes generate the patterns of SQFs and crimes, and under this method, the time of occurrence labels (day of year in this case) are randomly shuffled while holding the x–y coordinates of the SQFs constant. Departure from independence in this instance signifies that an interaction exists between SQFs and crime that displays attraction or repulsion. A number of studies have undertaken hypothesis testing using random labeling over a range of distances with the K statistic (Diggle et al. 1995; Gatrell et al. 1996; Lynch and Moorcroft 2008; Picado et al. 2007) and the adapted Knox test (Grubesic and Mack 2008; Johnson et al. 2009; Ratcliffe and Rengert 2008; Wells et al. 2012; Wyant et al. 2012).

Alternatively, independence testing randomizes the relative position of the points for each Monte Carlo simulation of envelopes at every distance lag, s (Lotwick and Silverman 1982). The simulation envelopes generated are applied in a two-sided fashion, and the null hypothesis is rejected when the observed patterns fall below or above the envelope (Dixon 2002). As such, independence hypothesis testing assumes that the process of assigning labels to SQFs acts independently of the locations of crimes, and departure from this is conditioned on the locations of events of the independent population (De La Cruz et al. 2008). Accordingly, this process would be induced by displacing all the locations of SQFs by a randomly chosen distance while holding the location of crimes fixed (Arbia et al. 2015).

A concern has been raised by Loosmore and Ford (2006) regarding the use of randomization for hypothesis testing with spatial point patterns, such as with the K statistic. They contend that the “use of CSR [complete spatial randomness] as a null model is uninformative in that no spatial processes are truly random” (p. 1929). They go on to demonstrate that a hypothesized spatial model calculated from CSR significantly underestimates the expected type I error rate—and the estimation of simulation envelopes over a range of distances (such as with the spatiotemporal K-function and adapted Knox test) to determine if the observed pattern deviates from the hypothesized model is also questionable. Loosmore and Ford (2006) present an example in prior research where the authors constructed 95th percentile confidence intervals (CI) using simulation, and 74 of 99 of those simulations of CSR contributed to the upper or lower CI for at least one spatial lag. This makes the type I error rate ~74 %, not the nominal 5 % that is expected. The type I error rate is also incorrect when inferring scale, such as a range of spatial distances, since “results at any distance reflect both the instantaneous value at that distance as well as the combined results from small distances […] Results for an observed pattern could therefore lie outside the envelope at a distance where the instantaneous value was not different than the specified model” (p. 1927). The use of the Bonferroni correction, which is sometimes employed to deal with the issue of multiple testing, is inappropriate for addressing the incorrect type I error rate since “both the correlation between results at consecutive distances which violates the assumption of independence, as well as the large number of distances being simultaneously evaluated” (p. 1927).

One way to preserve the type I error rate is to use an approach proposed by Loosmore and Ford (2006), which is to collapse down the information into a single statistic. A limitation of this approach, however, is that information on what spatial scale might be driving significant results is lost since only a single statistic is provided. Given these considerations, a method of hypothesis testing should be chosen when using the K estimate (or adapted Knox test) that is theoretically relevant and preserves the nominal type I error rate. We discuss such a method below.

Data and Methods

To provide an example of this method, the present study draws upon data from a study of SQF practices in NYC (Rosenfeld et al. 2012). SQF refers to instances in which an officer, typically as part of normal patrol, stops an individual and requests some information. These periods of detainment sometimes (but not always) involve a frisk of the individual for weapons. SQFs are sometimes called Terry stops, because the 1968 Supreme Court decision in Terry v. Ohio gave officers the right to stop and detain a person when there was reasonable suspicion that he or she was in the act of committing a crime or about to commit a crime (see Jones-Brown et al. 2010). SQFs have become extremely controversial, leading some scholars to argue that it is likely doing more harm than good (e.g. see Fagan et al. 2010). The approach has been criticized for targeting the young, minorities, and specific neighborhoods of the city (see Gelman et al. 2007; Ridgeway 2007; Stoud et al. 2011). SQFs in NYC provide an appropriate case for illustrating the space–time interaction method because they are targeted at specific places, and they are strongly confounded in space and time with crime (Weisburd et al., under review).

The specific focus of this research is a sample of cases drawn from a 150-day period in the Bronx, NYC in 2006. The analyses are limited to a specific time period and one NYC borough to make the computation and presentation of the analyses manageable. At the time of this study, the crime rate was higher in the Bronx than NYC citywide (5.44 vs 5.01 per 1000 people) (New York City Police Department 2014). The data included all non-traffic related crime incidents and SQFs that occurred in NYC for the time period of interest. The dataset provided includes x–y coordinates of SQFs and crime incidents, as well as their dates of occurrences.Footnote 3 Figure 2 presents a map of these crime incidents and SQFs by street segment. This 150-day study period had 45,027 crime incidents and 32,600 SQFs in the Bronx, for a total of 77,627 events. A time series plot of the total number of SQFs and crime incidents each day during the study period is presented in Fig. 3. The New York penal law classifications of the crime incidents were as follows: 13.8 % minor violations, 60.4 % misdemeanors, and 25.8 % felonies.

Fig. 2
figure 2

Spatial distribution crime incidents and non-confounded SQFs in the Bronx (Jan–Jun 2006)

Fig. 3
figure 3

Trends in crime incidents and SQFs in the Bronx (Jan–Jun 2006)

Estimation of Spatiotemporal K-Function

The following section describes the specifications of the spatiotemporal K-function that was computed to examine the likelihood of a crime incident following an SQF in space and time at a micro-scale. Detail is also provided on the two informative null spatial patterns to which this distribution is compared. Inferences will be made with these hypothesized models because they are more theoretically relevant than a distribution based on Monte Carlo/random labeling methods and do not suffer from incorrect type I error rate performance discussed by Loosmore and Ford (2006). All analyses were computed using the splancs package in R (R Development Core Team 2005) and K1D software (Gavin et al. 2006, 2008).Footnote 4 , Footnote 5

Average K-Function on Days with an SQF

\(\hat{D}_{SC} (s,t)\) represents the increased risk of a crime event due to an SQF at a spatial distance s and time lag t, where i is the SQF event and j is the crime incident. Simply put, the estimate provides an indication of the average likelihood of a crime event at various distances immediately surrounding an SQF the day(s) following its occurrence. \(\hat{D}_{SC} (s,t)\) was computed across a range of cut offs for the space and time dimensions. The selection of a temporal buffer was guided by literature indicating that deterrent effects of place-based interventions in targeted areas are short-term (Koper 1995; Sherman 1990; Weisburd et al. 2006) and, more specifically, by Wyant et al. (2012) (found that deterrent effects from gun arrests lasted up to 6 days). Nonetheless, the selection of the spatial and temporal buffers were exploratory given that there is limited research on the specific residual deterrent effect of SQFs (and police presence generally) at various time intervals and across space. The distance parameter, s, was estimated in 100 feet intervals that ranged from 0-feet (exact location of the SQF) to 500-feet. The maximum buffer radius is slightly longer than the average street segment in the Bronx, which is about 387 feet from intersection to intersection (Md = 284 feet; SD = 315 feet; n = 12,615 segments). The choice of these buffers related in part to the desire to limit the examination of deterrent effects to areas that would likely be clearly visible from the location of where the SQF occurred (Sherman et al. 1989; Sherman and Weisburd 1995; Weisburd and Mazerolle 1995), as well as because useful information in \(\hat{D}_{0} (s,t)\) is confined to small values of s and t relative to the spatial and temporal dimensions of A × (0, T) due to sampling fluctuations (Diggle et al. 1995). The time lag, t, increased in daily increments, ranging from 1 day (the day after the SQF) to 5 days following the SQF. These specifications produced a 5-row by 6-column array of 30 estimates.

One estimation problem in these data is that SQFs can co-occur close in space and time. Indeed, prior findings that SQFs represent a hot spots policing approach (Weisburd et al. 2014) mean that such clustering is very likely. While we recognize the importance of estimating models taking into account this clustering (see Weisburd et al. Under Review), for the purposes of illustrating the spatiotemporal K-function, we focus below on a subset of events in which we exclude any cases in which there will be multiple SQFs in the space and time buffers. In this example then, SQFs cannot be contaminated by the influence of neighboring SQFs. To do this, the Euclidean distance in space for each pair of SQFs was computed and points falling within a 250-foot buffer in the same 6-day period were removed from the dataset. This process identified 23,356 SQF events that had a subsequent SQF in the exact location within a given 6-day period, and another 4895 SQF events that had a subsequent SQF within a 250-ft distance during the same period. This left a total of 4349 SQFs that were not confounded by neighboring SQFs in time or space (see Fig. 4), which represents 13.3 % of SQF occurrences during the study period. Even though the selection of this subset may not be generalizable, it was chosen because it creates a straightforward measure of the K statistic without the complexity of potential confounding SQFs in the same area.

Fig. 4
figure 4

Trends in crime incidents and non-confounded SQFs in the Bronx (Jan–Jun 2006)

Average K-Function on Days Without an SQF

To determine if SQFs influence the likelihood of a crime incident, one option is an estimate of how much crime would be expected at these locations on days when an SQF did not occur. This null distribution has no space–time interaction and is deemed \(\hat{D}_{FC} (s,t)\). In this instance, crime incidents remain type j events but the technique also requires the user to input the time and location of type i events. Since we are interested in the likelihood of crime in the absence of a type i event, a dataset of fake type i events were generated for the analysis (x–y coordinates and corresponding time of occurrence). These events “occurred” every day a real SQF did not take place and were only generated at locations where an actual SQF occurred during the study period (based on x–y coordinates). However, since it is expected that police officers bring SQFs to areas where a crime has occurred and that there is an expected deterrent effect, dummy events were not generated on days close in time to an actual SQF occurrence (only crime levels at least 7 days before and after an SQF were assessed). As such, generating dummy type i events allows us to examine the likelihood of a crime incident in the absence of an SQF at the study locations. A subsample of 10,000 dummy events were randomly selected from this distribution for each month, and \(\hat{D}_{FC} (s,t)\) was computed with these events using actual crime incidents. This process was replicated 99 times (with replacement) and the final null distribution estimate of \(\hat{D}_{FC} (s,t)\) was taken as the mean of the 99 subsample estimates, with confidence limits derived from the 2.5 and 97.5 percentiles of the subsample distribution.Footnote 6

Average K-Function on Days Following a Crime Incident

A second benchmark, \(\hat{D}_{CC} (s,t)\), was also calculated to estimate the likelihood of a crime event following a given crime occurrence. This counterfactual was selected because police tend to respond to periods of high crime at these locations by increasing SQFs, and accordingly, the likelihood of a crime event the days following an SQF is also highly contingent upon the occurrence of a crime event. A random thinning operation (Lewis and Shedler 1979; Ogata 1981) was undertaken in which 500 crime events were retained each month and the likelihood of a crime event following these randomly selected points was estimated. The K-function is invariant over random thinning (Cressie 1993; French et al. 2005) because the retention or deletion of each point is independent of other points and the probability is proportional to the reciprocal of the estimated conditional intensity at that point (Schoenberg 2003).

To be considered for random selection, the crime event had to have taken place at locations where SQFs occurred in the present sample (based on x–y coordinates) but did not occur close in time to an actual SQF occurrence (only crime levels at least 7 days before and after an SQF were assessed).Footnote 7 This buffer was selected to ensure that effects would not be confounded by police bringing an SQF to a location as a result of a crime, nor would it be affected by residual deterrence effects from the SQF itself. A comparison of crime severity between SQF days and non-SQF days suggests that police are more likely to conduct an SQF on days when a more serious crime event has occurred, though the differences are not large: minor violation (10.0 % on an SQF versus 13.2 % on a non-SQF day), misdemeanor (65.4 vs 61.8 %), and felony (24.4 vs 25.0 %). The 500 randomly selected crime events were weighted to ensure that the severity of offenses from each subsample was similar to those on SQF days (13.2 % minor infractions, 61.8 % misdemeanor, and 25.0 % felonies). As with the prior null distribution, this process was replicated 99 times (with replacement) and the final null distribution estimate of \(\hat{D}_{CC} (s,t)\) was taken as the mean of the 99 subsample estimates. For a \(\hat{D}_{SC} (s,t)\) estimate to be considered significantly different from what should be expected, it must be an extreme value relative to \(\hat{D}_{FS} (s,t)\) and \(\hat{D}_{CC} (s,t)\).

Results

The results of the space–time K-function across time by distance lags are presented in Fig. 5. A comparison between \(\hat{D}_{SC} (s,t)\) and \(\hat{D}_{FS} (s,t)\) suggests a modest deterrent effect. The likelihood of a crime event is lower when an SQF occurred in contrast to a day a crime event occurred without an SQF across time and distance lags, with one exception (distance = 400 ft; lag = 4 days). The distribution of \(\hat{D}_{0} (s,t)\) at distance lag 0 is as follows: \(\hat{D}_{SC} (s,t)\) is 0.789, \(\hat{D}_{CC} (s,t)\) is 1.098 (CI95 = 1.088–1.107) and \(\hat{D}_{FS} (s,t)\) is 0.891 (CI95 = 0.889–0.892). This equates to the likelihood of a crime event after an SQF being roughly 30 % lower when compared to \(\hat{D}_{CC} (s,t)\) and 10 % lower when compared to \(\hat{D}_{FS} (s,t)\) (distance = 0 ft, lag = 1 day). \(\hat{D}_{SC} (s,t)\) declines until day 3 to the point where the likelihood of a crime event occurring is reduced by 42 % as compared to \(\hat{D}_{FS} (s,t)\) (CI95 = 0.886–0.890) and by 20 % as compared to \(\hat{D}_{CC} (s,t)\) (CI95 = 0.662–0.679) (distance = 0 ft, lag = 3 days). This suggests that the likelihood of a crime event following an SQF is significantly lower than any given day at these locations when an SQF did not occur and lower than when a crime event occurred without an SQF.

Fig. 5
figure 5

Cross sections of \(\hat{D}_{0} (s,t)\) distributions by distance lag across days. The confidence intervals for \(\hat{D}_{FS} (s,t)\) and \(\hat{D}_{CC} (s,t)\) would not be visible if included in this figure, but are provided in-text

A diffusion of benefits (Clarke and Weisburd 1994) effect is observed at distances within 300 feet of the SQF occurrence. But it appears to disappear by day 4. Both \(\hat{D}_{SC} (s,t)\) and \(\hat{D}_{CC} (s,t)\) experience a slight increase between days 3 and 4 (distance = 0 ft). Even though \(\hat{D}_{SC} (s,t)\) begins to increase, \(\hat{D}_{0} (s,t)\) remains below \(\hat{D}_{CC} (s,t)\) on day 4 and day 5 (distance = 0 ft). \(\hat{D}_{SC} (s,t)\) is 0.988 and 0.736, and \(\hat{D}_{CC} (s,t)\) is 1.00 (CI95 = 0.991–1.01) and 0.859 (CI95 = 0.849–0.879) on days 4 and 5 respectively. \(\hat{D}_{SC} (s,t)\) is most likely to approximate \(\hat{D}_{FS} (s,t)\) at 4 days after an SQF occurrence (distance = 0 ft). While attenuated at larger distance lags (distance > 0 ft), similar trends in crime reductions across all days are also evident.

Figure 5 also suggests that the observed deterrent effect of an SQF declines the farther away from the location of the event, and appears to have negligible influence on the likelihood of a crime event at distances greater than 300 feet from where the SQF occurred. This becomes more apparent in Fig. 6, which presents the space–time K-functions across distance by time lags.

Fig. 6
figure 6

Cross sections of \(\hat{D}_{0} (s,t)\) distributions by time lags across distance. The confidence intervals for \(\hat{D}_{FS} (s,t)\) and \(\hat{D}_{CC} (s,t)\) would not be visible if included in this figure, but are provided in-text

Between days 1 and 3, \(\hat{D}_{SC} (s,t)\) is notably lower than \(\hat{D}_{CC} (s,t)\) (distance < 400 ft). These differences become less marked as distance from the location of the SQF increases, whereby an SQF appears to have no effect on crime at distances greater than 400 feet from its occurrence. On day 3 in particular, \(\hat{D}_{SC} (s,t)\) is considerably lower than both \(\hat{D}_{CC} (s,t)\) and \(\hat{D}_{FS} (s,t)\), with the likelihood of a crime event following an SQF being much lower than expected at closer distances. \(\hat{D}_{SC} (s,t)\) estimates remain below \(\hat{D}_{CC} (s,t)\) for all points of observation except on day 4, where the likelihood of a crime occurrence following an SQF being 3 %, higher than if a crime event occurred without an SQF (distance = 400 ft; lag = 4 days), \(\hat{D}_{SC} (s,t)\) = 1.101; \(\hat{D}_{CC} (s,t)\) = 1.071 (CI95 = 1.067–1.075). Other than on day 4, \(\hat{D}_{SC} (s,t)\) estimates do not experience significant increases at greater distances, suggesting that notable spatial displacement is not observed at distances within 500 feet from the location of the SQF.

Discussion

This review of the space–time relationship between SQFs and crime in the Bronx, New York has provided a descriptive example of the more general application the bivariate spatiotemporal K-function. Prior work in criminology already demonstrates that there is significant potential for obtaining new and important information from exploring space–time interactions (e.g., Dario et al. 2015; Johnson and Bowers 2004a, b; Townsley et al. 2003; Wells et al. 2012; Wyant et al. 2012; Zhang et al. 2015). While this trend is new and important, the temporal component of data in criminology is still largely ignored (Ratcliffe 2010). An understanding of spatial–temporal relationships remains one of the most under-researched areas of criminology despite its vast ability to unlock “a better theoretical understanding of the role of geography and opportunity, as well as enabling practical crime prevention solutions that are tailored to specific places” (Ratcliffe 2010, p. 5).

The spatiotemporal K-function can also offer straightforward, policy-relevant information for practice. Pease and Laycock (1999) noted that the main barrier in crime prevention is that the police must “anticipate the place and time of occurrence” (p. 2). Further research using space–time methods will help police better forecast crime events. For instance, knowledge that burglaries generally tend to cluster in the same area for at least one month before moving to other areas can be used to prospectively allocate officers to at-risk areas. As Johnson and Bowers (2004b) suggest, forward-thinking “strategies used in the deployment of police or other resources based on the identification of clusters of near repeats would more precisely match the actual (and dynamic) spatial distribution of risk than do those using traditional hot-spotting techniques” (p. 251).

The public can also minimize their likelihood for victimization if the police communicate that they are currently at an increased risk and provide them with potential precautionary measures. It can also provide information on where to target place-based crime prevention interventions more generally (Wooditch et al. 2013). The spatiotemporal K-function will help gauge the cost-effectiveness of place-based interventions by providing a more nuanced understanding of treatment effects. For example, it will be particularly useful with place-based randomized experiments, such as with hot spots policing, since it would allow for a comparison between treatment and control conditions that accommodate the spatiotemporal element of the data. With this knowledge, the Koper (1995) curve may be modified to illustrate hot spots policing effects across time and space.

Spatiotemporal models also allow us to have a more fine-grained understanding of the relationship between interventions and outcomes. For example, studies of displacement of crime have generally used catchment areas to gain a general sense of whether and to what extent place-based interventions shift crime to other areas (e.g. see Braga 2007; Green 1995; Weisburd et al. 2006; Weisburd and Green 1995). Such areas are defined in terms of distance, for example, a single block or two, or according to an average distance from a geographic point. While such approaches allow us to gain a broad understanding of potential displacement or diffusion outcomes, they do not allow us to specify the specific temporal and spatial dimensions of the movement of crime. We found that the observed deterrent effects diffuse out from the point location of SQFs not confounded in time and space, but that such diffusion does not extend beyond 300 feet or a few days from the event. These findings are consistent with the general literature suggesting that spatial displacement of crime is an uncommon result of hot spots interventions and that diffusion of crime control benefits is more likely (Bowers et al. 2011; Braga et al. 2014; Clarke and Weisburd 1994; Guerette and Bowers 2009; Weisburd et al. 2006). More important, this method provides a more nuanced and finely grained portrait of the nature of the temporal and spatial outcomes than prior research in this area.

This approach also allows us to develop credible causal inferences absent experimental data. Many place-based crime prevention programs will not be amenable to experimental manipulation. The spatiotemporal approach presented in this paper offers an opportunity to focus on time and space units to a fine degree. In other analyses, we have found that SQFs and crime in NYC are strongly co-integrated at a monthly level (Weisburd et al. Under review), meaning that it is very difficult to disentangle the causal chain that links the two types of events. Certainly, an approach that uses long time spans, such as monthly data, will have difficulty distinguishing whether SQFs influence crime, or occur because crimes are committed at specific locations, whereas a spatiotemporal approach allows us to hone in on a specific time, in our case a day, when an SQF occurs at a specific point location.

The fact that the present study could demonstrate that crime declines on average after an SQF event and within a discrete spatial area provides more convincing conclusions than a similar finding at a larger geographic or temporal scale that did not include the space–time interaction. Without an experimental manipulation of the intervention, it is not possible to establish the causality of observations with certainty. Nonetheless, the tracking of the impacts in a short time interval and within specific geographic distances makes an inference of causality plausible (see Nagin and Weisburd 2013).

The present study argues that the spatiotemporal k-function offers new insights to understanding space–time interactions in criminology. Before concluding, it is important to note specific limitations of this study in the Bronx, as well as more general limitations of the method. Regarding analyses of SQFs, it must be recognized that SQFs may not be a robust indicator of police activity since a great deal of police presence is not captured by recorded police-citizen encounters and concerns have been raised about the completeness of these data (discussed further below). The present study assumes that SQFs are a treatment factor and everything else at these locations is random (irrespective of level of presence otherwise). One limitation of the approach is that it assumes that SQFs are indicators of police activity without making assertions regarding the amount of additional police activity at these place (ranging from none to a great deal). A further extension of the approach would be to condition this effect on the amount of police presence locations receive.

More generally, the focus in this paper was to illustrate the utility of the bivariate spatiotemporal K-function for assessing crime prevention outcomes, not to provide a systematic assessment of the impacts of SQFs on crime. Our choice in this regard to examine a sub-sample of cases in the Bronx that are not confounded in space and time made computational illustrations simpler, but it limits the ability to generalize to the population of cases.

Though the technique uses well-defined local areas around x–y coordinates to examine smaller scale effects, the unit of analysis may not be ideal for theoretical development. For instance, Taylor (1997, 1998) and Weisburd et al. (2012, 2014) argue that a theoretically relevant unit of analysis to study crime at micro-places are street segments (or street blocks) because they function as behavioral settings (Wicker 1987), whereby people get to know one another and become familiar with each other’s routines. As Weisburd et al. (2012) note further, “street segments […] provide a unit of analysis that ‘fits’ with both ecological theories and opportunities theories and that is capable of illustrating both bottom-up and top-down processes producing crime events” (p. 24). This limitation may be less of a concern given that relying upon predefined spatial buffers is claimed to be free of the modifiable areal unit problem, whereby conclusions are sensitive to variations in scale (the number of boundaries used) and zoning (how boundaries are drawn) (Gehlke and Biehl 1934; Openshaw and Taylor 1979). As such, the K-function yields information on a phenomenon across a finite geographic distance rather than relying upon artificially demarcated boundaries that can be redrawn and vary in size and shape (e.g., school districts, census tracts). While boundaries of a street segment are not artificial in the same sense, there can be large differences in the length of street segments between geographic regions (e.g., street segments in NYC will be significantly shorter than those in small rural communities) (Weisburd 2015). Accordingly, the study parameters can be easily replicated in other geographic areas without differences in the scope of the unit of analysis.

While the technique offers great ability to explore space–time relationships at a microgeographic scale that does not rely on administrative boundaries, a reliance on geocoded addresses (as is often the case in the field of criminal justice) to compute the K-function is subject to bias due to the positional error of points. A common source of error is the geocoding process itself, which is the practice of locating street addresses on a network database and then assigning corresponding geographic coordinates (e.g., identifying the latitude and longitude of 123 Main Street). One study estimates that the median offset between a geocoded point and its true location is between 84 and 102 feet (Schootman et al. 2007), depending on geocoding method. This type of bias may be minimized through such measures as increasing hit rate (Zhan et al. 2006), using commercial software to geocode (Schootman et al. 2007), or selecting a more urbanized study area (Bonner et al. 2003; Cayo and Talbot 2003). Police departments can also assist researchers by improving the accuracy of reporting the location of events, such as requiring officers to consistently report the actual location of a crime incident occurring on a street segment rather than a general location (e.g., 100 block of Main St) or its nearest intersection (Tarling and Morris 2010).

As with any analyses, a potential threat to causal inference when using the space–time K-function is missing or inaccurate data. For instance, even though the NYPD is mandated by a settlement in Daniels et al. v. City of New York et al. to document stops where a frisk or more extensive search of an individual is made (Jones-Brown et al. 2010), there have been a number of concerns and substantiated claims that SQFs are underreported in NYC (New York Bar Association 2007; Rudovsky and Rosenthal 2013; Schneiderman 2013). The completeness of the current SQF data is unknown (Rosenfeld and Fornango 2014). Since there is no evidence to suggest that the underreporting of SQFs is severely confined to specific places or times, it is believed that the study findings are relatively unaffected by any systematic undercounting of SQFs by the NYPD (French et al. 2006; Schabenberger and Pierce 2001), but the strength of the findings must be tempered by the potential shortcomings of the data source.

Despite these noted limitations, the evolution of spatiotemporal methodologies is of great utility to the field. The spatiotemporal K-function is an important next step in evolving current space–time approaches, but there are still opportunities for this technique to improve. Further refinement of the spatiotemporal K-function will involve the following (see also Picado et al. 2007): (1) the ability to examine relationships in both time and space while adjusting for potential confounders (the endogeneity problem); (2) enhanced capability of the test to detect pertinent changes in spatiotemporal patterns; (3) ability to associate these patterns to certain variables of interest (such as the duration of the event rather than just its occurrence); and (4) development of a software package to estimate \(\hat{D}_{0} (s,t)\) that does not require the user to manually adjust for the time element of the data.

Conclusions

The central aim of this manuscript was to illustrate the opportunity for the spatiotemporal K-function to aid the examination of space–time relationships in criminology and for developing stronger evaluation methods for place-based interventions in crime and justice. A number of statistical methods have been used to test for and examine space–time interactions. The spatiotemporal K-function overcomes many limitations of prior approaches. The technique has proved to be a flexible and suitable approach for second-order analysis of two spatial point patterns. While the technique cannot yet control for potential confounding factors, the fine-grained view it provides of what is occurring across time at a microgeographic level allows researchers to make believable and plausible inferences. As more spatiotemporal data become available, it is important that criminologists capitalize on statistical techniques that accommodate space and time dimensions of data simultaneously. Taking advantage of new developments in spatial statistics, such as the bivariate spatiotemporal K-function, is critical to the advancement of quantitative criminology and in particular to the evaluation of place-based criminal justice interventions.