Introduction

An increased prevalence of obesity in the United States over the last two decades has led to hundreds of thousands of excess deaths annually [13]. This increase has coincided with changes in home, school, community and other environments that influence diet and health-related behavior [4, 5]. The “community nutrition environment” (CNE), which defines the number, type, location and accessibility of food outlets, is of particular interest to both researchers and policy-makers since the ubiquity of unhealthy foods and lack of access to healthy foods may contribute to the obesity epidemic [69].

Prior reviews of food environments and obesity have not focused on the CNE, have included only a relatively small number of studies (<20), or were conducted before 2010 [911]. The burgeoning interest in food deserts and interventions on access to healthy food among both researchers and practitioners demands the most up-to-date evidence. Moreover, evidence on the food environment is being applied in increasingly proactive ways, such as in Health Impact Assessments (HIAs). HIAs use existing research—often relying on systematic reviews and meta-analysis—to estimate future effects of policies, plans and projects. This demand for timely and robust data has reinforced the need for research frameworks, study designs and metrics that maximize comparability across studies. To fill these gaps, we reviewed studies of the CNE and obesity, examining the methods, metrics and results.

Methods

We searched the MEDLINE database using the terms “Access to Healthy Foods”, “Built Environment”, “Food Access”, “Food Environment”, “Supermarket”, and “Food Desert”, paired with “BMI”, “Diet”, “Obesity”, and “Overweight”, for a total of 24 queries. We limited the search to peer-reviewed journal articles in the English-language and conducted in human populations between July 1, 2002, and July 1, 2012. From a total of 673 articles, we excluded 552 based on the article’s title and abstract because they did not directly analyze the food store number, type, location or other aspect of the community nutrition environment. Of the remaining 121 articles, we excluded 72 that were conducted outside the United States, did not spatially analyze the food environment, or did not have an obesity-related health outcome. From the remaining 49 articles we used the “snowball” technique and checked all citations to include an additional 21 articles. Of these final 70 studies, 13 were excluded because they were review articles, two were excluded because the obesity outcomes were projections rather than direct measurements (computer simulations), two because the measures were put into an index so direct interpretation was not possible and two more because although they spatially analyzed the food environment, they did not measure food store presence, frequency or distance. A total of fifty-one remaining relevant articles were included in the review. (Figure 1, Appendix 1).

Fig. 1
figure 1

Study exclusions and inclusions

For each study we classified the technique for spatially measuring food stores as frequency, proximity, or presence. Frequency includes count, density, and ratio measures. We also identified the type of food store. Studies often applied one measurement technique (e.g. presence) but examined multiple associations, stratifying by race, gender, income, or other demographic or socioeconomic variables. Studies also applied different types and sizes of buffers around areas of interest. A buffer is a distance around a point of interest such as a home or school that is used to represent an estimate of that individual’s food environment. A Euclidean buffer encompasses points of interest within a straight-line radius while a network buffer represents the distance around one point one could travel accounting for the roads and available walkways.

For each combination of food store type and measurement technique, we characterized a study as having a positive, negative or null finding depending on the observed associations between the food store and obesity. If any association was found to be positive or negative for a certain food store type and measurement technique in a study, the study was considered to have found a positive or negative finding between that food store type with that measurement technique, and obesity. For example, if a study analyzed the association between the proximity to a fast food restaurant and obesity for males and females and found a significant association in just one gender, the study was recorded as having found a positive finding for measuring the relationship between proximity to fast food restaurants and obesity. This criteria was applied regardless of the number of associations tested, meaning if a study analyzed the association between the presence of a convenience store and obesity at three different buffer lengths and separately for males and females, if at least one association was found to be positive the study was recorded as having a positive finding regarding the association between the presence of a convenience store and obesity. We calculated the proportion of studies that found at least one significant association between the CNE and obesity in the expected direction from the total number of findings. We looked at findings across the six most common store types (fast food restaurant, convenience stores, small food stores, grocery stores, supermarkets and full service restaurants) and measurement techniques (frequency, proximity and presence). In one instance a study found both a positive and negative finding for a food store type and food store measure and was included in the review as a positive and negative finding [12].

All but two studies used BMI to determine obesity or overweight status. The formula to calculate one’s BMI is weight (kg)/[height (m)]2. All studies used BMI cutoffs consistent with the CDC’s definition for BMI Z-Scores or adult BMI where a BMI between 25.0 and 29.9 is indicative of being overweight and a BMI ≥30 indicates obesity [13]. Alternate methods included using a maternal weight >200 lbs. as a proxy for obesity [14], and using different standards that adjust for age and gender in assessing the weight status of adolescents [15].

Results

Study Parameters

Of the 51 articles examined (Table 1), 32 analyzed adults and 19 investigated children and/or adolescents. Forty studies used a cross sectional design, four used repeated cross sections, and seven used a longitudinal design. We abstracted details on the methods and results focusing on the geographic scope of each study, referring to the bounded area that the author wished to make inferences about, and the level at which the food environment was measured. The scope of the studies ranged from cities to the entire United States, with the most common being cities (n = 9), counties (n = 7), states (n = 12) and the nation (n = 13). Within these scopes the food environment was analyzed at different levels, most commonly homes with surrounding buffers (n = 13), census tracts (n = 11), and counties (n = 8).

Table 1 Number and proportion of studies by study parameter

Measuring the Community Nutrition Environment

There was substantial variation in how food stores were spatially measured, located, and classified (Table 2). The CNE was defined by presence (n = 13), proximity (n = 13) and three frequency measures, density (n = 23), count (n = 16), and ratios of stores (n = 3). Sixteen studies used a combination of these spatial methods. Density refers to the number of stores within a specified area measure such as a census tract, or one square mile. Density was separated into three categories: the number of stores per capita (n = 12), the number of stores per area (n = 11), and all other measures of density (n = 2). Count refers to the number of stores within a specified distance, for example the number of stores within a 1 mile buffer of a home or school. Ratios were typically used as the number of healthy stores divided by the number of unhealthy stores. Presence is a binary measure that indicates the presence of at least one store in a specified area. Proximity is a measure of the distance between a store and a point of interest such as a school or home.

Table 2 Technique for measuring food store frequency, proximity or presence, location, and type (n = number of studies)

To locate food stores, studies used commercial databases (n = 25), government sources (n = 25), field measurements (n = 4), or the Yellow Pages (n = 7). Nine studies used a combination of these methods. Only two Dun and Bradstreet (n = 10) and Info USA (n = 9) of the seven databases were used in multiple studies. Studies determined food stores type using government coding systems including North American Industry Classification System (n = 21) and Standard Industrial Classification codes (n = 13). However, actual codes were provided in just 56 % of studies.

Buffer Zones

Among 20 studies using buffers there were fourteen different buffer sizes ranging from 1/10 of a mile to 10 miles. One mile (n = 9), 1/2 mile (n = 10), and 1/4 mile (n = 7) were the most commonly used buffer sizes. Network buffers, which measure distance using streets and sidewalks, were applied in nine studies and Euclidean buffers, which measure straight line distance, were used in eight studies. In one study both buffer types were applied and in two other studies the buffer type was unknown. Seven of the nine studies that used Network buffers found at least one significant association while five of the eight studies that used Euclidean buffers found at least one significant association. Five studies found varying results depending on the buffer size used.

Measured Effects

The relationship between the CNE and obesity varied widely across the different types of food stores and measurement techniques (Table 3). Fast food restaurants (n = 48), convenience stores (n = 30) and supermarkets (n = 29) were the most frequently tested store types, where these numbers represent the number of findings found across the studies. Fast food restaurants and convenience stores are hypothesized to contribute to the consumption of unhealthy food and obesity, while supermarkets are hypothesized to increase the consumption of fruits and vegetables and therefore promote a healthy weight. Studies analyzing the presence of convenience stores (n = 7), fast food restaurants (n = 7), and supermarkets (n = 9) corroborated these hypotheses 71, 71, and 38 % of the time respectively. Among the six most commonly analyzed food stores—fast food restaurants, convenience stores, supermarkets, grocery stores, small grocery stores, and full service restaurants −32 % of findings found at least one significant association with obesity in the expected direction, 10 % of findings found at least one significant association in the unexpected direction, and 58 % of findings found no association. A finding refers to each time a study analyzed a specific food store type using frequency, proximity, or presence. If they found at least one positive or negative association, they were recorded as finding a positive or negative finding, respectively. Among studies that found a statistically significant association between the CNE and obesity, the effect sizes were typically small and the p-values were usually borderline significant. For example, three studies (ID#’s 30, 33, 38) found fast food restaurant frequency to be significantly associated with odds of obesity and provided odds ratios. These odds ratios ranged from 1.05 to 1.79 and the lower 95 % confidence limit were consistently close to 1.0. Reference groups for these calculations varied, with some studies using quartiles of fast food restaurant frequency and others using different population subsets.

Table 3 Study results by food store type and measurement of frequency, proximity or presence

Discussion

Overview of Study Results

The initial purpose of this review was to examine the effects of the CNE on obesity. However the heterogeneity of the metrics applied precluded a conclusive summary of effect size and therefore we focused on describing the application of metrics across studies. Although the methods and results of individual studies were inconsistent, as a whole this body of research suggests that food environments are associated with obesity. Most studies (80 %) found at least one statistically significant association between the food environment and obesity. However, many studies looked at multiple associations with obesity, using different food store types and measurement techniques including frequency, proximity, and presence. When considering the balance of positive, negative and null findings among the studies and the varying store types and measurement techniques, the percent of statistically significant findings in the expected direction drop to 32 %.

Results varied widely by food store type and measurement technique. Thirty-seven percent of findings from studies analyzing the association between the frequency of fast food restaurant stores and obesity found a positive correlation. This was the largest percent of findings found in the expected direction for any specific food store type and measurement technique where there were ten or more findings. This may be because fast food outlets act as both a direct determinant of obesity and as a proxy for other obesity-related neighborhood conditions [16]. Among all studies, including those where there were <10 findings, studies using the presence measure to assess the CNE were the most likely to have significant findings in the expected direction. This may be because the presence of a store best captures an individual’s ability to access certain foods. We found that compared with adult studies, those of children and adolescents were less likely to observe a significant finding relating obesity to the food environment. This may be due to several things, including children’s’ participation in school breakfast and lunch programs which crowd out opportunities for fast food consumption. However, the variation in metrics along with the different food stores examined across these studies made it difficult to interpret these comparisons.

Study Designs

The vast majority of studies were cross-sectional, which provide a limited view of the relationship between the food environment and obesity, since they cannot draw causal inferences or directionality. Longitudinal studies can further elucidate the relationship between CNE and obesity. We found that four of the seven studies using a longitudinal study design were conducted between during or after 2011, suggesting increased investment in this more rigorous design.

Defining the Consumer Nutrition Environment

Expanding the CNE to include more food store types and considering the accessibility of food stores would lead to more accurate depictions of one’s food environment. Determining a population or individual’s car ownership status also has implications for determining an appropriate buffer size.

More recent studies are measuring a broader community nutrition environment including farmer’s markets [15, 17] emergency food stores [18], flea markets [19], direct farm sales [20], and other unaddressed food sources. Other potential food sources such as community gardens are less well-studied. Including more food store and food source types will yield a more accurate measurement of the community nutrition environment.

Variables that influence people’s ability to get to food stores are an important but often unmeasured component of food access [21, 22]. We found that studies did not consistently report whether people had access to an automobile. Additionally, the majority of studies did not address public transit and other components of the built environment that may support food procurement. Although some of this data is available from the census (car ownership) or direct observation (bus stops and transit stops), the resources required to obtain and use this data for defining the CNE are unclear. The lack of such data will continue to limit the ability of researchers to draw inference regarding the relationship between the CNE and obesity.

Researchers applied different buffer types and varying buffer sizes often without justification. Research suggests a network buffer is typically preferable to a Euclidean buffer as it accounts for the city streets and actual path a person would take to get to a food store [15, 23]. However, recent studies continue to implement Euclidean buffers without discussion [17, 24]. The size of a buffer should reflect a population’s age, their transportation abilities, and existing research. Of 2,384 adult participants of The Multi-Ethnic Study of Atherosclerosis, 47 % indicated they did most of their food shopping within one mile of their home [25]. Based on this example, researchers need to apply appropriate buffer sizes to adequately describe individual’s food environments. Although the different buffer sizes and types did not appear to influence effect size and significance, the issue of determining an appropriate buffer size and type highlights the challenges of measuring the CNE for diverse populations with different abilities to procure food. However, the justification of buffer types and sizes for certain food store types and populations would likely provide more realistic measures of access. Moreover, standardization within the tools used to calculate and measure buffers could support comparable studies. A new buffering method called the “sausage” network buffer may be more easily replicable across different software and yield results similar to network buffers [26]. A “sausage” network buffer is created by buffering all roads out a certain “street distance” from the starting point, and for a radius of some number of meters on each side of the road center line. Although this method still needs to be evaluated, it can be used across different mapping software which could lead to more consistent data analysis.

Classifying Food Store Types

Identifying food store types is also a crucial aspect of interpreting a community nutrition environment. Unfortunately, there is not a consistent definition for fast food restaurants [27]. Often, definitions are derived from the North American Industry Classification System (NAICS) code for limited service restaurants. However, we could not confirm definitions since <2–3 of studies we reviewed that used NAICS codes, identified the codes they used. Moreover, many researchers chose to modify the “fast food” category, often forming their own unique definition of a fast food restaurant [15, 28]. Researchers have also created new categories of food stores by combining NAICS codes, such as supercenters and supermarkets [29], and by further categorizing stores initially defined by NAICS into independent and chain stores [30]. Beyond the codes, researchers could create further heterogeneity by classifying based on store name. For example, one researcher may include a pizzeria or doughnut shop into their “fast food” category, while another may not. We found thirty-two different food store types/definitions (Appendix 2) in the fifty-one studies we reviewed, a large number that can be partially attributed to these varied classification practices. Consensus on standardized definitions will produce comparative research to advance the field. Ohri-Vachaspati et al. [31] proposed a method for systematically reclassifying commercial database information to make it easier to use for research, however continued research is needed to explore the utility and validity of such methods.

Locating Food Stores

Commercial and public databases used to identify food stores have been reported to be inaccurate and dissimilar [25, 3236]. For example, when checked by site visit, Dun and Bradstreet measures of CNE had a positive predictive value of 78 %, but a sensitivity of 55 % [37]. Larger discrepancies are often observed when testing the accuracy of public datasets, and internet sources such as the Yellow Pages may be accurate only 50 % of the time [38]. These statistics are alarming considering this review has shown that the majority of studies (41 of 51) used only one measurement tool for locating food stores. As other researchers have noted, future studies should use multiple measurement tools for locating food stores to improve study validity [37, 39].

Analyzing the Community Nutrition Environment

The studies we reviewed used various combinations of counts, densities, ratios, presence and proximity (distance) to measure the CNE. It is unclear which of these methods is preferable and likely depends on the particular research question and the type of store being examined. For example, given the surrounding transportation and land use characteristics, distance might be a more relevant measure for grocery stores while density might be better for fast food. It is also unclear whether density or proximity might be more beneficial for certain populations or geographic areas. Many authors did not justify the method they were using to measure the CNE. Further research should distinguish the differences in effect sizes of density versus distance. Densities measured the number of stores within a certain area, and sometimes included the number of people in that area (density per capita). Other options such as kernel density measure the number of stores in a buffer zone surrounding a central point but then weights these stores differently depending on their spatial relationship to the center point [40]. A better understanding of the constraints and applications of these and other measures will aid the development of new techniques.

Temporal Acquisition of Data

Five studies, although cross-sectional, used exposure (CNE) and outcome (obesity) measures from different time periods [18, 34, 4143]. For example they may have used current information regarding the obesity status of an area, but used data on store type and location from a different time period. Three of these studies found significant results in the unexpected direction. Future studies should account for changes of the food environment over time as it can change rapidly [44].

Other Considerations

Studies did not always describe every association tested or define the food stores they were examining. Therefore our conclusions regarding the number of positive and negative findings should be interpretably carefully. For each food store type and measurement technique, a study was considered to have a positive or negative finding if just one association was found to be positive or negative; this biased our results to report more positive and negative findings than null results. Although a limitation for conducting a meta-analysis, this did not limit our ability to analyze the varying metrics studies implemented.

Conclusion

Our findings provide additional, updated evidence supporting the association between the CNE and obesity and our review has identified methodological issues for improving research. Although none of the studies used an experimental design, most of the studies we reviewed controlled for myriad factors influencing the relationship between the CNE and obesity. Still, unmeasured components of the CNE and other parts of the food environment such as the “consumer nutrition environment”—which looks at price, promotions, and nutritional quality—are important considerations in determining access to food. These have been studied elsewhere [4548]. Most studies we examined found an association between the community nutrition environment and obesity despite the heterogeneous metrics, designs, and scopes of the studies. While the volume of research on the effects of the CNE on obesity has expanded, a concerted effort to standardize metrics and methods would increase comparability and certainty in describing this relationship. This would also serve the needs of practitioners in communities, who are increasingly using tools such as the Food Environment Atlas to create local estimates of food access and understand how it may change with certain policies.