Introduction

Ambient air pollution (AAP), also known as outdoor air pollution, is ubiquitous (Polichetti et al. 2013) and has become a global public health problem (Brunekreef and Holgate 2002; Kampa and Castanas 2008). It has been well documented that exposure to AAP was associated with a number of adverse health outcomes, such as respiratory and cardiovascular diseases (Brook et al. 2004; Dominici et al. 2006; Polichetti et al. 2009), and even mortality (Hoek et al. 2013). Because developing fetuses are more vulnerable than children and adults, many studies have investigated possible adverse influences of AAP on birth outcomes since the mid-1990s (Li et al. 2017; Ritz and Wilhelm 2008; Shah et al. 2011; Srám et al. 2005), including low birth weight (LBW, newborn weighted less than 2500 g or 5.5 lb at birth), small for gestational age (SGA), preterm birth, and birth defects.

Because LBW infants may experience higher risk of mortality and/or morbidity in childhood (McCormick 1985; McIntire et al. 1999; Watkins et al. 2016), and are more susceptible for other diseases than normal-weight infants, including stroke (Lawlor et al. 2005), coronary heart disease, hypertension, type II diabetes (Whincup et al. 2008), delayed motor/social development (Hediger et al. 2002), and learning disabilities (Litt et al. 2005), LBW is considered as an important predictor of an infant’s health. Statistics suggested that 2.6% of live term singleton births were LBW births in the United States (U.S.) during 2000–2015 (U.S. CDC 2018).

Pollutants in the air can be divided into two categories: (1) six criteria air pollutants (CAPs) designated by the Clean Air Act and subsequent amendments (including particulate matter (PM2.5, PM10), ozone (O3), nitrogen dioxide (NO2), sulfur dioxide (SO2), carbon monoxide (CO), and lead (Pb)) (U.S. EPA 2017) and CAPs-related air pollutants, and (2) other air pollutants (non-criteria air pollutants). Based on publication records related to birth weight from the Web of Science database covering years 1904–2017 (Table 1), most studies reported in the literature focused on CAPs rather than non-criteria air pollutants (Wilhelm et al. 2012).

Table 1 Ambient air pollutants in publications related to birth weight as of December, 2017

The associations between maternal exposure to the six CAPs and LBW have been investigated in a large number of studies (430 studies in total as of the end of 2017). For example, Salam et al. (2005) found that O3 exposure during the second and third trimesters and CO exposure during the first trimester were associated with increased LBW levels; Xu et al. (2011) discovered that exposure to PM10 during pregnancy might increase the risk of LBW. Ebisu and Bell (2012) identified associations between PM2.5 components and LBW in the northeastern and mid-Atlantic regions of the U.S. Dedele et al. (2017) showed that increased maternal exposure to NO2 tended to increase LBW risk in offspring. Around 12% of these studies have also examined components or combination of the CAPs (CAPs-related air pollutants), including total suspended particle (TSP) (Bobak 2000; Lee et al. 2002; Wang et al. 1997), black carbon (Brauer and Lencar 2008; Paciorek 2010), black smoke (Pearce et al. 2012; Stankovic et al. 2011), among others.

However, the examination of association between maternal exposure to non-criteria air pollutants and LBW in offspring has been reported in a limited number of studies (only 40 studies as of the end of 2017). Among these studies, 9 of them investigated benzene, toluene, ethyl benzene, and xylene (BTEX) and 11 of them studied polycyclic aromatic hydrocarbons (PAHs). For example, airborne benzene exposure was found to be associated with increasing odds of a LBW event (Slama et al. 2009; Zahran et al. 2012). Jedrychowski et al. (2017) found that exposure to PAH was inversely associated with birth weight. Aguilera et al. (2009) linked an increase in BTEX exposure levels to reductions in birth weight for women who spent < 2 h/day in nonresidential outdoor environments. However, other non-criteria pollutants were hardly mentioned.

One possible explanation for the unbalanced number of studies on the two categories of air pollutants was the lack of high-quality air monitoring data covering non-criteria air pollutants. The extensive monitoring network of CAPs with finer spatial-temporal sampling resolution has significantly facilitated the estimation of exposure to CAPs (Gong et al. 2016). Although 18 non-criteria air pollutants have been studied (Table 1), these pollutants only accounted for a limited portion of the total non-criteria pollutants released into the air. Based on information from the databases maintained by the toxic release inventory (TRI) program of the U.S. Environmental Protection Agency (U.S. EPA), there have been over 650 non-criteria pollutants from industrial facilities in the U.S. and 449 in Texas (U.S. EPA 2013). It is hypothesized that some of these pollutants (TRI chemicals) could possibly exert a negative impact on birth weight. Therefore, this study attempts to identify associations between maternal residential exposure to TRI chemicals during pregnancy and LBW in offspring using a case-control study design based on the analysis of massive georeferenced data over a 13-year period in Texas from 1996 to 2008.

Study area, data, and methods

Study area

This study chose the state of Texas in the U.S. as the study area (Fig. 1). Among the 48 states in the contiguous U.S., Texas is the largest by area and second largest by population. Two datasets in Texas were used, including air emission data from industrial facilities and birth data.

Fig. 1
figure 1

Geographic distribution of toxic release inventory (TRI) facilities in Texas that reported emissions during 1996–2008

Air emission data of industrial facilities

This study obtained air emission data of Texas industrial facilities from the U.S. EPA TRI program. The TRI program, a mandatory program established by Section 313 of the Emergency Planning and Community Right-to-Know Act (EPCRA), requires industrial facilities in the U.S. to report annual information about their names, locations, reporting years, types of chemicals released, and estimated quantity of a chemical released into the environment (U.S. EPA 2013). Based on reports during 1996–2008, 1286 to 1635 Texas industrial facilities reported air emissions to TRI program each year. Industrial facility addresses were geocoded and maintained at Texas State University (Zhan et al. 2015). During 1996–2008, 89.66% of the TRI facilities were successfully geocoded on average (Fig. 1). A TRI air emission geodatabase was constructed for the purpose of this study, containing both geographic locations of TRI facilities (Fig. 1) and non-spatial data summarizing the annual air emission amount of the 449 TRI chemicals released from industrial facilities in Texas during 1996–2008.

Birth data

This study obtained birth certificate data for all registered births during 1996–2008 in Texas from the Center for Health Statistics in the Texas Department of State Health Services (DSHS). Each birth certificate record contained information of maternal residential address at delivery; birth weight; birth year; plurality; child’s sex; gestational age in weeks; mother’s characteristics (age, race/ethnicity, education, marital status, and tobacco use during pregnancy); and father’s characteristics (age, race/ethnicity, and education). This study excluded births of weight > 5500 g or < 1000 g (0.1%), births with gestational age > 44 weeks or < 37 weeks (17.8%), births with incomplete location information (10.9%), plural deliveries (2.7%), or births given by non-Texas residents or occurred outside of Texas (0.2%). Only births with weight < 2500 g were considered as the LBW cases and used in this study. This study matched LBW cases and controls by year of birth (1996–2008) and selected four controls for each case to ensure sufficient study power. Control births from the same birth year were randomly selected from the birth certificate data. The maternal addresses of all cases and controls were geocoded to their exact geographic locations (latitudes and longitudes) by the Texas DSHS.

Air pollution exposure assessment

A modified version of the emission-weighted proximity model (EWPM) (Gong et al. 2016; Zou et al. 2009b) was used to estimate exposure intensities to a given TRI chemical at the location of a maternal residence for each case and control. This modified version of the EWPM model takes into account both the distances between the location of a maternal residence and industrial facilities within an effective distance as well as the rate and duration of a specific chemical released into the air by each facility annually. The formula for the EWPM model is given by Expression (1) below.

$$ {A^{\theta}}_i=\sum \limits_{j=1}^m{E^{\theta}}_{ij}\times {T^{\theta}}_{ij}\times \left(\left({k}^{\theta }-{D}_{ij}\right)/{k}^{\theta}\right),\kern1em \mathrm{for}\kern1em \left({D}_{ij}\le {k}^{\theta}\right) $$
(1)

where Aθi is the estimated exposure intensity to chemical θ of a person at residence location i, represented by the quantity of chemical θ reaching location i from all emission sources (j, j = 1, 2, … …, m) within an effective distance (kθ); Eθij and Tθij are the emission rate and duration of emission of chemical θ from emission source j that is within the effective distance (kθ) of location i; Dij is the distance between location i and emission source j; kθ is the effective distance beyond which chemical θ is considered to have no harm on an individual (Gong et al. 2016; Zou et al. 2009b). This study used an effective distance (kθ) of 10 km.

Identification of the most likely potential risk factors

This study used odds ratios (ORs) from a logistic regression to assess the associations between exposure to different TRI chemicals and LBW in offspring first, and then identified chemicals with the highest positive statistically significant ORs as the most likely risk factors associated with LBW. For each TRI chemical, cases and controls were divided into two groups: exposed (cases and controls with estimated exposure intensity value to a given chemical greater than zero) or unexposed (cases and controls with estimated exposure intensity value to a given chemical equal to zero). The binary logistic regression analysis used the unexposed group as the reference group. The logistic regression analysis was adjusted for several covariates, including child’s sex, gestational weeks (37–44 weeks), maternal age (11–19, 20–24, 25–29, 30–34, 35–39, > 39 years), education (< high school, high school, > high school), race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, other non-Hispanic), the public health service region of maternal residence (11 regions in Texas as 11 categories), and year of birth (1996–2008 as 13 categories). We selected these covariates for the analyses based on recommendations in the literature (Bell et al. 2007; Brender et al. 2014; Valero De Bernabé et al. 2004). In the next phase of the analysis, this study used values of adjusted odds ratios (aORs) to rank the TRI chemicals. Ten chemicals with the highest positive statistically significant aORs from the 449 TRI chemicals were selected for additional epidemiological analysis.

Additional epidemiological analysis

For each of the ten identified TRI chemicals, this study categorized the estimated exposure intensity values for all cases and controls into four levels: one unexposed group and three exposed groups. The cases and controls in the unexposed group were those with zero estimated exposure intensity for a given chemical. The ones with estimated exposure intensity value greater than zero were divided into three intervals in such way that each interval contained approximately the same number of controls. In epidemiology, it is common to categorize exposure levels based on the distribution of exposure levels in controls because controls are thought to most closely represent the underlying population. The four levels of exposure are called zero, low, medium, and high exposure in the rest of this article. In the next step, binary logistic regressions were used to estimate the associations (aOR and 95% confidence interval (CI)) between maternal residential exposure to these ten chemicals and LBW in offspring. The unexposed groups were considered as the reference groups in the analyses. The aORs were adjusted for the same covariates listed in the previous section. In addition, a multiple comparison correction was conducted to evaluate the results from the regression analysis.

Results

There were a total of 94,106 term LBW cases and 376,424 controls that were frequency-matched to cases by year of birth in this study. Table 2 compares the LBW cases and controls by child’s sex, mother’s age at delivery, mother’s race/ethnicity, gestational length, year of birth, mother’s education, and public health service region of maternal residence at the time of delivery. In this study population, LBW cases were more likely to be female and have shorter gestational length than controls. The mothers of the LBW cases were also more likely to be non-Hispanic black, younger at age of delivery, or less educated when compared to the control-mothers.

Table 2 Descriptive statistics of low birth weight cases and frequency-matched controls, Texas, 1996–2008

Estimated exposure intensities to the 449 TRI chemicals were calculated at the maternal residence location of each LBW case and control. To protect confidentiality, we use maps showing the estimated exposure densities at the locations of the centroids of census tracts, instead of maternal residential locations, to illustrate the geographic distribution of exposure intensity to a specific chemical. We used a four-step procedure to produce these maps (Fig. 2). First, centroids of the 4388 census tracts in Texas in 2000 were considered as hypothetical point receptors of air pollution exposure. Second, we used EWPM to estimate exposure intensities to a given chemical at the locations of these 4388 centroids. Third, the estimated exposure intensity at the centroid of a specific census tract was used to represent the exposure intensity of the whole census tract, assuming that the exposure distribution in the census tract is homogenous. Fourth, exposure intensities in the 4388 census tracts were categorized into seven levels (exposure intensity at zero and greater than zero divided into six equal-interval groups).

Fig. 2
figure 2

Distribution of estimated 1,1,1-trichloroethane exposure intensities, Texas, 1996–2008

Figure 2 shows an example distribution of estimated exposure intensities to chemical 1,1,1-trichloroethane (chemical abstracts service (CAS) number 71556) among census tracts in Texas during 1996–2008. The highest exposure intensity level was observed in northeastern Texas (Dallas-Fort Worth area) in 1996. However, the exposure intensity level of this area decreased every year after 1996 and reached zero in year 2001. The southeastern Texas (Houston area) had a mid-level exposure intensity consistently during the 13 years. In 1996 and 1998, two areas in central Texas also showed mid-level exposure intensities.

Table 3 shows 78 TRI chemicals with statistically significant aORs greater than 1 when exposure intensities were dichotomized into exposed and unexposed groups. Compared with the unexposed reference groups, the LBW risks in exposed groups of the 78 chemicals increased by 2 to 60% (aOR 1.02–1.60). Then, 10 chemicals with the largest positive aORs were chosen from the 78 chemicals as the potential risk factors for LBW in additional epidemiology analysis. The ten chemicals were acetamide (aOR 1.60, 95% CI 1.09, 2.34), p-phenylenediamine (aOR 1.32, 95% CI 1.07,1.63), 2,2-dichloro-1,1,1-trifluoroethane (aOR 1.21, 95% CI 1.10, 1.34), 1,2-phenylenediamine (aOR 1.20, 95% CI 1.02, 1.41), resmethrin (aOR 1.14, 95% CI 1.01, 1.30), toluene-2,6-diisocyanate (aOR 1.14, 95% CI 1.02, 1.28), tributyltin methacrylate (aOR 1.14, 95% CI 1.05, 1.23), propetamphos (aOR 1.11, 95% CI 1.01, 1.23), 1,1,1-trichloroethane (aOR 1.10, 95% CI 1.05, 1.15), and creosote (aOR 1.09, 95% CI 1.02, 1.16).

Table 3 Maternal exposure to selected chemicals and low birth weight in offspring, Texas, 1996–2008

Table 4 shows the results from the additional analyses of the above ten identified chemicals, where estimated exposure intensities were divided into four levels (zero, low, medium, and high exposure). Without multiple comparison correction, seven of the ten chemicals had statistically significant aORs larger than 1.0 in at least one of the three exposure levels. These seven chemicals are acetamide, p-phenylenediamine, 2,2-dichloro-1,1,1-trifluoroethane, resmethrin, toluene-2,6-diisocyanate, tributyltin methacrylate, and 1,1,1-trichloroethane. Five chemicals survived a multiple comparisons correction using the Benjamini-Hochberg procedure with false discovery rate (FDR) at the level of 0.05. These five chemicals are acetamide, p-phenylenediamine, 2,2-dichloro-1,1,1-trifluoroethane, tributyltin methacrylate, and 1,1,1-trichloroethane.

Table 4 Maternal exposure to different levels (low, medium, and high) of intensities to selected chemicals (with top ten aORs in Table 3) and low birth weight in offspring, Texas, 1996–2008

The largest aOR value was noted in the case of low exposure to acetamide (aOR 2.29, 95% CI 1.24, 4.20), indicating that mothers with low exposure to acetamide had a 129% higher chance of delivering LBW babies when compared with mothers who had no exposure to air emissions of acetamide from industrial facilities. Chemical 1,1,1-trichloroethane showed statistically significant aORs among mothers within two exposure levels (low and medium). Monotonically increasing trends were noted between maternal residential exposure to tributyltin methacrylate and LBW in offspring (highest aOR in high exposure: 1.20, 95% CI 1.06, 1.36). Statistically significant association was also observed in medium exposure to p-phenylenediamine and 2,2-dichloro-1,1,1-trifluoroethane (Table 4).

Discussions

Adverse health outcomes of exposure to the ten chemicals in Table 4 (acetamide, p-phenylenediamine, 2,2-dichloro-1,1,1-trifluoroethane, 1,2-phenylenediamine, resmethrin, toluene 2,6-diisocyanate, tributyltin methacrylate, propetamphos, 1,1,1-trichloroethane, and creosote) have been documented in various studies reported in the literature. Inhalation of tributyltin methacrylate and propetamphos might cause serve injury or death (U.S. NOAA 2016). Short-term exposure to the other eight chemicals might cause irritation to the eyes, skin, mucous membranes, and respiratory tract, while long-term exposure to these chemicals was reported to yield other adverse health effects, including skin sensitization, asthma, narcosis, cardiac disorders, anemia, dermatitis, and hyperpigmentation of skin (U.S. CDC 2015). Long-term exposure to p-phenylenediamine and 2,2-dichloro-1,1,1-trifluoroethane might also have effects on kidney and liver respectively (U.S. CDC 2015). Chemical 1,1,1-trichloroethane, also known as methyl chloroform, was reported to be associated with neural tube defects in a study conducted in Texas investigating the associations between chlorinated solvent exposures and birth defect in offspring (Brender et al. 2014). However, no study has examined the associations between maternal residential exposure to these chemicals and LBW in offspring. The identified association between LBW and the five TRI chemicals in this study created new opportunities for further epidemiological, biological, and toxicological research. It is important to note that some other significant pollutants from this exploratory analysis might also be important risk factors for LBW, despite that they did not survive multiple testing correction reported in this paper.

As shown in Table 4, with the increase of maternal residential exposure intensities to acetamide, p-phenylenediamine, 2,2-dichloro-1,1,1-trifluoroethane, and 1,1,1-trichloroethane, the risk of having LBW newborns did not necessarily increase monotonically in that the highest aORs were not noted in mothers who were categorized in the level of high exposure. Brender et al. (2014) also found similar trends in their studies examining associations between maternal chlorinated solvent exposures and birth defect in Texas. The non-monotonic trends may be due to the fact that data used in this study contained live births only. One possible explanation is that higher maternal residential exposure intensities to certain chemicals could have severely affected weight gain in the fetus during some pregnancies and resulted in induced terminations or fetal deaths. However, data about these pregnancies were not available in the present study. Therefore, the groups categorized as high exposure in this study might have odds ratios biased toward null.

The emission-weighted proximity model (EWPM) was used in this study to estimate maternal residential exposure to the TRI chemicals during pregnancy. Compared to other exposure assessment methods, the EWPM is simpler to implement and more cost-effective when many air pollutants are considered (Gong et al. 2016). Different types and magnitudes of exposure misclassifications are expected in different exposure assessment models, the EWPM might also introduce some degree of exposure misclassifications (Zou et al. 2009a). To assess the performance of EWPM in estimating exposure intensities, a study was conducted to examine how the EWPM estimated exposure intensities correlated with air monitoring data of 27 non-criteria air pollutants at 48 monitoring sites in Texas in 2005 (Gong et al. 2016). The study concluded that the EWPM is a valid and useful approach for exposure assessment when analyses involve data covering a large geographic area over multiple years (Gong et al. 2016). This study used a consistent effective distance (kθ = 10 km) for all chemicals, meaning exposure to a given chemical was only considered to be affected by sources within 10 km from the location of interest. However, the best effective distance (kθ) for each chemical may be related to chemical properties, meteorological conditions, and terrain surrounding the location of interest. Therefore, future studies should consider finding the best effective distance for a specific chemical before using the EWPM model in order to improve the accuracy of the estimated exposure.

Air emission data used in this study had several limitations. First, because only annual-level TRI air emission data were available, this study could only estimate annual exposure to the TRI chemicals. It was not possible to estimated exposure variations at finer temporal scales to reflect daily, monthly, and seasonal variations. Second, industrial facilities self-reported their annual air emission quantities to the TRI program, which may lead to some uncertainties in the data. Third, only industrial air emissions from point sources (stacks of TRI industrial facilities) were used for exposure assessment in this study, future studies should consider integrating more emission source types (linear, areal, and mobile sources) for more accurate measures. Fourth, based on the observation that a small portion of the study population resided in areas near the Texas state boundaries to other states or Mexico (8.92% (n = 41,957) within 10 km of the border, and 6.67% (n = 31,392) within 10–50 km of the border), we performed analyses to examine how the removal of the cases and controls within 10 km of the Texas state boundaries may affect the results. Among the five significant chemicals reported in Table 4, only acetamide did not survive the multiple comparison correction after those cases and controls were removed. The results are summarized in a table in the supplementary materials (Table S-1). The results suggest that the “edge effect” of geographic boundaries does affect the results to some degree, and it should be considered when interpreting the results of the type of analysis similar to the ones reported in this article.

Maternal residential addresses at delivery were used in this study to estimate exposure during pregnancy, assuming that each maternal residence was the same from conception to delivery. Therefore, exposure misclassifications might exist if there were maternal residential movements during pregnancy in the study population (Canfield et al. 2006; Lupo et al. 2010). To overcome this limitation, data with maternal residential history are needed to more accurately estimate and categorize exposure. For a large-scale study based on birth registry data such as the one reported in this paper, data with detailed residential history were simply not available. It is worth noting that the impact might be limited because the change of residences during pregnancy tends to involve only short distances in most cases (Lupo et al. 2010). In addition, there has been a number of studies suggesting an association between maternal food restriction (MFR) with reduced birth weight (Godfrey et al. 1996). It would be ideal to include data about MFR into the analyses of this study, but again data about MFR were simply not available for this large-scale study. We used maternal education level as a covariate in the analysis. Because education level is a good indicator for socioeconomic status, we hope this covariate will partly take care of this limitation.

Conclusion

This large population-based, case-control study examined whether maternal residential proximity to some of the 449 TRI air pollutants could be potential risk factors associated with LBW in offspring. This number of 449 chemicals far exceeded the number of non-criteria air pollutants examined by studies reported in the literature. Maternal residential exposure to the TRI chemicals during pregnancy was estimated using the EWPM model. The model takes into account reported quantities of annual air emission from industrial facilities and the distances between the locations of industrial facilities and maternal residence locations. This study concluded that maternal residential proximity to industrial air emissions of some TRI chemicals during pregnancy may be associated with LBW in offspring. These chemicals included acetamide, p-phenylenediamine, 2,2-dichloro-1,1,1-trifluoroethane, tributyltin methacrylate, and 1,1,1-trichloroethane. For most of these five pollutants, the exposure-response function does not seem to be monotonically increasing when the estimated exposure is categorized into low, medium, and high levels.