Introduction

Environmental metals exist in air, water, and soil. The main routes of environmental metal exposure occur from ingestion of contaminated food and water, inhalation of contaminated air, and skin absorption. Chronic low dose exposure to environmental metals can cause disease and is a public health hazard (Rahman et al. 2020a; Wu et al. 2018). Many studies have associated heavy metals with high blood pressure (Hawkesworth et al. 2013; Telisman et al. 2001). Some studies have shown significant associations between cardiovascular disease (CVD) and exposure to arsenic, cadmium, mercury, and lead, while other studies found no significant association between these toxic metals and CVD risk (Yang et al. 2020). Prevalence of high blood pressure increases with age including 7.5% among adults aged 18–39 and 63.1% among those aged 60 and over (Fryar et al. 2017). Metals including arsenic, tin, cadmium, lead, and mercury have been associated with high blood pressure (Miao et al. 2020; Wang et al. 2018; Shiue et al., 2014b; Shiue and Hristova 2014; Abhyankar et al. 2012; Houston 2007; Hallenbeck 1984) although some studies show mixed results (Shiue 2014a; Jones et al. 2011). It has been reported that certain metals, including selenium, vanadium, and manganese are needed for maintaining blood pressure. Deficiencies can decrease oxidative defenses leading to increased adverse effects from toxicants. Excess levels can lead to negative effects on the cardiovascular system (Wu et al. 2018).

Several studies have analyzed metals through different exposures and their relationship to high blood pressure (Abhyankar et al. 2012; Kobal et al. 2004). Arsenic exposure can be measured by different methods. In analyzing arsenic exposure from drinking water, a systematic review determined an increasing trend in odds of high blood pressure with increasing arsenic exposure (Abhyankar et al. 2012). Indoor air pollution from coal combustion and indoor smoking can include trace and major metal elements, including tin (Wang et al. 2018). Cesium levels in the environment are low; however, cesium chloride is a homeopathic cancer therapy causing some individuals to have significant exposures (McGinnis et al. 2016). Lead exposure has been linked to cardiac and vascular damage leading to an increased risk of CVD (Miao et al. 2020). In occupational studies, mercury miners with chronic exposure were seen to have a weak correlation between systolic blood pressure and mercury exposure (Kobal et al. 2004). Furthermore, high blood pressure is a complex medical condition leading to a global mortality rate of 13%. It is estimated to contribute to 25% of myocardial infarctions and has been associated with lifestyle habits, including tobacco smoking, lack of physical activity, and alcohol consumption (Cuschieri et al. 2017). The cost of high blood pressure is high for both individuals and society (Shiue 2014b). The mean annual medical expenditure for patients with high blood pressure is $9089. Compared to individuals without high blood pressure, patients with high blood pressure had 2.5 times the inpatient costs, almost double the outpatient costs, and triple the prescription costs. The cost of patients with high blood pressure in the USA results in an annual cost of approximately $131 billion per year more than that of the non-hypertensive population (Kirkland et al. 2018).

The purpose of this study is to evaluate the association between increased blood pressure and urinary toxic metals in the general US population using the 2015–2016 NHANES dataset, which is the most recent NHANES dataset published by the CDC that includes urinary speciated arsenic. The current study aims to assess the link with environmental metal exposure (arsenous acid, arsenobetaine, dimethylarsinic acid, monomethylarsonic acid, total arsenic, mercury, barium, cadmium, cobalt, cesium, molybdenum, manganese, lead, antinomy, tin, strontium, thallium, tungsten, and uranium) assessed with urine samples and the risk of high blood pressure in the human population.

Methods

Data sources

The data for this project was taken from the 2015–2016 NHANES dataset, a long-standing study conducted by the National Center for Health Statistics (NCHS) which is a part of the Centers for Disease Control and Prevention (CDC). It combines interviews and physical examinations of children and adults throughout the USA to determine their health and nutritional status (CDC 2017a).

The NHANES data files used for the response factors were BPX_I—Blood pressure measurements and BPQ_I—Blood pressure questionnaire (CDC 2017c; CDC 2017b). The NHANES data files used for the urinary arsenic and urinary metals data which were used as the primary variates were UAS_I—Speciated urinary arsenic, UTAS_I—Total Urinary arsenic, UMS_I—Urinary metals, and UHG_I—Urinary mercury (CDC 2018a; CDC 2018b; CDC 2018c; CDC 2018d). The creatinine concentration used to normalize the urinary arsenic and metal concentrations came from the NHANES data file ALB_CR_I (CDC 2019a). The data for the covariates came from the NHANES data files ALQ_I—Alcohol consumption, BMX_I—Body mass index, and DEMO_I—Demographic data, COT_I—Serum cotinine (CDC 2018e; CDC 2017d; CDC 2017e; CDC 2019b).

NHANES was approved by the Research Ethics Review Board of the NCHS. As this is a public-use dataset, this study was exempt from additional review by an institutional review board.

Data cleaning

The data cleaning consisted of four primary steps: (1) removing all missing responses, (2) categorizing the continuous covariate variables, (3) normalizing the concentration of the urinary chemical species by the creatinine concentration, and (4) creating the binary categorical variable for high blood pressure. Missing responses in the demographic data, BMI, serum continine, and alcohol consumption were removed, and the reduced dataset was used for all subsequent modeling. If any one of these variables was missing, the entire response was eliminated. Missing responses for urinary arsenic and urinary metals were removed from the complete dataset just prior to creation of the logit regression model.

The continuous variables age, family income to poverty ratio (FIPR), and body mass index (BMI) were converted into categorical factors. Age was converted into a three-level categorical variable with the levels 20 ≤ age1 ≤ 40, 41 ≤ age2 ≤ 65, and 65 < age3 as in prior studies (Rahman et al. 2020b). FIPR was converted into a three-level categorical variable: 0 ≤ FIPR1 ≤ 130%, 130 %  ≤ FIPR2 ≤ 350%, and 350 %  < FIPR3. The BMI was converted into a four-level categorical variable with underweight :  BMI < 18.5, normal weight :  18.5 ≤ BMI < 24.9, overweight :  24.9 ≤ BMI < 30.0, and obese :  30.0 ≤ BMI per the CDC definitions (CDC 2020).

Natural variation in urine dilution was addressed by taking the urinary arsenic and metals species concentrations and dividing them by the creatinine concentration (Jones et al. 2011). No attempt was made to make the concentration units consistent; the values provided in the NHANES data were used directly. Once the concentration had been normalized, they were then log10 transformed to create a more nearly normal distribution of the concentrations.

The categorical binary variable for high blood pressure was set so that the respondent was deemed to have high blood pressure if any of the following criteria were true: the average of the 2nd, 3rd, and (if present) 4th systolic blood pressure readings was greater than or equal to 130 mm Hg, or the average of the 2nd, 3rd, and (if present) 4th diastolic blood pressure readings was greater than or equal to 80 mm Hg, or they were taking prescribed medicine for high blood pressure (Jones et al. 2011).

Statistical analysis

The statistical analysis was done in R version 3.6.3 using programs from the survey package to account for the complex design used in the construction of the NHANES survey (R Core Team 2020; Lumley 2004; Lumley 2010; Lumley 2020). Specifically, from the survey package, the functions svyby, svymean, svyttest, svydesign, and svyglm were used to calculate the unweighted occurrences of responses, the weighted mean of the responses, the weighted pairwise t tests of the responses, the survey design object, and the logit regression models, respectively. Additional R functions and packages used to simplify the programing included: the function nhanes_load_data in the package RNHANES was used to download the data files from the NHANES website and store them locally as comma separated values (*.csv) files (Susmann 2016). The functions optimalCutoff, misClassError, sensitivity, and specificity in the package InformationValue were used to calculate the quality metrics of the logit regression models, while the function plotROC from the same function was used to calculate the ROC curves used for visual model assessment (Prabhakaran 2016).

Logit regression modeling

The discussion below gives an overview of the approach to modeling the correlation between the concentration of selected urinary compounds and high blood pressure. The basic approach is to construct a logit regression model using the assumed covariate factors to predict high blood pressure and then add to that basic model the urinary compounds one at a time to determine whether or not there is a correlation between the urinary compound and high blood pressure. Each of these is discussed in more detail below.

Covariate-factor modeling

Since the demographic dataset is much larger than the arsenic and metal datasets, the initial covariate-factor models were developed using just the demographic dataset to gain insight into how these factors correlated with high blood pressure. Two models were considered. The first was constructed using just the main effects of the covariate factors; the second included the main effects and all the two-factor interactions except those associated with family income to poverty ratio, serum cotinine level, and alcohol consumption.

The metrics that were used to assess the goodness-of-fit of the models were fraction mismatched—the total number of predictions that were incorrect divided by the total number of predictions. Sensitivity, also called the true positive rate—the proportion of actual positives which were correctly identified as such and were complementary to the false negative rate. Sensitivity = true positives / (true positive + false negative) (Vadakkanmarveettil 2015). Specificity, also called the true negative rate—the proportion of negatives which were correctly identified as such. Specificity was complementary to the false positive rate. Specificity = true negatives / (true negative + false positives) (Prabhakaran 2016). Area under the ROC curve—this was an aggregated metric that evaluated how well the logistic regression model classified positive and negative outcomes at all possible cutoffs. It ranges from 0.5 to 1, and the larger it is the better (“Interpreting Logistic ROC Curves,” n.d.). The results of the logit regression for the main-effects model are given in Table 1.

Table 1 Logit regression model coefficients, the corresponding odds ratios, and lower and upper confidence levels (LCL and UCL, respectively) for a model that includes only main effects

Results

Preliminary data review: covariate factors

The responses are presented for the complete demographic dataset as well as for urinary cotinine concentration and alcohol consumption in Table 2. For each covariate, the total number of respondents of each group of each categorical variable is given along with the corresponding percentage and the percentage within the categorical variable that had high blood pressure (% HBP).

Table 2 Summary statistics for the covariates used in the study of high blood pressure

In summary of these results: the difference in gender is statistically significant with a higher percentage of men having high blood pressure than women. Non-Hispanic black have the highest percentage of individuals with high blood pressure (58.3%), while other Hispanic have the lowest (42.5%). However, the results for Mexican American, Other Hispanic, Non-Hispanic Asian, and Other were not statistically different at the α=0.05 level. On the other hand, the frequency of high blood pressure in Non-Hispanic whites (50.3%) was statistically different from the other groups except for Non-Hispanic Asian. For highest educational level achieved, generally the higher the level of education, the smaller the percentage of respondents with high blood pressure. There are statistically significant differences between the groups within the marital status category with those never married or living with a partner having the lowest percentage of individuals with high blood pressure and those separated having the highest. There are statistically significant differences in all three age categories with the youngest having the lowest percentage of individuals with high blood pressure and those above age 65 having the highest percentage. There is no statistically significant difference in the family income to poverty ratio groups. There is a statistically significant difference in the BMI groups with the higher the BMI, the higher the percentage of individuals with high blood pressure. There is no statistically significant difference in the serum cotinine concentration groups or in the alcohol consumption groups.

In the columns of the percentage of individuals with high blood pressure, those within the same category with the same letter following the numerical value are not statistically different at the α=0.05 level. For the categories with only two levels, this was determined using a χ2 test on the appropriate contingency table. For categories with more than two levels, this was determined using a Tukey analysis based on a linear model using only the variables within that category as the independent factors.

The results for the two-factor interaction model are not presented here due to the large number of terms in the model, 270 in total. However, the model did indicate several significant 2-factor interaction terms along with decreasing the number of significant main effects.

The two-factor model does a better prediction of correctly predicting positive results (i.e., having high blood pressure) than does the main-effects only model. The two-factor interaction and the main-effects models have similar abilities to predict negative results. The two-factor interaction model misclassifies 26% of the responses, while the main-factor model misclassifies 28% of the responses. As expected, there is a tradeoff in sensitivity and specificity. As the sensitivity decreases, the specificity increases. The values of the sensitivity and specificity are functions not only of the logit regression model but also the cutoff value used to split the continuous response into the binary prediction response. In this work, the cutoff values were determined using the optimalCutoff function in the R package InformationValue (Prabhakaran 2016). The area under the ROC curve (not presented) is higher for the two-factor interaction model than it is for the main-effects model.

Complete modeling summary

Using a covariate model with just main effects, the model coefficients and odds ratios as well as their 95% confidence intervals for the arsenic and metal compounds are given in Table 3. These results show that one of the arsenic compounds and two of the metals are statistically correlated with high blood pressure at the a=0.05 level. Model coefficients and odds ratios are not given for any models for arsenic acid and arsenocholine because the NHANES database did not have sufficient data to create a logit regression model because of the large number of terms in the model.

Table 3 Summary of logit model regressions with only main effects for the covariates for the correlation of arsenic and metal compounds, measured in μg/L, on high blood pressure

The model coefficients and odds ratios as well as their 95% confidence intervals using two-factor interaction modes for the arsenic and metal compounds are given in Table 4. These results show that none of the arsenic compounds and two of the metals are statistically correlated with high blood pressure at the a=0.05 level. The only two species significantly correlated with high blood pressure are cesium and tin.

Table 4 Summary of logit model regressions including selected two-factor interaction terms for the covariates for the correlation of arsenic and metal compounds, measured in μg/L, on high blood pressure

A second two-factor interaction model was also created to study the effect of the choice of interaction terms on the results. The second two-factor interaction model included the main effects and interaction terms associated with age to see if this would more accurately model how high blood pressure develops as individuals age. Specifically, the interaction terms included age × gender, age × race/ethnicity, age × education, age × marital status, and age × BMI. The results for this covariate model are given in Table 5. Using this model, only two metals are significantly correlated with high blood pressure: cesium and tin, the same two that were significant in the prior two-factor model.

Table 5 Summary of logit model regressions including age-related two-factor interaction terms for the covariates for the correlation of arsenic and metal compounds, measured in μg/L, on high blood pressure

A fourth model was created that used the main effects and gender related two-factor interactions, specifically: gender × age, gender × race/ethnicity, gender × education, gender × marital status, and gender × BMI. The results for this covariate model are given in Table 6. Using this model, only two metals are significantly correlated with high blood pressure: cesium and tin, the same two that were significant in the prior two-factor models.

Table 6 Summary of logit model regressions including gender-related two-factor interaction terms for the covariates for the correlation of arsenic and metal compounds, measured in μg/L, on high blood pressure

Finally, a fifth model was created that used the main effects and body mass index-related two-factor interactions, specifically: BMI × gender, BMI × age, BMI × education, and BMI × marital status. The results for this covariate model are given in Table 7. Using this model, arsenous acid, cesium, and tin are significantly correlated with high blood pressure.

Table 7 Summary of logit model regressions including BMI-related two-factor interaction terms for the covariates for the correlation of arsenic and metal compounds, measured in μg/L, on high blood pressure

Discussion

Our study analyzed the relationship between nineteen urinary metals (arsenous acid, arsenobetaine, dimethylarsinic acid, monomethylarsonic acid, total arsenic, mercury, barium, cadmium, cobalt, cesium, molybdenum, manganese, lead, antinomy, tin, strontium, thallium, tungsten, and uranium) and their effect on high blood pressure in US adults using the 2015–2016 NHANES dataset. Among five different models, it was observed that urinary arsenous acid, cesium, and tin all had statistically significant associations with high blood pressure, defined as ≥130 mm Hg systolic, ≥80 mm Hg diastolic, or patients on antihypertensive medications. High blood pressure affects approximately one-third of American adults, and nearly 1 billion people worldwide (Shiue 2014b; Wu et al. 2018). The American Heart Association (AHA) categorizes high blood pressure into several groups. Elevated systolic blood pressure ranges from 120 to 129 mm Hg, stage 1 hypertension ranges 130–139 mm Hg, and stage 2 hypertension includes ≥140 mm Hg (AHA 2019). Based on the 2017 American College of Cardiology (ACC) guidelines, we defined high blood pressure as stage 1 or greater, defined as a systolic pressure of ≥130 mm Hg and/or a diastolic pressure of ≥80 mm Hg, or patients on antihypertensive medications (Flack and Adekola 2020; Whelton et al. 2018).

Studies have shown mixed results regarding the association between arsenic and high blood pressure. Jones et al. (2011) found no association between total arsenic minus arsenobetaine, DMA, or arsenobetaine and systolic or diastolic high blood pressure using 2003–2008 NHANES data. High blood pressure was measured as ≥140 mm Hg systolic, ≥90 mm Hg diastolic, a physician diagnosis, or use of antihypertensive medication (Jones et al. 2011). In our study, we also found no association between DMA and arsenobetaine and high blood pressure. Chen et al. (1995) studied arsenic exposure through well water consumption measuring the concentration of arsenic in the water. They determined that in residents in villages where long-term arseniasis was hyperendemic, there was a 1.5-fold increase in age and sex adjusted high blood pressure compared to residents in non-endemic areas, with high blood pressure measured as ≥160 mm Hg systolic, ≥95 mm Hg diastolic, or treatment with antihypertensive drugs. A significant dose-response relationship between high blood pressure and average arsenic concentration in drinking water was found even after adjustment for age and sex. This study did not identify what form of arsenic was found in drinking water and therefore does not show whether a certain form of speciated arsenic could be responsible for the high blood pressure (Chen et al. 1995).

Our study presented an association with arsenous acid, which is an inorganic form of arsenic. Jones et al. (2011) did not analyze arsenous acid because the limits of detection for arsenite, arsenate, and methylarsonate, which reflect inorganic arsenic exposure, were too high for the population that had an overall low to moderate exposure of inorganic arsenic. Arsenous acid shares the same oxidation state of arsenite of +3, whereas arsenic acid and arsenate are +5 (Jekel and Amy 2006). Arsenous acid, among other inorganic forms of arsenic, are considered toxic and can lead to adverse health effects (Caldwell et al. 2009). Jones et al. (2011) analyzed total arsenic, DMA, and arsenobetaine. Arsenous acid and other forms of inorganic arsenic are methylated to metabolites like DMA and MMA and distributed to the body. Jones et al. (2011) did find a small association between DMA and high blood pressure only in certain subgroups, including BMI ≥30 kg/m2 and in never-smokers; however, the overall sample did not show a significant association. Our study found a positive association between arsenous acid and high blood pressure but none between DMA and high blood pressure. It has been suggested that arsenic acts on blood pressure levels via oxidative stress, inflammation, endothelial dysfunction, and nitric oxide inhibition (Wu et al. 2018).

There is minimal literature regarding the effect of tin exposure on blood pressure in humans. Wang et al. (2018) studied 367 subjects, all housewives in China exposed to indoor air pollution primarily due to coal combustion and passive smoking. A positive correlation with high blood pressure was found with arsenic, lead, and rare earth element levels in hair samples; a negative correlation was found with trace elements including chromium, cobalt, nickel, tin, and alkaline earth metals: calcium, magnesium, and barium (Wang et al. 2018). Lan et al. (2021) found no significant association between tin and systolic or diastolic blood pressure. In contrast, Shiue (2014b) found a significant association between high blood pressure and tin; our study also supports these findings.

Few studies have analyzed the effect of cesium on humans. A study using 1999–2012 NHANES data found a negative association between urinary cesium and diastolic blood pressure, showing that urinary cesium lowered diastolic blood pressure (McGinnis et al. 2016). However, Shiue and Hristova (2014) determined an increased odds of high blood pressure with cesium exposure using 2009–2012 NHANES data. Their study defined high blood pressure as ≥140 mm Hg systolic or ≥90 mm Hg diastolic. Shiue (2014b) also found a positive association between cesium and high blood pressure using 2011–2012 NHANES data. In addition, in Brazil, Rodrigues et al. (2017) studied a group of victims with radioactive exposure to cesium-137. It was determined that the prevalence of high blood pressure was similar in the radioactivity victims as the general population. This was conflicting to our results of an increased odds of high blood pressure with cesium exposure; however, their sample size was only 102 participants, and it was not adjusted for smoking (Rodrigues et al. 2017). In our results, the correlations with tin and cesium observed with the two-factor model remained consistent with the main-effect model associations. Furthermore, all other metals were not found to have an association with high blood pressure in our study. Shiue (2014a) found no association between mercury and high blood pressure but did find significant associations with cobalt, lead, antimony, and tungsten using 2009–2010 NHANES data. Wu et al. (2018), however, found no association between lead and cadmium and high blood pressure.

Within the study group, certain demographic groups had statistically significant higher odds of having high blood pressure. These included Non-Hispanic Whites, Non-Hispanic Blacks, Non-Hispanic Asians, widows, individuals over 40, and those who are overweight or obese. On the other hand, females and college graduates were statistically more likely to have lower blood pressure. Jones et al. (2011) found that compared to subjects without high blood pressure, participants with high blood pressure were more likely to be older, female, white, and less educated. These results were similar to ours with the exception of race and sex, wherein our study determined that the ethnic group with the smallest percentage of individuals with high blood pressure were Other Hispanics (42.5%) but that was statistically not different from Mexican American (43.7%), non-Hispanic Asians (46.2%), and Other (53.2%) and that males were more likely than females to have high blood pressure. This was also observed in Wu et al. (2018) where males were significantly more likely to have high blood pressure. Feng et al. (2012) found that BMI was strongly associated with high blood pressure, which was also supported by our results.

Strengths and limitations

This study was conducted using the NHANES dataset, leading to a large sample size of 4037 adults, which was representative of the US population including a sample of several races, ages, and health statuses. Exposures to nineteen different metals were analyzed, which provides a thorough evaluation of many different exposures that an individual could experience simultaneously. High blood pressure affects a large portion of the population in the USA, determining potential causes that can help to improve health and reduce healthcare expenditures in the USA.

Our study is only applicable to the adult population and is a cross-sectional design; therefore, we cannot determine causality. In addition, the sources of exposures are unknown and not accounted for. Our study does not designate which participants had occupational exposure, and therefore the public health implications of reducing exposure in certain populations are limited. We also did not study the impact of combined metal exposure, such as the interaction of specific metals with each other. Future prospective studies are needed to determine causation and the source of metal exposures.

Conclusions

Arsenous acid, cesium, and tin are likely to contribute to high blood pressure in adults in the USA. Using the 2015–2016 NHANES dataset, no other urinary environmental metals were associated with high blood pressure among the nineteen included in this study.