Introduction

In the US, over 1.1 million adults and adolescents currently live with HIV and an estimated 40,000–50,000 new HIV infections occur annually [1,2,3,4,5]. About 1 in 7 individuals infected with HIV do not know that they carry the virus [1, 6]. HIV infected individuals who are unaware of their status and continue to engage in risky sexual activity contribute to 93% of new HIV infections each year [7,8,9]. Early HIV detection allows people to start antiretroviral treatment (ART) while their CD4 counts are still high, which has led to dramatically improved long-term outcomes and similar life expectancies to individuals without HIV [10,11,12,13,14]. Thus, it is crucial for sexually active individuals to undergo routine HIV testing in order to start ART as soon as possible after they are infected with HIV [15,16,17,18].

Despite slowly rising rates of HIV testing and a decrease in new infections each year among the general population, HIV testing remains disproportionately low among certain subgroups such as adolescents, older adults, and rural residents [19,20,21,22,23]. Lack of healthcare access, fears about HIV risk and stigma, misconceptions about treatment, and the exclusion of HIV testing from routine medical care have been identified as the four key barriers that result in less than half of Americans getting tested in their lifetime [2, 24,25,26]. Existing research on the impact of living in a rural area (rurality) on HIV testing at the national and state level indicates that rural–urban differences in high HIV-transmission risk behaviors and access to HIV prevention and treatment may contribute to differences in HIV testing rates between rural and urban areas that range from 5.4 to 15.4% [27,28,29,30,31,32,33,34,35,36]. However, these works have been limited by non-contemporaneous data that is not indicative of current HIV prevalence, examination of testing at only a single geographic scale within a study, and the inability to directly compare HIV testing rates between US regions and states [27,28,29,30,31].

Our study extensively examined differences in HIV testing between rural and urban residents at the state, regional, and national scale by using contemporary nationally representative HIV data [27,28,29,30,31]. We isolated the independent role of rurality on HIV testing while controlling for sociodemographic, clinical, and health seeking behavioral factors through marginal effects. Keeping in line with previous studies on the influence of rurality on HIV testing, we hypothesized that rural residents would have lower rates of HIV testing at the state, regional, and national level compared to urban residents [27,28,29,30,31]. The results of this study allowed us to identify specific rural/urban areas with HIV testing disparities across the US and make direct comparisons of HIV testing estimates at the state, regional, and national level which can be used to guide development of future public health interventions.

Methods

Study Sample

We used the Centers for Disease Control and Prevention’s (CDC) nationally representative Behavioral Risk Factor Surveillance System (BRFSS) surveys from 2012 to 2017 [37,38,39,40,41,42]. The BRFSS is a landline and cell phone conducted survey that collects information on the health behaviors and chronic conditions of residents in all 50 states and the District of Columbia [37,38,39,40,41,42]. Potential survey participants are contacted through commercially available phone lists from Genesys, Inc. [43]. As some minority groups and rural residents are less likely to have access to telephones and cell phones, oversampling and raking adjustments are carried out in the BRFSS to ensure these groups are well represented [43, 44].

We combined data from the six most recent BRFSS surveys to maximize the sample size and increase the power of our analyses. BRFSS surveys from 2010 and earlier were not included because they used post stratification weighting, which is incompatible with the iterative proportional fitting of the 2012-2017 surveys [45,46,47]. The datasets are publicly available for download at the CDC’s website (https://www.cdc.gov/brfss/annual_data/annual_data.htm) [38].

Our study was comprised of survey participants ≥ 18 year who responded to two questions in the HIV/AIDS section of the BRFSS (section 18 in the 2012 and 2014 surveys, section 16 in the 2013 and 2017 surveys, section 15 in the 2015 survey, section 19 in the 2016 survey) [45,46,47]. Specifically, we included individuals in our study who responded to, “Have you ever been tested for HIV? Do not count tests you may have had as part of a blood donation. Include testing fluid from your mouth.” (n = 2,509,103), and “Not including blood donations, in what month and year was your last HIV test?” (n = 399,067 out of 755,726 who answered “Yes” to the previous question) [45,46,47]. Given that the median delay from infection to diagnosis was 3 years, we classified anyone who had received an HIV test in the last 3 years at the time of survey as “recent”, while those who had not were labeled “not recent” [48]. Responses to these two questions allowed us to separate the sample of respondents into those who had never received HIV testing (n = 1,753,377), those who had received HIV testing but have not kept up regular screenings (last screening more than 3 years ago) (n = 185,629), and those who maintain a regular screening schedule (n = 213,438). For both questions, we excluded respondents who answered “don’t know/not sure” or refused to answer.

Covariates

We included multiple sociodemographic, clinical, and health seeking behavioral factors in our analyses that have either been shown to be associated with HIV in numerous studies or are associated with testing [49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73]. By including these factors as covariates in our analyses, we are able to control to a large degree any confounding that could bias the association between rurality and HIV testing [74, 75]. We finely categorized all the sociodemographic, clinical, and health seeking behavioral covariates in our study to minimize any residual confounding due to coarse categorization, an issue that has arisen in previous studies on rurality and HIV testing [31, 74, 76, 77]. All covariates in the study were categorical and consisted of age, household income, educational attainment, self-reported race, general health status, body mass index (BMI) categories, health care coverage, personal doctor/health care provider, and marital status [45,46,47].

We used the Metropolitan Status Codes (MSCODE) in each survey to classify respondents as either rural, suburban, or urban residents [45,46,47]. Rural residents were those who do not live in a metropolitan statistical area (MSA) (MSCODE 5), suburban residents were those who live inside a suburban county of the MSA (MSCODE 3), and urban residents were those who live in the center city of an MSA (MSCODE 1) or outside the center city of an MSA but inside the county containing the center city (MSCODE 2) [45,46,47]. Living in an MSA that has no center city (MSCODE 4) was only available in the 2012 and 2013 BRFSS surveys and was very small compared to other categories in the same survey and the overall study sample [37, 45]. Due to the small number of MSCODE 4 individuals (n = 3629 for “Have you ever been tested for HIV?”, n = 507 for “How recent is your last HIV test?”) we do not discuss results pertaining to MSCODE 4, although it was included in our analysis.

Statistical Models

We accounted for the complex survey design and unequal weighting using survey weights in logistic regression models that evaluated the relationship between HIV testing and rurality (MSCODE) while controlling for the sociodemographic, clinical, and health seeking behavioral factors mentioned earlier. Two logistic regression models were run in this study, one for “Have you ever been tested for HIV?” (HIVTST) with “Yes” as the outcome and the other for “How recent is your last HIV test?” (WHENLAST) with “3 years or less” as the outcome [78]. The two logistic models controlled for the same covariates but had different sample sizes (as only respondents who had ever been tested for HIV could answer about its recency). The sample sizes for these two models were 2,509,103 and 399,067 individuals respectively. All logistic models were run in SAS 9.4 [79].

No subgroup analyses were conducted for groups that are at high risk for HIV infection such as Men who have Sex with Men (MSM) and injection drug users due to the manner the BRFSS is structured and administered [40, 41, 44, 80,81,82,83]. For instance, we would not be able to capture heterosexual men who have sex with men in MSM and it would be impossible to figure out which individuals actually are injection drug users as the BRFSS question on drug use is a composite question on a variety of high risk HIV situations [40, 41, 44, 84, 85]. Additionally, the ability to carry out subgroup analyses is hindered by the already small sample size of MSM in BRFSS surveys before data cleaning (MSM: ~ 1.5% of survey population) [40, 41, 44, 80,81,82,83, 85].

Marginal Probabilities

After fitting the two logistic regression models, we calculated Average Adjusted Predictions (AAPs), a type of marginal probability, for the two questions on HIV testing (HIVTST and WHENLAST) [86, 87]. In this case, AAPs attempt to control for the other sociodemographic, clinical, and health seeking behavioral factors by considering a hypothetical respondent population with no variation in these factors [87]. For example, the rural AAP is the predicted marginal probability of either having ever been tested for HIV or a recent last HIV test where the survey population was hypothetically all residing in rural areas [87]. We chose to calculate AAPs in addition to odds ratios due to the values of AAPs being given in absolute numerical values, allowing for the presentation of our actual estimated testing probabilities and more practical interpretation of state-by-state testing differences [87]. Essentially, marginal probabilities can be thought of as odds ratios, where all the covariates have been held constant. AAPs allow us to isolate the association between rurality, each sociodemographic, clinical, and health seeking behavioral factor, and HIVTST or WHENLAST when all other factors are held constant in the logistic regression models for rural, suburban, and urban areas in each of the 50 states [86, 87]. Furthermore, we also calculated the urban/suburban versus rural AAP differences, known as average marginal effects (AMEs), with respect to HIVTST and WHENLAST in each state [86, 87]. In this study, an AME is the difference in the probability of people who have ever been tested for HIV or had a recent last HIV test between a hypothetically all suburban or urban survey population and a hypothetically all rural one [87]. Stata 15 was used to calculate both AAPs and AMEs [88].

Regional and National Level Analyses

We used the same US Census regions in our study as used by other studies on the influence of rurality on HIV testing [31, 89]. The four regions were the Northeast (Connecticut, Maine, Massachusetts, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont), the Midwest (Illinois, Indiana, Iowa, Kansas, Michigan, Minnesota, Missouri, Nebraska, North Dakota, Ohio, South Dakota, Wisconsin), the South (Alabama, Arkansas, Delaware, District of Columbia, Florida, Georgia, Kentucky, Louisiana, Maryland, Mississippi, North Carolina, Oklahoma, South Carolina, Tennessee, Texas, Virginia, West Virginia), and the West (Alaska, Arizona, California, Colorado, Hawaii, Idaho, Montana, Nevada, New Mexico, Oregon, Utah, Washington, Wyoming) [89]. In order to obtain regional level estimates of HIV testing for each MSCODE, we averaged together the marginal probabilities for each of the states in the respective region. National level estimates of HIV testing for rural, suburban, and urban areas were calculated by using all study participants who met the study’s eligibility criteria from 2012 to 2017 BRFSS surveys.

Results

The people who responded to having ever been tested for HIV were predominantly ≥ 45 years (73.0%), female (58.2%), had an income ≥ $25,000 (61.7%), White (76.7%), had at least a high-school education (92%), reported that they felt good to excellent about their general health (81%), had healthcare coverage (90.9%), and a personal doctor (84.5%) (Table 1). BMI was roughly evenly split between normal weight (32.4%), overweight (36.1%), and obese (29.8%). The majority of these people reported either living in the center city of an MSA (33.6%) or not in an MSA at all (33.7%). People who responded to the recency of their last HIV test were mainly 25–54 years (67.0%), female (56.7%), had an income ≥ $25,000 (72.9%), had at least a high-school education (93.5%), reported that they felt good to excellent about their general health (81.8%), had healthcare coverage (87.4%), and a personal doctor (79.7%).

Table 1 Demographic, SES, and health risk factors among participants in the 2012–2017 Behavioral Risk Factor Surveillance Surveys included in the study

In national level results of both having ever been tested for HIV and having a last HIV test ≤ 3 years ago; men had higher odds compared to women, those earning < $15,000 had the highest odds compared to those earning ≥ $50,000, Blacks had the highest odds compared to Whites, those with poor health had the highest odds compared to those with excellent health, those with more than one personal doctor/health care provider had the highest odds compared to those with only one, and current smokers had higher odds compared to non-smokers (Table 2). Except for smoking status, all of these associations were statistically significant with p values < 0.05. Overall, we observed that the odds of having ever been tested for HIV and having a last HIV test ≤ 3 years ago tended to decrease as reported income increased, while the odds of having a last HIV test ≤ 3 years ago declined with increasing education level. Blacks, Hispanics, and those who reported other races all had higher odds of both having ever been tested for HIV and a last HIV test ≤ 3 years ago compared to Whites. In addition, suburban and urban residents had higher odds of having ever been tested for HIV and a last HIV test ≤ 3 than rural residents across the US.

Table 2 Results of logistic regression models on having ever been tested for HIV (HIVTST) and recency of last HIV test (WHENLAST)

At the national level, people who lived in urban (MSCODE 1 and 2) and suburban (MSCODE 3) areas had a higher probability of having ever been tested for HIV [AAPs: MSCODE 1: 28.26% (95% CI 28.04–28.49%); MSCODE 2: 26.52 (95% CI 26.23–26.82%); and MSCODE 3: 25.82% (95% CI 25.46–26.18%)] compared with residents in rural area (AAP MSCODE 5: 24.92% (95% CI 24.64–25.21%)). Urban and suburban residents also had a higher probability of having a recent HIV test [AAPs: MSCODE 1: 47.81% (95% CI 47.15–48.48%); MSCODE 2: 46.29% (95% CI 45.38–47.21%); and MSCODE 3: 45.83% (95% CI 44.72–46.94%)] compared to those residing in rural areas [AAP MSCODE 5: 44.28% (95% CI: 43.34–45.21%)]. There was the greatest difference in probability of having ever been tested for HIV between those living In the center city of an MSA (MSCODE 1) and those not living in an MSA (MSCODE 5) (AME: 3.34%) and the smallest difference between those living inside a suburban county of the MSA (MSCODE 3) and those not living in an MSA (MSCODE 5) (AME: 0.9%) (Table 3). For recency of last HIV test, the largest difference in probability was between those living in the center city of an MSA (MSCODE 1) and those not living in an MSA (MSCODE 5) (AME: 3.54%) and the smallest between those living inside a suburban county of the MSA (MSCODE 3) and those not living in an MSA (MSCODE 5) (AME: 1.55%).

Table 3 Average Adjusted Predictions (AAPs) (%) for different urban–rural categories and average marginal effects (AMEs) (%) of MSCODE 1, 2, 3 vs. MSCODE 5 with respect to respect to having ever been tested for HIV (HIVTST) and recency of last HIV test (WHENLAST)

Regionally, the Northeast had the highest probability of people having ever been tested for HIV for all MSCODE areas (AAP: 25.65–29.12%) and the Midwest had the lowest (AAP: 17.57–20.32%) (Table 4). The region with the highest probability of people having a recent HIV test was the South for all MSCODE areas (AAP: 49.71–53.32%) and the region with the lowest probability was the Midwest (AAP: 37.65–41.14%). The states with the greatest probability of people having ever been tested for HIV are Alaska (AAP: 35.19–39.32%), Maryland (AAP: 34.37–38.27%), and New York (AAP: 34.28–38.21%) while South Dakotas (AAP: 14.74–17.18%), North Dakota (AAP: 14.72–17.19%), and Utah (AAP: 14.75–17.26%) are the states with the smallest probability. Mississippi (AAP: 61.55–64.9%), Louisiana (AAP: 57.45–60.91%), and North Carolina (AAP: 56.14–59.67%) had the greatest probability of people having a recent HIV test while Wisconsin (AAP: 28.05–31.16%), Maine (AAP: 29.21–32.45%), and Idaho (AAP: 29.34–32.52%) had the smallest probability.

Table 4 Average adjusted predictions (AAPs) (%) at the state level for different urban–rural categories with respect to having ever been tested for HIV (HIVTST) and recency of last HIV test (WHENLAST) in all five metropolitan status areas

Discussion

We conducted a nationally representative study using 2012–2017 BRFSS to examine the influence of rurality on HIV testing. At the national level, suburban and urban areas had higher odds and AAPs of having ever been tested for HIV and having a recent HIV test compared to rural areas across the US. Compared to other regions, the Midwest had the lowest AAPs for both having ever been tested for HIV (17.57–20.32%) and a recent HIV test (37.65–41.14%). For both having ever been tested for HIV and having a recent HIV test, regions with the highest probabilities had the greatest difference in probabilities between rural and urban areas and regions with the lowest probabilities had the smallest difference in probabilities between rural and urban areas. In addition, the highest rural–urban testing disparities were observed in states with high AAPs for having ever been tested for HIV and having a recent HIV test. HIV testing odds ratios and AAPs, with regard to having ever been tested for HIV and having a recent HIV test, were higher in urban compared to rural areas at the national level and in all regions and states.

Past work on the influence of rurality on HIV testing has tended to focus on this research question at one geographic scale [28,29,30,31]. A Texas study of 9744 people in 2010 found that Whites who did not live in a MSA were less likely to have ever been tested for HIV compared to those who lived in the central city of a MSA [28, 29]. In two national level studies, one using the 2005 and 2009 BRFSS and the other the 2015 BRFSS, rural residents were found to be significantly less likely than urban residents to have ever been tested for HIV and for their last HIV test to have been recent [30, 31]. While our results are consistent with past research highlighting rural–urban disparities in HIV testing, the magnitude of our differences in HIV testing rates between rural and urban areas, AME: 3.4 and 1.6%, are smaller than those reported in previous studies [27,28,29,30,31]. This study’s lower differences in HIV testing rates between rural and urban areas may be attributed to our ability to adjust extensively for sociodemographic, clinical, and health seeking behavioral factors compared to other studies, reducing the impact of these factors’ bias on study estimates [27,28,29,30,31]. Our results also bring to light exceptionally high HIV testing rates in states such as Alaska which could be explained by the state’s high chlamydia (highest in the US) and gonorrhea (2nd highest in the US) incidence rates [90]. The behaviors that lead to chlamydia and gonorrhea infections and the lesions and sores that are a product of these diseases are associated with a higher risk of becoming HIV infected [90,91,92,93].

We believe that the rural–urban HIV testing disparity observed in our study centers on two distinct points: (1), differences in high HIV-transmission risk behaviors between rural and urban areas and (2) differences in access to HIV prevention and treatment resources between rural and urban areas [32]. In Florida, adolescent rural residents were more likely to report that they intended to have sex without a condom in the future than urban residents [34]. Rural men who use the internet to meet partners online were more likely to report unprotected anal intercourse in their last sexual episode compared to urban men [33]. Although the prevalence of illicit drug use is consistently higher in urban compared to rural areas, urban residents tend to have greater access to treatment facilities, in terms of availability and standard of care, than rural residents [35, 36]. Regarding differences in access to HIV prevention and treatment resources between rural and urban areas, a study of US rural and urban individuals with HIV found that not only did many rural residents with HIV have to travel to urban medical centers in order to receive Highly Active Antiretroviral Therapy (HAART), but that on average rural residents with HIV had fewer outpatient visits for HIV care compared to urban residents with HIV [32]. In addition, this same study observed that rural injection drug users were more likely to have fewer visits for HIV care compared to their urban counterparts [32]. However, pinpointing whether differences in high HIV-transmission risk behaviors between rural and urban areas, differences in access to HIV prevention and treatment resources between rural and urban areas, or both are behind the rural–urban HIV testing disparity is beyond the scope of the BRFSS data and this issue warrants further study.

It is important to distinguish between the statistical and clinical significance of our results and establish what they mean in practical terms, especially because their magnitude is less than that found in earlier and smaller-scale studies [27,28,29,30,31, 94]. Although it is true that the calculated differences in AAPs between the most urban and most rural areas for both having ever had a HIV test having had a recent HIV test are relatively small in magnitude, they are consistent in their strength and direction across multiple levels of granularity [27,28,29,30,31]. Furthermore, keeping in mind that even the finest scale we considered, the state, still encompasses millions of adult individuals, the approximate 3–4% difference in AAPs still corresponds to tens, if not hundreds of thousands of people [95]. For example, a 3.66% (MSCODE 1 and 5) and 1.76% (MSCODE 2 and 5) difference in AAP for having ever had a HIV test between rural and urban areas in Georgia, the state with the highest HIV infection rate and with a rural population of 2,415,502 in the 2010 Census, translates into approximately 88,000 and 42,500 more rural residents who would receive HIV tests if they had lived in urban and suburban areas, respectively [95, 96].

Our study has several limitations that need to be considered. Data in the BRFSS is obtained through self-report, so it is likely that some misclassification of health status, race, ethnicity, education, age, and income exists [37,38,39]. While there is no way of ascertaining the accuracy of the entire self-reported BRFSS dataset, several studies have shown that the correlations between in person measurements of clinical factors (obesity, smoking, diabetes) and BRFSS responses ranged from 74 to 82% [97]. In addition, a study comparing Massachusetts electronic health records (EHR) to Massachusetts BRFSS responses found that prevalence of diabetes (EHR: 9.4%, BRFSS: 9.7%), smoking (EHR: 13.5%, BRFSS: 14.7%), hypertension (EHR: 26.3%, BRFSS: 29.6%), and obesity (EHR: 22.8%, BRFSS: 23.8%) was very similar between the two data sources [98]. Another BRFSS validation study that examined the correlation between self-reported BMI with in clinic BMI measurements found that among men the correlation was R2 = 0.89 and among women R2 = 0.92 [99]. Therefore, we feel any issues concerning accuracy of self-report to be minor, if not negligible. Minor loss of accuracy due to self-report will likely result in minimal non-differential misclassification bias [100]. Although we controlled for a large array of sociodemographic and clinical covariates in our analysis, it is possible some residual confounding remains. However, we included all of the covariates included in other studies on rurality and HIV testing in our analyses except for whether an individual was a member of a sexual minority group, as this information was not available in the 2012 BRFSS [27, 28, 30, 31, 72]. While we did not include this covariate in our analysis, sensitivity analyses conducted by Henderson et al. in their work on the influence of rurality on HIV testing indicated there was negligible differences in results between models that included whether an individual was a member of a sexual minority group and those that did not [31]. We are unable to conduct subgroup analyses for high risk HIV groups such as MSM and injection drug users. This is largely inherent to the nature of how BRFSS survey questions are structured and conducted as well as the small sample size of these subgroups [40, 41, 44, 85].

We would also like to note the limitations in our usage of two operational definitions integral to our model, that of what constitutes “recent” and what constitutes the cutoffs between “urban, rural, and suburban”. It is possible that using alternative definitions (such as reducing the cutoff for recency or using USDA Rural–Urban Continuum Codes) may change study findings, but our usage of these definitions in this study is well justified. Although it is true that sexually active individuals have possibly engaged in intercourse later than the 3 year cutoff that we have defined for “recent HIV test”, 3 years corresponds with how the CDC defines “recency” within this survey, and accounts for any further or continuing testing within this timeframe [40, 41, 85]. With respect to our choice of Metropolitan Status Codes, the division of the codes clearly delineates the cutoff between urban versus suburban versus rural, whereas the choice is not as immediately apparent with Rural–Urban Continuum codes (for example, many urban populations in nonmetro counties have higher populations than metro counties in continuum codes) [40, 41, 85, 101].

Compared with previous research that examines the influence of rurality on HIV testing, our large nationwide study allows us to isolate the association between rurality and HIV testing practices at multiple geographic scales after taking major sociodemographic and clinical factors that influence HIV risk and health seeking behaviors into account [27,28,29,30,31, 72]. Combining the 2012–2017 BRFSS surveys gives us a large study population and adequate power for our analyses. By using survey weights and oversampling in the study population, we are able to obtain more precise estimation within the subpopulations of interest (different MSCODEs at regional and state level). Additionally, the use of marginal effects allows for easier direct comparisons of HIV testing probabilities, especially for the factors where the choice of level that odds ratios are computed against is arbitrary or not immediately clear, as with state or region. Finally, the use of MSCODE as the classification for rural, suburban, and urban areas allows for an ordinal scale that distinctly separates places by degree of rurality.

Conclusion

This study sought to determine if there are disparities in HIV testing between suburban/urban and rural residents and whether they vary across the US. Our results indicated the existence of suburban/urban–rural disparities in HIV testing at the national, regional, and state levels. Regions and states where HIV testing estimates were far below the national average can use this study’s findings to motivate the implementation of HIV testing initiatives that well-performing states already have in place. This will increase the number of HIV cases that can be detected early, improve the lives of the already infected via prompt antiretroviral treatment, and ultimately reduce future HIV cases.