Socially vulnerable communities may be at higher risk for COVID-19 outbreaks in the US. However, no prior studies examined temporal trends and differential effects of social vulnerability on COVID-19 incidence and death rates. Therefore, we examined temporal trends among counties with high and low social vulnerability to quantify disparities in trends over time.
We conducted a longitudinal analysis examining COVID-19 incidence and death rates from March 15 to December 31, 2020, for each US county using data from USAFacts. We classified counties using the Social Vulnerability Index (SVI), a percentile-based measure from the Centers for Disease Control and Prevention, with higher values indicating more vulnerability. Using a Bayesian hierarchical negative binomial model, we estimated daily risk ratios (RRs) comparing counties in the first (lower) and fourth (upper) SVI quartiles, adjusting for rurality, percentage in poor or fair health, percentage female, percentage of smokers, county average daily fine particulate matter (PM2.5), percentage of primary care physicians per 100,000 residents, daily temperature and precipitation, and proportion tested for COVID-19.
At the outset of the pandemic, the most vulnerable counties had, on average, fewer cases per 100,000 than least vulnerable SVI quartile. However, on March 28, we observed a crossover effect in which the most vulnerable counties experienced higher COVID-19 incidence rates compared to the least vulnerable counties (RR = 1.05, 95% PI: 0.98, 1.12). Vulnerable counties had higher death rates starting on May 21 (RR = 1.08, 95% PI: 1.00,1.16). However, by October, this trend reversed and the most vulnerable counties had lower death rates compared to least vulnerable counties.
Citation: Neelon B, Mutiso F, Mueller NT, Pearce JL, Benjamin-Neelon SE (2021) Spatial and temporal trends in social vulnerability and COVID-19 incidence and death rates in the United States. PLoS ONE 16(3): e0248702. https://doi.org/10.1371/journal.pone.0248702
Editor: Samantha S. Goldfarb, Florida State University College of Medicine, UNITED STATES
Received: September 18, 2020; Accepted: March 3, 2021; Published: March 24, 2021
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: Dr. Mueller was supported by the National Heart, Lung, And Blood Institute of the National Institutes of Health under Award Number K01HL141589 (PI: Mueller). Dr. Neelon is a part-time employee of the Department of Veterans Affairs. The content of this article does not represent the views of the Department of Veterans Affairs or the U.S. government. The article represents the views of the authors and not those of the VA or Health Services Research and Development.
Competing interests: The authors have declared that no competing interests exist.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the cause of coronavirus disease 2019 (COVID-19), has created a global public health crisis since its onset in late 2019. As of late February 2021, there have been over 27 million confirmed COVID-19 cases and nearly a half a million deaths in the United States (US) alone . Overwhelming evidence has found that the pandemic has disproportionately affected people of color, older individuals, and those of lower socioeconomic status [2–10]. Several studies have shown that African Americans have higher incidence and death rates than non-Hispanic whites [6, 8, 9, 11]. Two studies also reported that COVID-19 infection rates are greater in US counties and in states with high Latinx populations and monolingual Spanish speakers [4, 7]. Further, older age has been associated with an increased risk of death among those infected with COVID-19 [5, 12, 13]. Underlying health conditions and comorbidities may partially explain these associations , but do not fully account for the disproportionate burden. Recent studies suggest that social determinants of health and community contextual factors contribute to these disparities, and that socially vulnerable communities are at highest risk for COVID-19 outbreaks [6, 14–16].
Protecting vulnerable populations is critically important during the COVID-19 pandemic, as these groups are generally at higher risk for adverse health outcomes [17, 18]. Hurst et al. define vulnerability as an identifiably elevated risk of incurring greater wrong or harm . One type of vulnerability–social vulnerability–has been used by the Centers for Disease Control and Prevention (CDC) to identify communities most at risk when faced with adverse events that may impact health, such as natural disasters or disease outbreaks. The CDC developed the social vulnerability index (SVI) to assist federal, state, and local governments in targeting and mobilizing resources for at-risk counties in response to adverse events.
Recent studies have demonstrated the importance of considering social vulnerability in both COVID-19 cases and deaths, although the findings have been somewhat inconsistent [20–23]. Karaye et al. examined associations between the SVI and cumulative COVID-19 cases on May 12, 2020 . They found that SVI total score was associated with increased rates of COVID-19. However, the authors found no association when they examined six states with high testing rates. Khazanchi and colleagues conducted an analysis of COVID-19 cases and deaths through April 19, 2020, and found that those living in the most vulnerable counties (highest SVI) had greater risk of infection and death . Nayak et al. examined associations between the SVI and COVID-19 incidence and case fatalities through April 4, 2020, and found a significant association between social vulnerability and case fatality but not incident cases . Finally, Wang and colleagues found that social vulnerability and COVID-19 incident cases and deaths had spatially varying associations; however, the authors did not adjust for potential confounders . Notably, all studies were cross-sectional and conducted at different time points early in the pandemic, which might contribute to the inconsistent findings. In fact, to date, no prior studies have examined longitudinal trends in social vulnerability and COVID-19 incidence and death rates in an effort to determine how these relationships change over time. Therefore, the purpose of this study was to examine temporal trends among counties with high and low social vulnerability and to quantify disparities in these trends over time.
Materials and methods
We conducted a retrospective longitudinal analysis examining COVID-19 incidence and death rates from March 15, 2020 to December 31, 2020 for each of the 3,142 US county and county equivalents based on their unique Federal Information Processing Series (FIPS) codes [24, 25]. Specifically, we modeled the temporal trend in daily incidence and death rates for each county and assessed differential risks by county-level social vulnerability. We hypothesized that highly vulnerable counties would have higher incidence and death rates compared to less vulnerable counties and that this disparity would widen over time. The Institutional Review Boards at the Medical University of South Carolina and Johns Hopkins Bloomberg School of Public Health deemed this research exempt from review.
COVID-19 incident cases and deaths
We obtained daily COVID-19 incident case and death data from USAFacts  and the Johns Hopkins Center for Systems Science and Engineering . Because national data are only available at the county level, we used county as the geographic unit for our analyses. Moreover, because Johns Hopkins aggregates data for some counties (e.g., the five boroughs of New York) , we opted to use the USAFacts data in our primary analysis, and conducted a sensitivity analysis using Johns Hopkins data. For both data sources, we downloaded daily incident case and death counts from March 15 to December 31, 2020. We obtained county population data from the 2019 population datafile compiled by the US Census Bureau . We present data sources for all variables in Table 1.
Social vulnerability index
We used publicly available data from the CDC’s Agency for Toxic Substances and Disease Registry to classify counties using SVI . The SVI is a percentile-based measure of social vulnerability, or the resilience of communities to address stressors to health related to external hazards (e.g., natural disasters or disease outbreaks) . The Geospatial Research, Analysis & Services Program within the Agency for Toxic Substances and Disease Registry created the SVI database to help public health officials identify communities that will most likely need support and resources during and after a hazardous event like a pandemic . The overall index and each theme are scored from 0 to 1, with higher scores indicating greater vulnerability [30, 34]. The index was constructed using data from 15 variables from the US Census Bureau. A percentile rank was calculated for each of these variables and grouped among four themes of SVI that measure various aspects of vulnerability–these include Socioeconomic Status, Household Composition, Race/Ethnicity/Language, and Housing/Transportation [30, 34].
The Socioeconomic Status theme is composed of percentile rank data for the following variables: percentage below poverty, percentage unemployed, per capita income, and percentage with no high school diploma. For Household Composition, the variables include percentage age 65 years and older, percentage age 17 years or younger, percentage age 5 years or older with a disability, and percentage of single-parent households. The Race/Ethnicity/Language theme encompasses percentage minority and percentage who speaks English “less than well”. Finally, the Housing/Transportation theme includes data for the percentage of multiunit structures, percentage of mobile homes, percentage crowding, percentage having no vehicle, and percentage of group quarters.
For our analyses, we downloaded the 2018 county-level SVI data (the most recent available) for all 3,142 counties. One county was missing SVI data; for this county, we imputed SVI data using the national average.
We adjusted for several variables that could help explain the differential impact of COVID-19 on upper and lower SVI counties. These variables were chosen a priori based on previously reported associations with COVID-19 incidence and deaths [20–22, 35–38]. As noted above, we provide data sources for all variables in Table 1. These included the percentage of each county designated as rural, the percentage of female residents, the percentage of adult smokers in the county, the number of primary care physicians per 100,000 in each county, average daily temperature (degrees Fahrenheit), and average daily precipitation (inches). We also controlled for the percentage of residents in poor or fair health, a validated measure of the overall health and comorbidity burden of the population . We additionally controlled for population density, defined as the number of residents per square mile  and the average particulate matter of diameter ≤ 2.5 micrometers (PM2.5) , a measure of fine particulate air pollution that can compromise respiratory function. Finally, we controlled for the daily proportion of COVID-19 Viral (RT-PCR) tests performed in each state (county-level data are not currently available). We converted the number tested to a proportion by dividing the number of tests by the state population size, obtained from the US Census Bureau’s population estimate dataset .
Each of the above variables was included in the models examining the impact of overall SVI on cases and deaths as well as the SVI themes. However, because each SVI theme includes only the SVI variables specific to that theme, we included additional adjustment variables in the analyses involving specific SVI themes. For the Socioeconomic Status theme, we included the percentage of county residents aged 65 years and older, the percentage of non-Hispanic Black residents, and the percentage of Hispanic residents. For the Household Composition theme, we adjusted for the percentage below the federal poverty line as well as percentage of NHB and Hispanic residents; we did not adjust for age, as this is included as part of the theme. For Race/Ethnicity/Language, we adjusted for age ≥ 65 and over and percent poverty. For the Housing/Transportation theme, we adjusted for age ≥ 65, percent poverty, percent NHB and percent Hispanic. Finally, as a sensitivity analysis, we fit unadjusted models for overall SVI and the four themes.
We fit Bayesian hierarchical negative binomial models with daily incident cases and daily deaths for each county as the outcomes. The models included penalized cubic Bsplines for both the fixed and random (i.e., county-specific) temporal effects, with knots placed approximately every two weeks over the study period (20 knots total). The models also included county population as an offset on the log scale to convert the case and death counts to population-adjusted rates. The models further adjusted for the variables described above.
To avoid overfitting the temporal splines, we assigned ridging priors to the fixed and county-specific spline coefficients–i.e., independent, mean-zero normal distributions with shared inverse gamma variances . We assigned weakly informative normal priors to the corresponding regression parameters. We assigned a gamma prior to the negative binomial dispersion parameter. We developed an efficient data-augmented Gibbs sampler to aid posterior computation [41, 42]. For both the incidence case and death rate models, we ran the Gibbs sampler for 2,500 iterations with a burn-in 500 to ensure convergence.
To report results, we compared counties in the top or upper SVI quartile (most vulnerable) to those in bottom or lower SVI quartile (least vulnerable). For both quartiles, we graphed the posterior mean incidence and death rate trends for the reference covariate group along with the corresponding 95% posterior intervals (PIs). We also reported adjusted risk ratios (RRs) and 95% PIs comparing the upper and lower quartiles on each day for the overall SVI and its themes.
As sensitivity analyses, we ran unadjusted models as well as models based on Johns Hopkins case and death data. We conducted all analyses using R software version 3.6 (R Core Team 2019, R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria).
The final analytic sample comprised 917,464 observations (3,142 counties x 292 study days). There were 786 counties in each of the upper and lower SVI quartiles. Table 2 presents county demographics overall and by upper and lower SVI quartile. The two quartiles were similar with respect to gender, age, smoking, and the total number of RT-PCR tests performed during the study period. The quartiles differed with respect to percent poverty, race/ethnicity composition and temperature, with more vulnerable counties having higher average daily temperature, suggesting that many of these counties are located in the southern US.
Fig 1A presents the per capita incidence trends (expressed as cases per 100,000) for the upper (most vulnerable) and lower (least vulnerable) quartiles of SVI from the unadjusted analysis. For counties in the upper quartile, the average incidence increased steadily from March 15 (0.10 cases per 100,000; 95% PI: 0.09, 0.12) to May 12 (9.36 cases per 100,000; 95% PI: 9.08, 9.61). The incidence leveled off in mid-May before a precipitous increase through July 30 (31.10 cases per 100,000; 95% PI: 30.52, 31.68). The incident cases receded in late summer and early fall, before a final uptick from mid-September to December 31 (57.21 cases per 100,000; 95% PI: 54.94, 59.43). By comparison, the least vulnerable quartile exhibited a gradual increase in incidence through the summer and fall, achieving a maximum incidence rate of 60.23 cases per 100,000 on November 17 (95% PI: 59.25, 61.32) before tapering in late November and early December.
Incidence rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of overall SVI.
Fig 1B presents the posterior mean adjusted RRs comparing the upper and lower quartiles on each day. On March 15, the RR for incident cases was 0.95 (95% PI: 0.82, 1.07), suggesting that most vulnerable counties had, on average, fewer cases per 100,000 than less vulnerable \counties, although this result did not statistically differ from 1.00. In fact, through most of March, the RRs were <1.00. However, on March 28, we observed a “crossover effect” in which the RR became significantly greater than 1.00, indicating that the more vulnerable counties had higher COVID-19 incidence on average compared to less vulnerable counties (March 28 RR = 1.09, 95% PI: 1.03, 1.17). The RRs increased steadily thereafter and achieved a maximum RR of 3.61 (95% PI: 3.47, 3.77) on June 23, then decreased steadily through mid-November. Notably, on September 27, we observed a second crossover date, in which the lower (least vulnerable) quartile exhibited higher incidence compared to the upper quartile (RR = 0.97, 95% PI: 0.94, 0.99). The minimum RR of 0.60 (95% PI: 0.58, 0.62) occurred on November 9, indicating that the upper quartile had 40% lower risk of infection. However, on December 9, there was a final crossover, and the upper (most vulnerable) quartile again had higher incidence (RR = 1.03, 95% PI: 1.01, 1.05). Thus, the association between SVI and COVID-19 was not unidirectional over time, but instead exhibited an alternating pattern throughout the course of the study.
Fig 2A presents per capita death trends (expressed as deaths per million) for the upper and lower quartiles of overall SVI. The death rates for both quartiles increased until mid-April before leveling off through the end of June. Beginning in early July, however, the mean death rate for the upper, most vulnerable quartile increased steadily until August 7 (4.67 deaths per million, 95% PI: 4.42, 4.92). The trend for the upper quartile leveled off in early autumn before a final upswing through December 31 (8.51 deaths per million; 95% PI: 7.79, 9.43). The daily death rates for the lower (least vulnerable) quartile hovered between 2 and 3 deaths per million for most of the summer. However, beginning in late September, there was a rapid uptick in the death rate, and by December 31, there was an estimated 13.07 deaths per million (95% PI: 10.82, 15.71) on average in these least vulnerable counties. Fig 2B presents the daily RRs comparing the upper and lower quartiles. From mid-March to mid-May, the upper, most vulnerable quartile had lower death rates than the lower quartile. However, on May 21, the trend reversed, and the upper quartile had higher death rates compared to the lower quartile (RR = 1.08, 95% PI: 1.00,1.16). The RRs increased until achieving a maximum value on August 14 (RR = 1.97, 95% PI: 1.76, 2.17). On October 9, however, we observed a second crossover in which the most vulnerable counties had, on average, lower death rates than the least vulnerable counties. On December 4, the RR comparing the upper to lower quartiles was 0.46 (95% PI: 0.42, 0.50), indicating that the most vulnerable counties had on average 54% lower risk of death compared to the most vulnerable counties.
Death rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of overall SVI. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
SVI theme: Socioeconomic status
Figs 3A, 3B, 4A and 4B present the temporal trends and RRs for incident cases and deaths, respectively, for the Socioeconomic Status theme. The trends were similar to those for overall SVI. Incident cases were higher for the least vulnerable quartile from March 15 through April 3, at which point the RRs became significantly >1.00. The RRs achieved a maximum of 2.80 (95% PI: 2.65, 2.94) on June 19 before a plateau in July. Starting in August, the RRs declined steadily as the per capita cases for the lower quartile began to catch up to the upper quartile. As with overall SVI, we observed a second crossover event on September 22 when the most vulnerable counties again experienced lower incidence rates compare to the least vulnerable counties. In fact, on November 9, the most vulnerable counties had 45% lower risk of infection on average compared to the least vulnerable counties (RR = 0.55, 95% PI: 0.51, 0.60). The trend reversed for a final time on December 7, and the RRs remained above 1.00 through December 31. The death rate trend for the SES theme was similar to the death trend for overall SVI. The most vulnerable counties had lower death rates on average early in the pandemic, higher rates in the summer and early fall, and lower rates again as winter approached.
Incidence rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Socioeconomic Status.
Death rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Socioeconomic Status. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
SVI theme: Household composition
Figs 5A, 5B, 6A and 6B present the results for the Housing Composition theme. The crossover effect was significantly delayed for this theme, with the crossover dates occurring on May 18 for incident cases (Fig 5B) and on July 3 for deaths (Fig 6B). Thus, the pandemic appears to have disproportionately impacted the least vulnerable counties with respect to household composition for much of the early pandemic. However, these trends reversed by July. For incident cases, the daily RRs achieved a maximum of 1.99 (95% PI: 1.95, 2.07) on July 29 (Fig 5B) and then declined steadily. In late October and early November, there was another brief crossover period; however, on November 21, the trends reversed again, and the most vulnerable counties once more exhibited higher incidence than the least vulnerable counties. For deaths (Fig 6B), the maximum RR of 1.79 (95% PI: 1.63, 1.94) was achieved August 21 and, unlike incident cases, remained above 1.00 for the remainder of the study, although the 95% PIs overlapped 1.00 briefly in early December, indicating a null effect during this period.
Incidence rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Household Composition.
Death rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Household Composition. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
SVI theme: Race/ethnicity/language
Figs 7A, 7B, 8A and 8B present the results for the Race/Ethnicity/Language theme. Unlike the previous themes, the most vulnerable counties experienced higher incidence and death rates from the outset of the pandemic. In fact, the disparity between the most and least vulnerable counties was greatest for this theme, with a maximum incidence RR of 5.00 (95% PI: 4.71, 5.27) on May 1. For cases, the RRs declined steadily from late June into August, as the incidence for the lower quartile outpaced the upper quartile. On September 21, we observed the first crossover, and by November 9, the most vulnerable counties had 43% lower risk of infection (RR = 0.57, 95% PI: 0.56, 0.59). The death rates followed a similar pattern: the most vulnerable counties had higher death rates through mid-September when the trends reversed and the most vulnerable counties had lower death rates than the least vulnerable counties.
Incidence rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Race/Ethnicity/Language.
Death rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of Race/Ethnicity/Language. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
SVI theme: Housing/transportation
Figs 9A, 9B, 10A and 10B present the results for the Housing/Transportation theme. The incident case RRs (Fig 9B) remained significantly positive from March 15 to September 7, with a maximum RR of 1.99 (95% PI: 1.90, 2.10) on April 27. However, from September 7 on, the most vulnerable counties had lower infection rates compared to the least vulnerable counties. The death rate RRs hovered between 1.25 to 1.5 through late August; on September 29, we observed a crossover effect whereby the most vulnerable counties experienced lower death rates.
Incidence rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of housing/transportation.
Death rates (A) and adjusted risk ratios (B) for upper (most vulnerable) versus lower (least vulnerable) quartiles of housing/transportation. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
There were also several significant associations among the adjustment variables. Table 3 presents the adjusted RRs and 95% PIs for the adjustment variables included in the overall SVI model. Percent rural, percent smoking, per capita number of primary care physicians, and average daily temperature were negatively associated with COVID-19 cases. Daily precipitation and population density were not significantly associated with cases. Similar results were observed for the death model, but here population density was positively associated with deaths, while precipitation had a modest negative association.
Sensitivity analysis using Johns Hopkins data
Sensitivity analysis using the Johns Hopkins data produced similar results to those we observed using USAFacts data. S1A, S1B, S2A and S2B Figs present the incidence and death rate trends for overall SVI. For both cases and deaths, the trends were similar across the two data sources.
Unadjusted sensitivity analysis
As a second sensitivity analyses, we ran corresponding unadjusted models for overall SVI. Results are presented in S3 and S4 Figs. The trends followed similar patterns to those in the adjusted analyses, although the RRs had a wider range in the unadjusted analysis. For example, the maximum RR for the unadjusted deaths was 3.27 (95% PI: 2.94, 3.55) on August 3 and 1.97 (95% PI: 1.76, 2.17) on August 14 in the adjusted analysis.
In this study examining COVID-19 incidence and death rates from March 15 to December 31, 2020, we found that cases were higher for the least vulnerable counties initially and toward the end of our study. However, from late March to late September, incident cases were substantially higher among the most vulnerable counties. Death rates followed a similar pattern. Thus, the RRs for both cases and deaths exhibited a sinusoidal pattern throughout the study. Three of the SVI theme (Race/Ethnicity/Language, Household Composition, and Socioeconomic Status) showed similar patterns to overall SVI. However, for Housing/Transportation, cases and deaths were higher for the most vulnerable counties from March through September. The trend reversed in October.
Our findings are generally consistent with the study by Khazanchi et al. examining data up to April 19, 2020, which found that counties in the upper quartile of overall SVI had higher incidence and death rates compared to those in the lower quartile . As in that study, we found the strongest disparity for the Race/Ethnicity/Language theme in that the RRs achieved the largest values and were substantially higher for a large portion of the study period. However, Khazanchi et al. found no association with Household Composition, whereas we found that the most vulnerable had higher rates of cases and deaths from March to September. This may be due to the fact that the authors examined cumulative cases through April 19 only, whereas we examined daily incidence. However, this does not fully explain the difference, which could instead be due to our longitudinal approach, as it provided a comprehensive picture of the evolving relationship between SVI and COVID-19, rather than a momentary snapshot.
Our results may also shed light on inconsistent findings in two other studies. As in our study, Karaye et al. found that overall SVI and Race/Ethnicity/Language were associated with increased COVID-19 incidence through May 12, 2020 . However, they found no association between Socioeconomic Status and incident cases, whereas Household Composition and Housing/Transportation had an inverse relationship. Our results place these findings in temporal context. In particular, we found multiple crossover effects for Household Composition, with differences varying over the observed across the study period. Nayak et al., meanwhile, found no association between overall SVI and cumulative COVID-19 incidence on April 4, 2020 . According to our results, however, this was close to the first crossover date of March 19, when the most vulnerable counties had significantly higher rates if infection. Additionally, Nayak and colleagues found that the Race/Ethnicity/Language and Housing/Transportation themes were positively associated with incident cases, whereas we found multiple crossover effects. Again, these results highlight the need to consider both temporal and spatial variability when attempting to fully understand, in near-real time, the impact of the pandemic on populations with different vulnerability profiles.
Several covariates from our adjusted model were significantly associated COVID-19 cases and deaths. We found that rurality was associated with fewer cases and deaths, consistent with a prior study . In contrast, percentage in poor or fair health was positively associated with both cases and deaths. This supports results from a recent study that found that patients with COVID-19 with cardiovascular disease, hypertension, diabetes mellitus, congestive heart failure, chronic kidney disease, and cancer had a higher risk of mortality, compared to patients with COVID-19 without these comorbidities . Moreover, as in prior studies [20, 35], we found that average air quality, as measured by PM2.5, was positively associated with both cases and deaths. Increased state-level testing was also associated with higher rates of COVID-19 cases and deaths, likely due to heightened surveillance. Contrary to our expectation, we found a significant inverse association between percentage of adult smokers and COVID-19 cases and deaths. Our aggregated, county-level findings align with recent individual-level studies suggesting that nicotine may have a protective effect on COVID-19 [43, 44]. Finally, the number of primary care physicians per capita was associated with both lower incident cases and deaths.
More generally, our results suggest a dynamic impact of COVID-19 on socially vulnerable communities. Contrary to expectation, we found that COVID-19 disproportionately impacted less vulnerable counties early and later in the pandemic. The initial shift could reflect local and state policy decisions, such as early re-openings in states like Georgia with a high percentage of vulnerable counties [45, 46]. By late September, however, the least vulnerable counties began to keep pace with and eventually overturn the more vulnerable counties. This suggests that the impact of COVID-19 is not static, but can migrate from less vulnerable counties to more vulnerable ones and back again over time. These results highlight the need for communities, even less vulnerable ones, to continue to monitor the spread of the disease, maintain adequate health care resources, and remain vigilant in wearing masks and practicing social distancing.
There are, however, limitations to this analysis. First, we used county-level SVI data from 2018. It is possible that social vulnerability factors may have changed between 2018 and 2020, but we used the most recent SVI data available from CDC. Second, it was challenging to model deaths because many counties reported no deaths on any given day. Future studies could employ zero-inflated models to better account for this aspect of the data [47–49]. Future work could also examine temporal trends in locations of correctional facilities, long-term care facilities, nursing homes, Indian reservations and Tribal lands, and other places with high rates of infection [50–53]. Finally, we examined trends in the US only; future work might replicate our study in low- and middle-income countries or those with emerging outbreaks.
Examining the impact of COVID-19 on vulnerable communities in the US is important both during and after the pandemic [18, 54]. Overwhelming evidence suggests that social determinants of health and community contextual factors contribute to disparities in both COVID-19 incident cases and deaths [2, 3, 6, 55] and more recently in vaccinations . It is therefore critically important to monitor and protect vulnerable populations as the pandemic continues to unfold.
Incidence rates (A) and risk ratios (B) for overall SVI using Johns Hopkins data.
Death rates (A) and risk ratios (B) for overall SVI using Johns Hopkins data. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
Incidence rates (A) and risk ratios (B) for overall SVI in the unadjusted analysis.
Death rates (A) and risk ratios (B) for overall SVI in the unadjusted analysis. Maximum and minimum values, as well as the first and last crossover dates, are highlighted by vertical lines.
- 1. CDC COVID Data Tracker: United States Laboratory Testing 2020 [Available from: https://www.cdc.gov/covid-data-tracker/index.html#testing.
- 2. Millett GA, Jones AT, Benkeser D, Baral S, Mercer L, Beyrer C, et al. Assessing Differential Impacts of COVID-19 on Black Communities. Annals of epidemiology. 2020;47:37–44. pmid:32419766
- 3. Yancy CW. COVID-19 and African Americans. Jama. 2020;323(19):1891–2. pmid:32293639
- 4. Rodriguez-Diaz CE, Guilamo-Ramos V, Mena L, Hall E, Honermann B, Crowley JS, et al. Risk for COVID-19 infection and death among Latinos in the United States: Examining heterogeneity in transmission dynamics. Annals of epidemiology. 2020. pmid:32711053
- 5. Du RH, Liang LR, Yang CQ, Wang W, Cao TZ, Li M, et al. Predictors of mortality for patients with COVID-19 pneumonia caused by SARS-CoV-2: a prospective cohort study. Eur Respir J. 2020;55(5).
- 6. Kim SJ, Bostwick W. Social Vulnerability and Racial Inequality in COVID-19 Deaths in Chicago. Health education & behavior: the official publication of the Society for Public Health Education. 2020;47(4):509–13. pmid:32436405
- 7. Macias Gil R, Marcelin JR, Zuniga-Blanco B, Marquez C, Mathew T, Piggott DA. COVID-19 Pandemic: Disparate Health Impact on the Hispanic/Latinx Population in the United States. The Journal of Infectious Diseases. 2020. pmid:32729903
- 8. Lieberman-Cribbin W, Tuminello S, Flores RM, Taioli E. Disparities in COVID-19 Testing and Positivity in New York City. American journal of preventive medicine. 2020;59(3):326–32. pmid:32703702
- 9. Raifman MA, Raifman JR. Disparities in the Population at Risk of Severe Illness From COVID-19 by Race/Ethnicity and Income. American journal of preventive medicine. 2020;59(1):137–9. pmid:32430225
- 10. Salisbury-Afshar EM, Rich JD, Adashi EY. Vulnerable Populations: Weathering the Pandemic Storm. American journal of preventive medicine. 2020;58(6):892–4. pmid:32371018
- 11. Thebault RBT A.;, Williams V. The coronavirus is infecting and killing black Americans at an alarmingly high rate. Washington Post. 2020 April 7.
- 12. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395(10229):1054–62. pmid:32171076
- 13. Lithander FE, Neumann S, Tenison E, Lloyd K, Welsh TJ, Rodrigues JCL, et al. COVID-19 in older people: a rapid clinical review. Age Ageing. 2020;49(4):501–15. pmid:32377677
- 14. Turner-Musa J, Ajayi O, Kemp L. Examining Social Determinants of Health, Stigma, and COVID-19 Disparities. Healthcare (Basel). 2020;8(2).
- 15. Khalatbari-Soltani S, Cumming RC, Delpierre C, Kelly-Irving M. Importance of collecting data on socioeconomic determinants from the early stage of the COVID-19 outbreak onwards. Journal of epidemiology and community health. 2020;74(8):620–3. pmid:32385126
- 16. Thakur N, Lovinsky-Desir S, Bime C, Wisnivesky JP, Celedón JC. The Structural and Social Determinants of the Racial/Ethnic Disparities in the U.S. COVID-19 Pandemic: What’s Our Role? Am J Respir Crit Care Med. 2020. pmid:32677842
- 17. Edwards JK, Lessler J. What Now? Epidemiology in the Wake of a Pandemic. American journal of epidemiology. 2020.
- 18. Kuy S, Tsai R, Bhatt J, Chu QD, Gandhi P, Gupta R, et al. Focusing on Vulnerable Populations During COVID-19. Acad Med. 2020. pmid:32639264
- 19. Hurst SA. Vulnerability in research and health care; describing the elephant in the room? Bioethics. 2008;22(4):191–202. pmid:18405317
- 20. Karaye IM, Horney JA. The Impact of Social Vulnerability on COVID-19 in the U.S.: An Analysis of Spatially Varying Relationships. American journal of preventive medicine. 2020. pmid:32703701
- 21. Nayak A, Islam SJ, Mehta A, Ko YA, Patel SA, Goyal A, et al. Impact of Social Vulnerability on COVID-19 Incidence and Outcomes in the United States. medRxiv. 2020. pmid:32511437
- 22. Khazanchi R, Beiter ER, Gondi S, Beckman AL, Bilinski A, Ganguli I. County-Level Association of Social Vulnerability with COVID-19 Cases and Deaths in the USA. J Gen Intern Med. 2020:1–4. pmid:32578018
- 23. Wang C, Li Z, Clay Mathews M, Praharaj S, Karna B, Solís P. The spatial association of social vulnerability with COVID-19 prevalence in the contiguous United States. International Journal of Environmental Health Research. 2020:1–8. pmid:33228411
- 24. Technology NIoSa. Federal Information Processing Standards Publications (FIPS PUBS) 2018 [Available from: https://www.nist.gov/itl/publications-0/federal-information-processing-standards-fips.
- 25. Survey USG. Mapping, Remote Sensing, and Geospatial Data: How many counties are in the United States? 2020 [Available from: https://www.usgs.gov/faqs/how-many-counties-are-united-states?qt-news_science_products=0#qt-news_science_products.
- 26. USAFacts. US Coronavirus Cases and Deaths 2020 [Available from: https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/.
- 27. Johns Hopkins University CfSSaE. COVID-19 case counts 2020 [Available from: http://www.arcgis.com/home/item.html?id=628578697fb24d8ea4c32fa0c5ae1843
- 28. Johns Hopkins University CfSSaE. FAQ—COVID-19 UNITED STATES CASES BY COUNTY 2020 [Available from: https://coronavirus.jhu.edu/us-map-faq.
- 29. United States Census Bureau. State Population Totals: 2010–2019 2019 [Available from: https://www.census.gov/data/tables/time-series/demo/popest/2010s-counties-total.html.
- 30. Centers for Disease Control and Prevention/Agency for Toxic Substances and Disease Registry/ Geospatial Research A, and Services Program. Social Vulnerability Index 2018 Database US 2018 [Available from: https://svi.cdc.gov/data-and-tools-download.html.
- 31. Centers for Disease Control and Prevention County Population Estimates 2019 [Available from: https://www2.census.gov/programs-surveys/popest/datasets/2010-2019/counties/asrh/.
- 32. Robert Wood Johnson Foundation. County Health Rankings & Roadmaps: Rankings Data & Documentation 2020 [Available from: https://www.countyhealthrankings.org/explore-health-rankings/rankings-data-documentation
- 33. Information NCfE. National Oceanic and Atmospheric Administration 2020 [Available from: https://www.ncei.noaa.gov.
- 34. Flanagan BE, Hallisey EJ, Adams E, Lavery A. Measuring Community Vulnerability to Natural and Anthropogenic Hazards: The Centers for Disease Control and Prevention’s Social Vulnerability Index. Journal of environmental health. 2018;80(10):34–6. pmid:32327766
- 35. Wu X, Nethery RC, Sabath BM, Braun D, Dominici F. Exposure to air pollution and COVID-19 mortality in the United States: A nationwide cross-sectional study. medRxiv. 2020. pmid:32511651
- 36. Grundy EJ, Suddek T, Filippidis FT, Majeed A, Coronini-Cronberg S. Smoking, SARS-CoV-2 and COVID-19: A review of reviews considering implications for public health policy and practice. Tob Induc Dis. 2020;18:58. pmid:32641924
- 37. Ssentongo P, Ssentongo AE, Heilbrunn ES, Ba DM, Chinchilli VM. Association of cardiovascular disease and 10 other pre-existing comorbidities with COVID-19 mortality: A systematic review and meta-analysis. PloS one. 2020;15(8):e0238215. pmid:32845926
- 38. Ku BS, Druss BG. Associations Between Primary Care Provider Shortage Areas and County-Level COVID-19 Infection and Mortality Rates in the USA. J Gen Intern Med. 2020:1–2. pmid:32827110
- 39. Bureau USC. National Population Totals and Components of Change: 2010–2019 2019 [Available from: https://www.census.gov/data/tables/time-series/demo/popest/2010s-national-total.html#par_textimage.
- 40. Kneib T, Konrath S, Fahrmeir L. High dimensional structured additive regression models: Bayesian regularization, smoothing and predictive performance. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2011;60(1):51–70.
- 41. Jonathan WP, James S. Fully Bayesian inference for neural models with negative-binomial spiking. 2012:1898–906.
- 42. Dadaneh SZ, Zhou M, Qian X. Bayesian negative binomial regression for differential expression with confounding factors. Bioinformatics. 2018;34(19):3349–56. pmid:29688254
- 43. Farsalinos K, Barbouni A, Poulas K, Polosa R, Caponnetto P, Niaura R. Current smoking, former smoking, and adverse outcome among hospitalized COVID-19 patients: a systematic review and meta-analysis. Ther Adv Chronic Dis. 2020;11:2040622320935765. pmid:32637059
- 44. Changeux JP, Amoura Z, Rey FA, Miyara M. A nicotinic hypothesis for Covid-19 with preventive and therapeutic implications. C R Biol. 2020;343(1):33–9. pmid:32720486
- 45. Guest JL, Del Rio C, Sanchez T. The Three Steps Needed to End the COVID-19 Pandemic: Bold Public Health Leadership, Rapid Innovations, and Courageous Political Will. JMIR Public Health Surveill. 2020;6(2):e19043–e. pmid:32240972
- 46. Chiu WA, Fischer R, Ndeffo-Mbah ML. State-level impact of social distancing and testing on COVID-19 in the United States. Res Sq. 2020.
- 47. Neelon B, Chung D. The LZIP: A Bayesian latent factor model for correlated zero-inflated counts. Biometrics. 2017;73(1):185–96. pmid:27378066
- 48. Neelon B, Chang HH, Ling Q, Hastings NS. Spatiotemporal hurdle models for zero-inflated count data: Exploring trends in emergency department visits. Stat Methods Med Res. 2016;25(6):2558–76. pmid:24682266
- 49. Neelon B. Bayesian Zero-Inflated Negative Binomial Regression Based on Polya-Gamma Mixtures. Bayesian Anal. 2019;14(3):829–55. pmid:33584949
- 50. Kinner SA, Young JT, Snow K, Southalan L, Lopez-Acuña D, Ferreira-Borges C, et al. Prisons and custodial settings are part of a comprehensive response to COVID-19. Lancet Public Health. 2020;5(4):e188–e9. pmid:32197116
- 51. Rodriguez-Lonebear D, Barceló NE, Akee R, Carroll SR. American Indian Reservations and COVID-19: Correlates of Early Infection Rates in the Pandemic. J Public Health Manag Pract. 2020;26(4):371–7. pmid:32433389
- 52. Fallon A, Dukelow T, Kennelly SP, O’Neill D. COVID-19 in nursing homes. Qjm. 2020;113(6):391–2. pmid:32311049
- 53. Davidson PM, Szanton SL. Nursing homes and COVID-19: We can and should do better. Journal of clinical nursing. 2020;29(15–16):2758–9. pmid:32281165
- 54. Bibbins-Domingo K. This Time Must Be Different: Disparities During the COVID-19 Pandemic. Ann Intern Med. 2020;173(3):233–4. pmid:32343767
- 55. Laurencin CT, McClinton A. The COVID-19 Pandemic: a Call to Action to Identify and Address Racial and Ethnic Disparities. Journal of racial and ethnic health disparities. 2020;7(3):398–402. pmid:32306369
- 56. Ndugga N, Pham, O., Hill, L., Artiga, S., Mengistu, S. Early State Vaccination Data Raise Warning Flags for Racial Equity 2021 [Available from: https://www.kff.org/policy-watch/early-state-vaccination-data-raise-warning-flags-racial-equity/.