Abstract
Background Over one million COVID-19 deaths have been recorded in the United States. Sustained global SARS-CoV-2 transmission has led to the emergence of new variants with increased transmissibility, virulence, and/or immune evasion. The specific burden of mortality from each variant over the course of the U.S. COVID-19 epidemic remains unclear.
Methods We constructed an epidemiologic model using data reported by the CDC on COVID-19 mortality and circulating variant proportions to estimate the number of recorded COVID-19 deaths attributable to each SARS-CoV-2 variant in the U.S. We conducted sensitivity analysis to account for parameter uncertainty.
Findings Of the 1,003,419 COVID-19 deaths recorded as of May 12, 2022, we estimate that 460,124 (46%) were attributable to WHO-designated variants. By U.S. Census Region, the South recorded the most variant deaths per capita (median estimate 158 per 100,000), while the Northeast recorded the fewest (111 per 100,000). Over 40 percent of national COVID-19 deaths were estimated to be caused by the combination of Alpha (median estimate 39,548 deaths), Delta (273,801), and Omicron (117,560).
Interpretation SARS-CoV-2 variants that have emerged around the world have imposed a significant mortality burden in the U.S. In addition to national public health strategies, greater efforts are needed to lower the risk of new variants emerging, including through global COVID-19 vaccination, treatment, and outbreak mitigation.
Introduction
Over one million deaths have been recorded in the United States from coronavirus disease 2019 (COVID-19). Since severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in Wuhan, China in late 2019, the virus has continued to evolve, with multiple new variants characterized by increased transmissibility, virulence, and/or immune evasion.
Variants reported around the world have repeatedly changed the trajectory of the U.S. epidemic. As of May 12, 2022, the World Health Organization (WHO) has named five variants of concern (VOC) that were originally detected in four different continents [Table 1]. All five variants of concern were more transmissible than the original virus, and spread to the U.S.1–3 Delta (lineage B.1.617.2) was particularly severe from a clinical standpoint, and drove a surge of hospitalizations across the U.S in the summer and fall of 2021.4 While Omicron (lineage B.1.1.529) appears to cause less severe disease than Delta on a per-infection basis—likely due to a combination of lower intrinsic severity and the high observed concentration of cases among the previously vaccinated or infected5—its elevated transmissibility, including among the seropositive, resulted in the largest wave of SARS-CoV-2 transmission in the U.S. to date.6, 7 Vaccines developed against the non-variant SARS-CoV-2 are less effective at protecting against infection with Delta and Omicron, but remain highly effective against severe disease.8–10
In addition to variants of concern, the WHO and U.S. Centers for Disease Control and Prevention (CDC) have classified a number of additional SARS-CoV-2 lineages as variants of interest (VOI) or under monitoring (VUM).13 The variants have emerged in the context of sustained global transmission. More than 500 million COVID-19 cases have been reported globally, reflecting only a fraction of total infections, and giving SARS-CoV-2 many opportunities to mutate and evolve.14 One third of the global population has still not received a vaccine dose, limiting the benefits of vaccination on reducing transmission and viral replication at a population level.15
Recent studies have described clinical outcomes associated with SARS-CoV-2 variants, and analyzed factors associated with COVID-19 mortality in the U.S. including race and geography.16, 17 The aim of our study is to estimate the burden of deaths associated with each SARS-CoV-2 variant in the U.S. We construct a model using data reported by the CDC on COVID-19 deaths by jurisdiction and circulating variant proportions.
Methods
To estimate the number of recorded COVID-19 deaths attributable to each variant, we back-distributed deaths to estimate the number of ultimately fatal COVID-19 cases testing positive on each day in each jurisdiction, and compared these counts to proportions of variants among sequenced cases in the same time period and location, adjusted for differences in disease severity between variants.
Data
Our analysis relies on provisional counts of COVID-19 deaths from the CDC’s National Center for Health Statistics (NCHS).18 This dataset is based on death certificate records and reports the number of deaths with confirmed or presumed COVID-19 by jurisdiction (the 50 states, New York City, District of Columbia, and Puerto Rico) and time period (week, month, year, and total) of occurence. All data were accessed May 12, 2022.
The NCHS data are subject to change as new death certificate records are received and processed by the CDC. While the majority of COVID-19 deaths are reported within 10 days of the date of death, state reporting timelines vary significantly.19 Counts from recent weeks may be incomplete. However, because adjustments are almost always upwards, the NCHS provisional COVID-19 deaths can be considered a lower bound of the number of recorded COVID-19 deaths that have occurred.
NCHS death counts are also displayed by the week of occurrence, rather than the day of occurrence. Since the delay distributions that we applied to match deaths to the appropriate variant proportions (see below) require analysis at the level of days, we imputed daily deaths within each week by sampling from a uniform distribution. Finally, for each jurisdiction, the NCHS suppresses death counts between 1 and 9 for privacy reasons. As a result, the specific week is not known for 3,628 of the 1,003,419 COVID-19 deaths recorded in this dataset (0.4% of deaths overall, 0.002% to 39.4% of deaths in each jurisdiction). In this analysis, we do not assign these deaths to a week or otherwise attribute them to individual variants, meaning that our variant-specific death estimates should be treated as lower bounds.
Variant proportions were retrieved either directly20 from the CDC, or based on code and data made publicly available21 by the CDC. We considered 26 mutually-exclusive categories that deaths could be assigned to: the 5 current and former variants of concern (Alpha, Beta, Gamma, Delta, and Omicron), 19 other variants and lineages which have been recognized by the WHO and detected in the U.S. (A.23.1, A.27, A.28, B.1.1.318, B.1.1.519, B.1.214.2, B.1.617.3, B.1.620, B.1.640, C.16, C.36, Epsilon, Eta, Iota, Kappa, Lambda, Mu, Theta, or Zeta), a composite group of the early non-VOI/VOC/VBM virus lineages, which we will refer to as “non-variant”, and “unknown” (deaths for which the week of occurrence is suppressed).9
CDC genomic surveillance data was available from January 3rd, 2021. For the purpose of this analysis, we assumed that no variants were present in the U.S. prior to this date. In reality, there is evidence for circulation of the Alpha2, 22, 23 Epsilon24, and Iota22 variants in the U.S. in late 2020, albeit at low frequencies: thus, estimates of the mortality burden for these variants should be considered conservative. For the period between January 3rd, 2021, and January 1st, 2022, we used publicly available19 code and data underpinning the CDC’s variant proportion analysis to generate weighted variant proportions for each jurisdiction and U.S. Health and Human Services (HHS) region in non-overlapping weekly intervals. If no data on sequenced cases was available for a week in any given jurisdiction, we substituted variant proportions from the wider HHS region.
For the period after January 1st, 2022, the code and data described above was not publicly available. Instead, we used variant proportions retrieved directly from the CDC for national and HHS-regional levels in non-overlapping two-week intervals.18 We assumed that all states in a region had the same variant distribution during any given 2-week period. This simplifying assumption likely only had a marginal impact on our results, since the overwhelming majority of cases sampled in this period were attributed to Omicron.
Analysis
Given the delays between symptom onset, testing, and death, we applied lags to infer the likely time of sample collection for new deaths. For our primary analysis, we assumed that sample collection for testing always took place either 0, 1, 2, or 3 days after symptom onset, with an associated probability of 25% for each of these 4 possible lag times. This is a simplifying assumption on our part. We also assumed the lag from symptom onset to death, lagonset to death, follows a lognormal distribution with a mean and standard deviation of 20.2 and 11.6 days, respectively (mean and standard deviation of the log(lag) = 2.863 and 0.534 respectively), truncated at 50 days. This corresponds to the best-fitting distribution for right-censored onset-to-death observations in Linton et al.25
Before applying variant proportions to COVID-19 deaths, we adjusted variant proportions among cases for variant-specific differences in disease severity. For instance, during periods where Omicron and Delta co-circulated, we expect Delta to be overrepresented among deaths relative to its share of cases (most of which will not be fatal), since Delta infections are more likely to result in severe disease than Omicron infections (whether due to higher intrinsic severity, a greater concentration of cases in unvaccinated individuals, or a combination).5
We derived a variant i’s proportion of fatal cases, pi,fatal from its share of all cases pi,all and the relative risk of death RRi among confirmed cases infected with variant i compared to non-variant virus in a given time period t as
To parameterize RRi, we derived data on the relative severity of variants from a European Centre for Disease Prevention and Control (ECDC) review which was last updated in early April 2022.26 This review found no evidence for differences in severity (relative to the dominant variant(s) at the time of emergence) for the B.1.617.3, Epsilon, Eta, Iota, Kappa, Lambda, Mu, Theta, or Zeta variants. Evidence of differential severity was reported for the Alpha, Beta, Gamma, Delta, and Omicron variants [Table 2].
Estimates of the relative risk of death for different SARS-CoV-2 Variants of Concern
Based on the estimates above, we assumed that RRi (the relative risk of death among confirmed cases caused by variant i vs non-variant in any given period of time), is 1.4 for Alpha, 1.5 for Beta, 1.2 for Gamma, 2.3 for Delta, 0.5 for Omicron (∼0.21*RRDelta), and 1 for the non-variant and all other variants in the primary analysis.
As a sensitivity analysis, we varied the assumed relative severity of Omicron from RRomicron = 0. 15, the approximate ratio of observed Omicron and Delta case fatality rates in , which corresponds approximately to the upper 95% bound of the Omicron vs Delta hazard ratio of death estimated by Lewnard et al., 0.42, times RRdelta = 2. 3.29, 30 The results from these sensitivity analyses provide upper and lower limits for the plausible number of recorded COVID-19 deaths caused by the Omicron variant.
Finally, after deriving appropriate severity-adjusted variant proportion values pi, fatal for each jurisdiction and time period, we estimated the number of recorded COVID-19 deaths caused by each variant i as
for each jurisdiction, where Biweek(t) species the non-overlapping two-week period containing day t. We reported national, regional (based on the four U.S. Census Regions.31 For this analysis, we included Puerto Rico with states in the South region), and jurisdiction-specific estimates.
Median estimates and 95% credible intervals were derived by running the model described above 250 times. Within each model run, we performed bootstrap resampling of the variant proportions at each location and time period before adjusting for severity, and resampled the day of death within the NCHS-specified week, the onset-to-death delay, and the onset-to-testing delay for each death in the dataset.
Results
National estimates
Of the 1,003,419 (299 per 100,000 people) recorded COVID-19 deaths in our dataset at the national level (the 50 states, the District of Columbia, and Puerto Rico), our median estimate is that 539,667 (95% credible interval: 538,928 - 540,466 deaths, 54% of all COVID-19 deaths, 161 deaths per 100,000) were caused by non-variant SARS-CoV-2, while 460,124 (95% CI: 459,325 - 460,863 deaths, 46% of deaths, 137 per 100,000) were caused by a WHO-classified variant [Table 3]. Of the variants that have had the greatest impact on the COVID-19 epidemic in the United States, we estimate that the Alpha variant caused a median of 39,548 recorded deaths (95% CI: 38,890 - 40,133 deaths, 4% of deaths, 12 per 100,000) across model runs, the Delta variant caused a median of 273,801 recorded deaths (95% CI: 273,483 - 274,168 deaths, 27% of deaths, 82 per 100,000), and the Omicron variant has so far caused a median of 117,560 recorded deaths (95% CI: 117,252 - 117,835 deaths, 12% of deaths, 35 per 100,000).
Estimated Number of Recorded COVID-19 Deaths in the U.S. Attributable to Each Variant
In sensitivity analyses where we increased or decreased the assumed severity of Omicron relative to other variants, estimates of recorded deaths caused by Delta ranged from medians of 265,228 deaths (95% CI: 264,898 - 265,600 deaths, 26% of deaths, 79 per 100,000) to 291,946 (95% CI: 291,623 - 292,319 deaths, 29% of deaths, 87 per 100,000) across model runs, while median estimates of recorded deaths caused by Omicron ranged from 99,023 (95% CI: 98,727 - 99,352 deaths, 10% of deaths, 30 per 100,000) to 126,327 (95% CI: 126,022 - 126,582 deaths, 13% of deaths, 38 per 100,000).
Of the COVID-19 deaths recorded by NCHS, 3,628 recorded deaths (0.3%, 1 per 100,000) could not be attributed to a variant because the week of death was not given at the state-level in the dataset. National estimates of the burden of recorded COVID-19 deaths attributable to each variant are given in Table 3.
Regional estimates
The Northeast, South, and Midwest recorded the most COVID-19 deaths per capita (328, 317, and 306 per 100,000 respectively), while the West recorded the fewest (244 per 100,000) [Figure 1A]. However, our estimates indicate the Northeast experienced the fewest COVID-19 variant deaths per capita (median 111 per 100,000, 95% credible interval: 111 - 112). As a result, variants accounted for 34% of all COVID-19 deaths in the Northeast, compared to nearly half of deaths in each of the other three census regions (Midwest: 46%, South: 50%, West: 49%) [Tables 4-5].
Estimated COVID-19 Deaths per Capita by Variant and U.S. Census Region
Non-Variant
The median estimated burden of recorded deaths caused by non-variant SARS-CoV-2 was highest in the Northeast with 215 COVID-19 deaths per 100,000 (123,103 deaths, 66% of all COVID-19 deaths in this region), and lowest in the West with 124 deaths per 100,000 (97,549 deaths, 51% of all COVID-19 deaths in this region) [Figure 1B, Tables 4-5]. New York City experienced the highest median incidence of deaths from non-variant SARS-CoV-2 (315 deaths per 100,000, or 26,665 deaths overall, 76% of all COVID-19 deaths in NYC) [Supplementary Tables 1-2].
Estimated COVID-19 Deaths per Capita in U.S. by Census Region and Variants, Primary Analysis
Estimated COVID-19 Deaths in U.S. by Census Region and Variants, Primary Analysis
Alpha
After its emergence in the winter of 2020/21, the median estimated burden of deaths caused by the Alpha variant was highest in the South with 15 recorded COVID-19 deaths per 100,000 (19,143 deaths, 5% of all COVID-19 deaths in this region) and lowest in the West with 7 recorded COVID-19 deaths per 100,000 (5,339 deaths overall, 3% of all COVID-19 deaths in this region) [Figure 1D, Tables 4-5]. Michigan experienced the highest median incidence of Alpha deaths across all jurisdictions (31 deaths per 100,000, or 3,095 deaths overall, 10% of all COVID-19 deaths in Michigan) [Supplementary Tables 1-2].
Estimated COVID-19 Deaths per Capita in the U.S. by Jurisdiction and Variant, Primary Analysis
Estimated COVID-19 Deaths in the U.S. by Jurisdiction and Variant, Primary Analysis
Delta
The median estimated burden of deaths caused by the Delta variant was highest in the South with 97 recorded COVID-19 deaths per 100,000 (125,925 deaths, 30% of all COVID-19 deaths in this region), followed by the Midwest with 92 recorded COVID-19 deaths per 100,000 (63,049 deaths, 30% of all COVID-19 deaths in this region). The Northeast experienced the fewest deaths per capita with 57 recorded COVID-19 deaths per 100,000 (32,414 deaths, 17% of all COVID-19 deaths in this region [Figure 1E, Tables 4-5]. West Virginia experienced the highest median incidence of Delta deaths (154 per 100,000, or 2,740 deaths overall, 40% of all COVID-19 deaths in West Virginia) across all jurisdictions [Supplementary Tables 1-2].
Omicron
So far, the median estimated burden of deaths caused by the Omicron variant has been highest in the South with 41 recorded COVID-19 deaths per 100,000 (52,912 deaths, 13% of all COVID-19 deaths in this region) and lowest in the West with 29 recorded COVID-19 deaths per 100,000 (22,738 deaths, 12% of all COVID-19 deaths in this region, respectively) [Figure 1F, Tables 4-5]. Mississippi has experienced the highest incidence of Omicron deaths to date (60 per 100,000, or 1,773 deaths overall, 13% of all COVID-19 deaths in Mississippi) across all jurisdictions [Supplementary Tables 1-2].
The specific number and share of Omicron and Delta COVID-19 deaths varied in sensitivity analyses where we assumed Omicron was more or less severe (compared to our primary analysis), but the overall regional patterns described above persisted [Supplementary Tables 3-4].
Estimated Omicron and Delta COVID-19 Deaths in U.S. by Census Region, Sensitivity Analysis Assuming Higher or Lower Omicron Severity
Estimated Omicron and Delta COVID-19 Deaths in U.S. by Jurisdiction, Sensitivity Analysis Assuming Higher or Lower Omicron Severity
Other
The median estimated incidence of deaths from other variants (not Alpha, Delta, or Omicron) was higher in the West (16 COVID-19 deaths per 100,000) and Northeast (10 COVID-19 deaths per 100,000) than the Midwest and the South (5 and 6 recorded COVID-19 deaths per 100,000, respectively) [Figure 1G, Tables 4-5]. In addition to Alpha, the Epsilon and Iota variants accounted for a significant share of cases in the West and Northeast prior to the emergence of Delta.
Discussion
Sustained global transmission of SARS-CoV-2 has fueled the emergence of variants that have caused ∼460,000 deaths in the U.S. Alpha, Delta and Omicron—which were first detected in Europe, Asia, and Africa—have led to ∼430,000 deaths so far. The other variants, including those that were first detected in the U.S., accounted for an additional ∼30,000 deaths.
More than 40 percent of national COVID-19 deaths were caused by SARS-CoV-2 variants first detected outside the United States. The phrase “no one is safe until everyone is safe” has been repeatedly invoked as a clarion call during the pandemic. Our analysis provides support for this claim by documenting the significant impact of SARS-CoV-2 variants, which have emerged in the context of uncontrolled transmission both locally and around the world, on U.S. mortality.
Our analysis also suggests the potential benefit of rapidly implementing strategies to reduce the impact of novel variants after the onset of a viral epidemic. We estimate that Delta and Omicron together have caused over 390,000 deaths in the U.S., despite the fact both variants were designated as variants of concern after multiple COVID-19 vaccines had received emergency authorization. Although causality cannot be established definitively, inequalities in global COVID-19 vaccine production and distribution, along with uncontrolled spread of the virus, may have contributed to viral evolution and circulation. For example, Omicron was detected in November 2021 by scientists in Botswana and South Africa, where less than a quarter of the population was fully vaccinated.32
Robust global vaccination and treatment campaigns may continue to play an important role in reducing viral replication. A leading hypothesis holds that variants of concern have emerged from immunocompromised individuals with persistent infections that provide more opportunity for viral selection and evolution.33–35 While vaccine effectiveness is reduced against Delta and especially Omicron, booster doses provide modest, albeit waning, protection against infection and appear to reduce transmission risk by lowering infectious viral load.36–39 When breakthrough cases do occur, vaccinated persons also clear infections faster than unvaccinated persons.40 Second-generation vaccines in development could yield significant benefit if they offer greater protection against transmission. In addition, monoclonal antibodies have received emergency authorization for preexposure prophylaxis, and emerging evidence suggests oral antiviral treatments lower SARS-CoV-2 viral load, and clear infections faster.41–43
Finally, our analysis reveals the impact of the variants across the country. All regions faced a substantial burden of deaths from the variants. The Northeast recorded the lowest, but still substantial, number of variant deaths, both overall (63,542) and per-capita (111 per 100,000). Some regions were disproportionately impacted, likely reflecting differences in vaccination coverage, prior immunity44, use of non-pharmaceutical interventions, demographics and social vulnerability. For example, the South recorded the most variant deaths per capita (median estimate 158 per 100,000) and overall (205,729). It also experienced the highest deaths per capita in each of the Alpha, Delta, and Omicron waves (15, 97 and 41 per 100,000 respectively). Of the 18 states and territories included in the region, 10 had fully vaccinated less than 60% of their population as of early May 2022, according to the CDC.45
Our analysis is limited by the quality of public data. NCHS data are considered provisional because of the lag between the date of death and the processing of death certificates by the CDC. Nonetheless, a potential lag would lead to underestimates of deaths caused by variants (due to undercounting of deaths in more recent weeks when Omicron is dominant). In general, reported deaths represent an undercount of total COVID-19 deaths, given limitations in testing and data.46 In addition, there is considerable uncertainty associated with some state variant proportions due to insufficient sequencing. Our model also uses variant proportion figures based on clinical testing which may lag behind actual variant circulation. For example, wastewater surveillance suggests that Omicron was circulating in the country before the first case was reported.47 This would also lead to underestimates of deaths caused by variants. Finally, we do not model disease transmission dynamics so our estimates should not be interpreted as the number of deaths that would have been prevented if novel variants did not emerge. Even in the absence of new variants, the non-variant SARS-CoV-2 would have continued to spread, resulting in additional deaths.
SARS-CoV-2 variants detected around the world have imposed a significant mortality burden in the U.S. In addition to national public health strategies, greater efforts are needed to reduce the risk of new variants emerging globally. However, U.S. government officials have warned that they are running out of COVID-19 funds.48 Congressional lawmakers recently excluded $5 billion in funding for global treatment and vaccination campaigns from a new spending proposal, and there has been limited investment in next-generation vaccines that may offer greater protection against transmission.49, 50 Our analysis shows the significant impact of SARS-CoV-2 variants on U.S. public health. In doing so, it highlights the potential risk posed to Americans by new variants–a risk that is only amplified with low rates of global COVID-19 vaccination, and low availability of COVID-19 diagnostics, treatments, and prophylaxis.
Data Availability
Datasets and R code needed to replicate this analysis are available online at https://github.com/joewalker127/US_Deaths_COVID_Variants
Author Contributions
Concept and design: JW, ZR Statistical analysis: JW
Interpretation of data and findings: JW, NG, GG, VP, ZR
Drafting and revision of the manuscript: JW, NG, GG, VP, ZR
Conflicts of Interest
N.D.G. is a consultant for Tempus Labs and the National Basketball Association for work related to COVID-19 but is outside the submitted work. V.P discloses reimbursement from Merck and Pfizer for travel to Scientific Input Engagements unrelated to COVID-19 and membership on the WHO Immunization and Vaccine-related Implementation Research Advisory Committee (IVIR-AC).
Data and Code Availability
Datasets and R code needed to replicate this analysis are available online at https://github.com/joewalker127/US_Deaths_COVID_Variants
Acknowledgements
The authors are grateful to Zena Ahmed for research assistance, to Dr. Daniel Weinberger for providing constructive feedback on the analysis, and to Drs. Prabasaj Paul and Molly Steele for assistance in identifying relevant data sources.