Abstract
Objective Clinical sequelae have not been well characterized during the post-acute phase of SARS-CoV-2 among adults 18 to 65 years old, and this study sought to fill that gap by evaluating excess risk and relative hazards for developing incident clinical sequelae during the post-acute phase.
Design Retrospective cohort study including three propensity-matched groups.
Setting This study merged three data sources from a large United States health plan: a large national administrative claims database, an outpatient lab testing database, and an inpatient hospital admissions database.
Participants Individuals 18 to 65 years old with continuous health plan enrollment from January 2019 to date of SARS-CoV-2 diagnosis. Three comparator groups were identified and propensity-score matched to individuals infected with SARS-CoV-2: a 2020 comparator group, a historical 2019 comparator group and a historical comparator group with viral lower respiratory tract illness (vLRTI).
Main outcome measures Over 50 clinical sequelae during the post-acute phase (index date + 21 days) were ascertained using ICD-10 codes. Excess risk due to SARS-CoV-2 during the 4 months following the acute phase of illness and hazard ratios with 95% Bonferroni-corrected confidence intervals were calculated.
Results This study found 14% of adults ≤65 years of age who were infected with SARS-CoV-2 (n=193113) had at least one new clinical sequelae that required medical attention during the post-acute phase of illness. When considering risk for specific sequelae attributable to SARS-Cov-2 infection during the post-acute phase, clinical outcomes including chronic respiratory failure, cardiac arrythmia, hypercoagulability, encephalopathy, peripheral neuropathy, amnesia (memory difficulty), diabetes, liver test abnormalities, myocarditis, anxiety and fatigue were significantly elevated compared to the three propensity-matched comparator groups (2020, 2019, vLRTI). Significant risk differences due to SARS-CoV-2 infection ranged from 0.02 to 2.26 per 100 people and hazard ratios ranged from 1.24 to 25.65 when compared to the 2020 comparator group.
Conclusions Our results confirm excess risk for developing clinical sequelae due to SARS-CoV-2 during the post-acute phase, including specific types of sequelae less commonly seen among other viral illnesses. Although individuals who were older, had pre-existing conditions, and were hospitalized due to COVID-19 were at greatest excess risk, younger adults (≤50 years), adults who did not have pre-existing conditions or adults who were not hospitalized due to COVID-19 were still at elevated risk for developing new clinical sequelae. The elevated risk for incident sequelae during the post-acute phase is relevant for healthcare planning.
What is already known on this topic Small observational studies and case reports of hospitalized patients have shown some COVID-19 survivors suffer from short- and long-term sequelae. Few studies have characterized the excess risk of clinical sequelae attributable to SARS-CoV-2 during the post-acute phase among adults ≤65 years of age in a large generalizable sample.
What this study adds This study found 14% of individuals ≤65 years of age who were infected with SARS-CoV-2 (n=193113) had a diagnosis of at least one new sequelae that required medical attention during the post-acute phase of illness. Elevated risk for specific clinical sequelae during the post-acute phase of illness was noted across a range of organ systems including cardiovascular, neurologic, kidney, respiratory, and mental health complications. The risk for incident sequelae increases with age, pre-existing conditions, and hospitalization for COVID-19; however, even among adults ≤ 50 years of age and individuals without pre-existing conditions or hospitalization due to COVID-19, risk for some clinical sequelae is still elevated. These results indicate where additional diagnostic follow-up, rehabilitation, and symptom management may be warranted among younger adults with milder infection.
Introduction
Emerging data suggest sequelae of infection with SARS-CoV-2 and the disease it causes, COVID-19, may vary in presentation and extend beyond the typical post-viral recovery period. As such, epidemiologic interest in post-acute morbidity among survivors is growing. Some survivors experience serious complications during the acute phase of the illness affecting pulmonary, cardiovascular, hepatic, renal, cognitive, and neurologic function.(1–4) Survivors also report a range of persistent symptoms adversely impacting physical, mental, and social well-being.(5–8) At least some of these complications may occur independent of COVID-19 severity.(9) While individuals hospitalized with community-acquired (non-COVID-19) pneumonia or influenza face risk of cardiovascular, cerebrovascular, and neurological complications,(10,11) the degree of increased risk resulting from SARS-CoV-2 infection remains unclear. Longitudinal studies on survivors from other coronaviruses (MERS and SARS) suggest long-term physical and mental sequelae are not uncommon.(12–14)
Most published studies to date have been small and primarily focused on clinical sequelae among hospitalized patients.(15) As a result, many of these studies may not be generalizable to the larger population of SARS-CoV-2 infected individuals. Little is known about the incidence of clinical sequelae due to SARS-CoV-2 infection during the post-acute phase of illness among adults 18 to 65 years old who may be considered at lower risk for severe COVID-19. Moreover, few studies have been powered to evaluate whether factors such as age, gender, pre-existing conditions, and hospitalization status modify risk of clinical sequelae in the post-acute phase of illness.
We have estimated the excess risk and hazard ratios of new clinical sequelae attributable to SARS-CoV-2 among adults 18 to 65 years old during the post-acute phase of the SARS-CoV-2 infection. This analysis includes a large generalizable sample of commercially insured adults, has ascertained objective outcomes utilizing valid International Classification of Diseases Tenth Revision (ICD-10) codes in claims, and has the power to both detect rare diagnoses and evaluate associations across subgroups.
Methods
Data Source
We conducted a retrospective cohort analysis utilizing three data sources within the UnitedHealth Group Clinical Discovery Database: de-identified administrative outpatient and inpatient claims, outpatient lab results for SARS-CoV-2, and a hospital admissions database updated daily with patients admitted with primary, secondary, or tertiary diagnosis of COVID-19. Quality control efforts for data were applied (eAppendix A). All data were for commercially enrolled patients from a single large national health insurance provider in the United States. This research was determined to be exempt from human research regulations by the UnitedHealth Group Office of Human Research Affairs (Action ID: 2020-0076-01).
Study Population
Individuals Diagnosed with SARS-CoV-2
This group consisted of individuals 18 to 65 years old with continuous health plan enrollment from January 1, 2019 to first date of one of the following, which was denoted as the index date:
primary, secondary, or tertiary diagnosis of COVID-19 identified by administrative claims with ICD-10 code U07.1 or either B34.2 or B97.29 before April 1, 2020 OR
documentation of a positive PCR test within an outpatient lab dataset OR
hospitalization due to COVID-19 identified from a hospital admissions database with admitting diagnosis code or primary, secondary, or tertiary diagnosis code U07.1 or U07.2.
We excluded individuals with a positive SARS-CoV-2 antibody serology without documented infection (n= 28,810) because index dating of illness was not possible. We also excluded individuals with a diagnosis code of B34.2 or B97.29 on or after April 1, 2020 (n=24,865) and individuals hospitalized due to suspected COVID-19 but missing diagnosis code U07.1 or U07.2 in primary, secondary, or tertiary position (n=1,247). A flow chart with details of population sampling is provided in Figure 1.
2020 Comparator Group
The 2020 comparator group consisted of individuals 18 to 65 years old without a COVID-19-related clinical diagnosis, positive PCR test, or hospitalization due to COVID-19 in 2020. Continuous health plan enrollment was required from January 1, 2019 to a randomly assigned index date drawn from the SARS-CoV-2 infection group.
2019 Comparator Group
We created this historical comparison group to account for possible ascertainment bias due to lower utilization of healthcare services during the 2020 pandemic. Individuals 18 to 65 years old were required to have continuous health plan enrollment from January 1, 2018 to a randomly assigned month and day in 2019 drawn from the SARS-CoV-2 infection group.
Viral Lower Respiratory Tract Illness (vLRTI) Comparator Group
We created this historical comparison group to evaluate the clinical sequelae specific to SARS-CoV-2 infection, as many serious viral illnesses carry risk for developing post-acute illness morbidity. The vLRTI group included individuals 18 to 65 years old with a diagnosis of any one of the following: influenza (J09, J10, J11), non-bacterial pneumonia (J12, J18.9), acute bronchitis (J20), acute lower respiratory infection (J22), and COPD with acute lower respiratory infection (J44.0) between January 1, 2017 and October 31, 2017, January 1, 2018 and October 31, 2018, or January 1, 2019 and October 31,2019. We included COPD exacerbation since cases are typically virally induced and identified only with a J44 code. We defined the index date as the first date of diagnosis for vLRTI. Individuals were required to have continuous enrollment from January 1, 2016, January 1, 2017, or January 1, 2018 to the index date, respectively.
Main Outcomes
We utilized ICD-10 codes to identify new clinical diagnoses from the administrative claims data between January 1, 2020 and October 31, 2020. See eTable 1 for ICD-10 classification details. We created domain clusters based on clinically similar diagnoses, and we included atopic dermatitis as a negative control.(16)
Study Variables
We used administrative claims between January 1, 2019 and 30 days prior to index date to ascertain length of hospital stay, prior clinical conditions, history of comorbidities derived from the Charlson Index and Elixhauser score, and prior PCP and specialist visits. Demographic, clinical, and testing data were obtained from administrative claims between January 1, 2020 and October 31, 2020. We derived zip code-specific socioeconomic status scores (SES) and proportions of White, African American and Hispanic populations by zip code.(17) Missing values of zip-code derived variables (SES n=11,713; race n= 444,080) were imputed with the median of non-missing values for these variables. Since all other variables were derived from administrative claims and the continuous enrollment criteria were applied, clinical events without a claim were considered to have not occurred and were given a zero value.
Follow-Up
Follow-up periods for the primary analysis for SARS-CoV-2 infected individuals and comparator groups started at index date + 21 days and continued through to a diagnostic event, disenrollment from insurance plan (due to death or withdrawal), or end of study period (October 31, 2020 or October 31, corresponding year for the historical comparator groups), whichever occurred first. We performed a secondary analysis evaluating rates by month of follow-up starting 30 days prior to index date and ending at 6 months follow-up.
Public and Patient Involvement
This retrospective analysis did not directly involve patients or the public in the development of the research question or conduct of the analysis. A patient reviewer provided insightful comments and contributed to the expansion of the outcomes reported in this manuscript. The content of this paper will be summarized and made publicly available on the United In Research (unitedinresearch.com) platform sponsored by OptumLabs.
Statistical Methods
Propensity-score matching
We used propensity-score matching to create three cohorts similar on baseline characteristics and relevant confounders.(18) We constructed a propensity score for every individual based on 108 variables (age, numeric; gender, male Y/N; zip code-specific SES, numeric; proportion of White, African American and Hispanic populations in zip code, numeric; state, 10 binary variables in top 10 states; index month, Jan-Mar, Apr, May, Jun, Jul, Aug, Sept Y/N; pre-existing comorbidities, Y/N; total inpatient length of stay in the previous year, numeric in days; prior PCP, cardiologist, and nephrologist visit day count, numeric; and prior clinical conditions, Y/N) using logistic regression with ridge penalty.(19,20)
Due to the large study population (about 10 million), we did not perform conventional 1-1 nearest neighbor matching. We generated 40,000 bins based on the SARS-CoV-2 infected group’s propensity scores. In each small bin, we created equal-size sets of individuals with SARS-CoV-2 and the comparator group. We matched within the bins and then combined all sets to form the propensity-matched groups. This process was repeated for each of the comparator groups (2020, 2019, vLRTI). We achieved three balanced cohort groups with this approach, as shown in eTable 1a and eFigure 1a-c.
Data Analysis
We evaluated demographic and clinical factors using the t-test and Pearson chi-square test for the unmatched population to compare numeric and categorical data, respectively. We calculated the proportion of individuals with follow-up time after 21 days post-index date who were diagnosed with no clinical sequelae, one, or more than one clinical sequelae during the post-acute period. Symptoms such as fatigue, myalgia, and anosmia as measured by ICD-10 codes were included in this post-acute assessment of clinical sequelae.
When calculating the risk for a specific incident outcome in the post-acute phase, individuals who had the diagnosis of interest prior to index date (in the SARS-CoV-2 and respective comparator group) or individuals (and their comparator match) who experienced the diagnosis of interest during the acute phase were removed from the post-acute calculation. Therefore, the denominator for the incidence of each outcome included only those at risk for the diagnosis of interest at index date+21 days. We reported the risk difference as the difference between the cumulative incidence calculated by the Kaplan-Meier estimator at the 120th day from time origin (index date + 21 days) in the SARS-CoV-2 group and the comparator group, multiplied by 100. We calculated the 95% confidence intervals and p values using a pairwise bootstrap method, with a one-sided test We tested the proportional hazards assumption using Schoenfeld residuals.(21) Hazard ratios and 95% confidence intervals were estimated by fitting Cox proportional hazards models using robust variance estimator with clustering for matched pairs. We used the Wald-test to evaluate significance at the 0.05 level. Individuals were censored based on the outcome of interest, disenrollment from insurance plan, or end of the study period (October 31, 2020).
In a secondary analysis, we created seven 1-month time intervals (−1 month prior to index date to 6 months after index date). Within each time interval, we only included the matched pairs who are both at-risk for the diagnosis of interest at the beginning of each time interval. We estimated the hazard ratios and confidence intervals by fitting Cox proportional hazard models using robust variance estimator with clustering for matched pairs. Individuals were censored based on diagnosis of interest, disenrollment, or end of the time interval.
Due to the large number of comparisons, we applied the Bonferroni correction for all p values and confidence intervals, by multiplying p values by N and estimating confidence intervals, where N= 51 and is the number of clinical sequelae we tested.
For all data analysis, we used Python with scikit-learn, statsmodels, lifelines, and scipy libraries and R with survminer,(22) survival,(23) glmnet,(24) and stats libraries.(25)
Subgroup Analysis
We performed four stratified analyses [age (18-≤34, >34-≤50 >50), gender (M/F), any pre-existing clinical comorbidity (Y/N), and hospitalization due to COVID-19 (Y/N)] with the SARS-CoV-2 and 2020 comparator group. We constructed a propensity score and performed propensity score matching within each stratum before calculating the risk difference or hazard ratio. To test for interaction, we evaluated the significance of the fitted coefficient for the interaction term in a model including main effect variables, with a Bonferroni correction (N=51*number of levels in each subgroup analysis).
Sensitivity Analysis
We evaluated the robustness of the post-acute phase definition by varying the cut points from index date at 14 and 28 days. We also considered a period effect due to changes in the availability of testing and advancements in management and treatment over time. We evaluated differences by two periods (January–June; July–October). Finally, we evaluated medical utilization during the post-acute phase by type of diagnostic method (PCR-positive, clinical diagnosis, hospitalization).
Results
Among 9,247,505 individuals meeting the study criteria in 2020, we identified 266,586 individuals (2.9%) with SARS-CoV-2 infection. After propensity score matching, we identified 266,586 matched pairs for the primary (2020) and secondary (2019) comparison group (100% of SARS-CoV-2 individuals matched). We identified 244,276 matched pairs for the vLRTI comparison group (91.6% of SARS-CoV-2 individuals matched).
Individuals with SARS-CoV-2 infection were more likely than their unmatched 2020 and 2019 comparators to be female, have lower SES index, live in a zip code with majority black or Hispanic individuals, have a pre-existing comorbidity or at-risk condition, have an in-patient hospital stay during the preceding year, visit a PCP and other specialists more often, and live in the Northeast or Southern U.S (areas of high incidence during the study period; all p-values<0.05) (Table 1). Among those individuals with SARS-CoV-2, 8.2% were hospitalized and 1.1% were admitted to the ICU. We observed different patterns between the SARS-CoV-2 individuals and the vLRTI comparison group who were more likely to be older, female, have a comorbidity, smoke, visit a PCP or cardiologist, and live in the South. Following propensity score matching, most of these differences were resolved, although some small differences between the matched SARS-CoV-2 infected group and the vLRTI comparator remained significant (eTable 1a). Despite these minor differences, balance was achieved overall as shown by the standardized mean differences between matched groups by key variables at less than 10% (eFigure 1a-c).
Among the matched SARS-CoV-2 infected individuals identified in our study with follow-up time past 21 days from index date (n=193113), 85.98% did not have any new post-acute clinical sequelae that required medical attention during their follow-up, 10.01% had one new sequelae that required medical attention, and 4.01% had more than one type of new sequelae (Table 2). The proportion of individuals diagnosed with any new clinical sequelae during the post-acute phase was higher among the SARS-CoV-2 infected group than the three comparator groups, although the differences were smaller when compared to the vLRTI comparator group.
Estimates of risk difference by type of new clinical sequelae were calculated among individuals infected with SARS-CoV-2 who were still at-risk starting at 21 days post-index date (n=193113 for 2020 comparison). Follow-up time was considered through day 141 after index date (120 days from the start of post-acute period and 73.5 percentile of the follow-up distribution, with median = 87 days [IQR 45– 124 days]). The most common new clinical outcomes among the SARS-Cov-2 group (incidence ≥0.1%) are summarized in Figure 2, with less common outcomes in eTable 2a. Symptoms are included separately in eTable 2b and were not represented in Figure 2. Overall, the excess risk attributable to SARS-CoV-2 infection is low for incident diagnoses (0.02%–2.26%) 4 months from the start of the post-acute phase (Figure 2). The elevated risk, however, is consistently observed for many outcomes across all three comparison groups (2020, 2019, and vLRTI; eTable 2a).
Despite the small absolute risk attributable to SARS-CoV-2 infection, the HRs between SARS-CoV-2 infected individuals and 2020 comparator group during the post-acute phase can be large (significant HRs between 1.24 to 25.65; eTable 2c-c; Figure 2). When evaluating rates over time, HRs are highest in the first month of index date, but remain significantly elevated through month 6 suggesting the hazard for new clinical sequelae is sustained months after initial SARS-CoV-2 infection (eTable 3). Elevated risk seen 30 days prior to index date was likely due to a delay in testing or documentation of a confirmed case among symptomatic individuals. Select graphs of the cumulative hazards for the most common or most significant clinical sequelae are detailed in Figure 3a-c.
Excess risk for developing many new outcomes during the post-acute phase increased significantly with age (Figure 4a; eTable 4a). Risk for clinical sequelae was greatest in individuals >50 years old, however, the absolute risk among young adults 18 to 34 years old remained statistically significantly elevated for some conditions, albeit modestly so. Risk for developing any mental health outcome was significantly elevated regardless of age (pinteraction =0.35).
Excess risk for new clinical sequelae during the post-acute phase rarely differed by gender, apart from fatigue and anosmia (more commonly diagnosed in women) and myocarditis, hypercoagulability, DVT, kidney injury, and sleep apnea (more commonly diagnosed in men) (Figure 4b; eTable 4b). With a few exceptions, individuals with pre-existing conditions (Figure 4c; eTable 4c) and individuals hospitalized with COVID-19 (Figure 4d; eTable 4d) had greater excess risk for developing new clinical sequelae due to SARS-CoV-2 infection.
We did not see a significant period effect for most outcomes (eTable 4e). In our sensitivity analysis, the RDs increased when the post-acute phase was shortened to index date + 14 days (eTable 5a). Similar RDs were observed for index date + 21 days and index date + 28 days (eTable 5b), suggesting index date + 21 days is a reasonable start to the post-acute phase of illness.
Discussion
Our retrospective study conducted within a large administrative database evaluated the excess risk of receiving a new diagnosis for a wide range of clinical sequelae during the post-acute phase of SARS-CoV-2 infection among commercially insured adults 18 to 65 years old. We report 14% of individuals ≤65 years of age who were infected with SARS-CoV-2 had a diagnosis of at least one new clinical sequelae that required medical attention during the post-acute phase of the SARS-CoV-2 infection. Our results confirm excess risk for specific types of sequelae during the 4 months following the start of the post-acute phase (index date + 21 days). Our analysis also demonstrates that while risk increases with age, pre-existing conditions, and hospitalization due to COVID-19, younger adults (≤50y), individuals with no pre-existing conditions, and non-hospitalized SARS-CoV-2 infected individuals are still at risk for new clinical sequelae in the post-acute phase. Finally, our results suggest risk for some clinical sequelae such as anxiety are elevated regardless of age and pre-existing condition.
When considering risk attributable to SARS-Cov-2 infection, several clinical sequelae remained elevated among survivors during the post-acute phase regardless of comparison group (2020, 2019, vLRTI). These outcomes included chronic respiratory failure, cardiac irregularities such as tachycardia and arrythmia, hypercoagulability in the form of pulmonary embolism and DVTs, anxiety, encephalopathy, peripheral neuropathy, amnesia, diabetes, liver test abnormalities, myocarditis, and fatigue. Many of these outcomes have been previously reported in case studies or observational studies during the acute phase of COVID-19 (including tachycardia,(26) hypercoagulability,(27) mental health outcomes,(28) encephalopathy,(29,30) diabetes,(31) amnesia(32)). A few studies have also highlighted persistent symptoms or new clinical diagnoses during the post-acute phase,(32–38) although few report on a full range of new clinical diagnoses across multiple organ systems in such a large population. Studies of individuals infected with other coronaviruses have shown arrythmia/tachycardia sequelae in SARS survivors(39,40) and central and peripheral nervous system sequelae among both SARS and MERS survivors.(41) The proportion of SARS-CoV-2 infected individuals with a new diagnosis of encephalopathy (0.23) and peripheral neuropathy (0.31) in our study (encephalopathy RD range 0.09-0.19 and peripheral neuropathy RD range 0.12-0.16) was closer to the higher rates of CNS and PNS sequelae seen with MERS.(41) Anxiety was reported in another study to be the most common mental health sequelae among individuals with COVID-19 diagnosis (HR=1.59–2.62) at 14–90 days post-SARS-CoV-2 infection.(42) Our results on mental health outcomes (anxiety; HRs 1.24–1.54) are similar, though absolute risks were lower in our study population.
We also identified excess risk for clinical sequelae that are not unique to SARS-CoV-2 and are commonly seen with other serious viral infections. Still, the magnitude of relative risk for these incident sequelae (e.g, hypertension, stroke, and kidney injury) are nearly twice what is typically seen in the general population in an ordinary year. Because of the scale of the SARS-CoV-2 pandemic in the U.S., these findings suggest additional healthcare resource planning is needed to address the health complications among survivors.
This is the first study, to our knowledge, to have the power to quantify the small, but non-trivial, risk among younger, healthier adults. Within our study population of adults ≤65, greater than 90% of individuals recovered at home. Younger individuals may experience complications due to COVID-19, especially if they are hospitalized and have pre-existing conditions(43), but few studies have reported new outcomes during the post-acute phase of illness among milder cases.
Fatigue is often the most commonly reported symptom during the post-acute phase, with self-reported estimates on surveys ranging from 13.6%(36) to 77.7%(44) depending on hospitalization status and length of follow-up. While fatigue was also the most common post-acute diagnosis in this cohort (4.64%), our estimates only reflect fatigue that was brought to medical attention and noted by the physician. Most symptoms, when associated with a primary viral illness like COVID-19, are frequently not coded by the clinician as they are presumed to be part of the infectious process. Moreover, many individuals with COVID-19 may not seek medical care for a symptom unless it was unusual in duration or severity. ICD-10 codes have been shown to be valid for many clinical diagnoses (45), but an inaccurate and unreliable method for ascertaining symptoms.(46) Therefore, we intentionally did not evaluate a broad list of symptoms in this study as we anticipate the true incidence of symptoms are not accurately reflected when ascertained by ICD-10 codes and would be best ascertained through patient-reported surveys.
We were also not able to ascertain race or ethnicity at the individual level in commercial claims. Additional research is needed to better understand how race and ethnicity modify risk for long-term clinical sequelae. Moreover, our population required continuous enrollment from January 2019 until index date, so individuals who may have been uninsured were not included in our sample.
We may have misclassified individuals due to the retrospective nature of our study and the inherent limitations of using claims to define variables. First, we relied on a lab results database to identify PCR-positive individuals. SARS-CoV-2 infected individuals may have been misclassified into the 2020 comparator group if they tested positive outside our network and were not symptomatic enough to require medical attention. This misclassification would likely have biased our findings towards the null. Second, due to our continuous enrollment requirement for study eligibility, we captured clinical diagnoses and comorbidities that occurred in the previous year and the year of the index date up until 30 days prior to the index date. It is possible that what we observe to be an incident or new diagnosis in the post-acute phase for a specific condition might be an exacerbation of a pre-existing condition that did not receive medical attention within the 13-to 22-month window created by our continuous enrollment criteria. Moreover, we may have misclassified some individuals in the no pre-existing comorbidities subpopulation, potentially inflating the incidence within that subgroup. This misclassification is likely minimal, however, because most medical conditions and comorbidities require individuals to participate in at least annual check-ups with PCPs or specialists. Third, we may not have captured a comprehensive list of all ICD-10 codes for each outcome, although the most common ones were included. Finally, since SARS-CoV-2 is a novel virus, we anticipate physicians may have underestimated the clinical significance of some outcomes, especially early in the pandemic. Our sensitivity analysis, however, does not support this temporal lack of documentation, as we observed no significant difference in risk by period.
Our small excess risks may be due to increased medical attention following SARS-CoV-2 infection, such that it is the ascertainment (diagnosis) rather than the incidence of a new diagnosis that is triggered by SARS-CoV-2 infection. This ascertainment bias is unlikely to fully explain our results, however, as individuals with vLRTI should receive similar medical attention post-acute illness and yet have fewer visits to PCP than any SARS-CoV-2 infected individual, regardless of diagnostic method (eFigure 2).
This retrospective analysis was strengthened by its large sample size that powered the assessment of multiple and rare outcomes simultaneously overall and across multiple subgroups, utilized valid ICD-10 codes to ascertain outcomes, and included a broad definition of SARS-CoV-2 infection (positive PCR results or clinical diagnosis), producing a more generalizable sample of individuals than has been previously studied.
With close to 70 million cases worldwide and rising, the number of SARS-CoV-2 infected survivors with potential post-COVID sequelae will continue to grow. To optimally manage these patients, it is important to understand the incidence and the natural history of each of these sequelae. Our results provide clinicians a comprehensive understanding of the excess risk for over 50 clinical morbidities across multiple organ systems affecting adults ≤65 years of age during the post-acute phase of SARS-Cov-2. Knowing the magnitude of risk for rare and common clinical sequelae may enhance diagnostic evaluation and management for individuals infected with SARS-CoV-2. Moreover, our results help providers and other key stakeholders anticipate the scale of future health complications as well as improve healthcare resource utilization planning.
Conclusion
We observed 14% of individuals with SARS-CoV-2 infection diagnosed with a new clinical sequelae that required medical attention during the post-acute phase of illness. Elevated and sustained risk for clinical sequelae during the 4 months following acute illness, particularly but not exclusively among individuals with pre-existing conditions or hospitalized due to COVID-19, was observed. Additional follow-up is needed to determine resolution of risk over time.
Data Availability
The data are proprietary and are not available for public use but, under certain conditions, may be made available to editors and their approved auditors under a data use agreement to confirm the findings of the current study.
Transparency Statement
The lead author, Sarah Daugherty, affirms that this manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.
Data Sharing Statement
The data are proprietary and are not available for public use but, under certain conditions, may be made available to editors and their approved auditors under a data use agreement to confirm the findings of the current study.
Contributors and sources
SED, senior researcher, epidemiologist
YG, director, data scientist
KH, National Medical Director for Clinical Intelligence, Physician
JS, principal data scientist, statistician
MCD, data scientist, data scientist
KGJ, data scientist, data scientist.
ML, Professor, epidemiology
KC, Executive Director Translational Research OptumLabs, clinician
SED, YG, KH, JS, MCD, KGJ, ML, KC played an active role in all aspects of the development of the research, including design, conduct, and interpretation of the data; preparation, review, and approval of the manuscript; and decision to submit the manuscript for publication. A description of the data source is included in Supplemental Appendix A.
Copyright/license for publication
The Corresponding Author has the right to grant on behalf of all authors and does grant on behalf of all authors, a worldwide licence to the Publishers and its licensees in perpetuity, in all forms, formats and media (whether known now or created in the future), to i) publish, reproduce, distribute, display and store the Contribution, ii) translate the Contribution into other languages, create adaptations, reprints, include within collections and create summaries, extracts and/or, abstracts of the Contribution, iii) create any other derivative work(s) based on the Contribution, iv) to exploit all subsidiary rights in the Contribution, v) the inclusion of electronic links from the Contribution to third party material where-ever it may be located; and, vi) licence any third party to do any or all of the above.
Contributor and guarantor information
The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted. SED and YG serve as joint guarantors who accept full responsibility for the work and/or the conduct of the study, had access to the data, and controlled the decision to publish.
Competing interests statement
All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and SED, YG, KH, JS, MD, KGJ declare that they have had no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted. KC declares consulting from Pfizer. ML declares honoraria/consulting from Merck, Sanofi-Pasteur, Bristol Myers-Squibb, and Antigen Discovery; research funding (institutional) from Pfizer, and an unpaid scientific advice to Janssen, Astra-Zeneca, One Day Sooner, and Covaxx (United Biomedical).
Funding
The study was funded by OptumLabs, the research and development arm of UnitedHealth Group, and the authors Drs. Daugherty, Cohen, Heath, and Mr. Guo, Ms. Dasmariñas, Mr. Jubilo, and Mr. Samranvedhya are full-time employees at UnitedHealth Group.
Acknowledgements
We would like to thank Andrea Rotnitzky and Jamie Robins for reviewing and providing feedback on our methods.