Viral and host factors associated with SARS-CoV-2 disease severity in Georgia, USA

While SARS-CoV-2 vaccines have shown strong efficacy, their suboptimal uptake combined with the continued emergence of new viral variants raises concerns about the ongoing and future public health impact of COVID-19. We investigated viral and host factors, including vaccination status, that were associated with SARS-CoV-2 disease severity in a setting with low vaccination rates. We analyzed clinical and demographic data from 1,957 individuals in the state of Georgia, USA, coupled with viral genome sequencing from 1,185 samples. We found no difference in disease severity between individuals infected with Delta and Omicron variants among the participants in this study, after controlling for other factors, and we found no specific mutations associated with disease severity. Compared to those who were unvaccinated, vaccinated individuals experienced less severe SARS-CoV-2 disease, and the effect was similar for both variants. Vaccination within 270 days before infection was associated with decreased odds of moderate and severe outcomes, with the strongest association observed at 91–270 days post-vaccination. Older age and underlying health conditions, especially immunosuppression and renal disease, were associated with increased disease severity. Overall, this study provides insights into the impact of vaccination status, variants/mutations, and clinical factors on disease severity in SARS-CoV-2 infection when vaccination rates are low. Understanding these associations will help refine and reinforce messaging around the crucial importance of vaccination in mitigating the severity of SARS-CoV-2 disease.


INTRODUCTION
Vaccinations against SARS-CoV-2 have undeniably demonstrated high efficacy in preventing COVID-19 infections and improving disease outcomes [1][2][3] , but their impact is challenged by the emergence of new variants carrying immune evasion mutations and the waning of immune responses over time 4 .Partly due to these issues, public trust in COVID-19 vaccines has diminished with each round of booster recommendations, especially in the southeast United States (US) 5 .Throughout the SARS-CoV-2 pandemic, the state of Georgia has consistently held one of the lowest rates of vaccine coverage in the nation 6 .One important approach to addressing vaccine hesitancy is to conduct studies -and disseminate their resultsto populations with low vaccine uptake.
In order to elucidate viral and host factors that contribute to disease outcomes in the context of low vaccination rates, we analyzed demographic and clinical data from 1,957 individuals in the state of Georgia.We also analyzed full viral genome sequences from 1,185 of these individuals to examine the influence of variants and mutations on disease severity.We leveraged a large study population and extensive demographic, clinical, and sequence data to .define factors associated with SARS-CoV-2 disease severity in a region marked by low vaccination rates.

Clinical and demographic factors differ by SARS-CoV-2 vaccine status
Between May 2021 and May 2022, we identified 1,957 individuals who tested positive for SARS-CoV-2 within the Emory Healthcare system in Atlanta, Georgia.The majority of participants (66%) were residents of the Metro Atlanta area (Table 1, Figure 1A).The median age was 51 years (Interquartile range [IQR]=36,65), and 56% of individuals were female.The racial distribution of participants was predominantly Black (58%) or White (31%).Individuals experienced a range of clinical presentations and outcomes, from asymptomatic infection (15%) to death (2.8%) (Table 2).Among the 967 individuals in this study who were hospitalized, 625 (65%) of the hospitalizations were due to COVID-19.
During the period of this study, the state of Georgia had the 7 th lowest vaccination rate in all 50 United States and the District of Columbia, with 55.1% of the population vaccinated (Figure 1B).In our study, a slightly lower proportion (48%) of individuals were vaccinated, in part based our study was designed to ensure inclusion of unvaccinated individuals.Vaccinated individuals were significantly older than unvaccinated individuals (median age of 58 years vs 43 years, p<0.001) (Table 1).We observed a large disparity in vaccine status by race; among the unvaccinated participants, 70% were Black and 21% White, while among the vaccinated participants, 45% were Black and 43% were White (Table 1).Vaccination status was not significantly associated with any other demographic variable evaluated.
We found that vaccinated individuals were more likely to have underlying medical comorbidities than unvaccinated individuals.Notably, 28% of vaccinated individuals were immunocompromised, compared to 17% of unvaccinated individuals (p <0.001,Table 2).
Vaccinated individuals were less likely to be pregnant (1% vs. 6%, p <0.001) (Table 2, Table S1), however, at the time of the study, vaccines were not yet approved for pregnant individuals.These findings are consistent with higher rates of vaccination in individuals with medical comorbidities.
We also found differences in clinical symptoms by vaccination status.Compared to unvaccinated individuals, those who were vaccinated were less likely to have fever (52% vs. 42%, p <0.001), chills (38% vs. 32%, p = 0.02), nausea/vomiting (35% vs. 25%, p <0.001), and shortness of breath or difficulty breathing (49% vs. 39%, p <0.001); they were more likely to have sore throat (17% vs. 22%, p = 0.02) and runny nose/nasal congestion (27% vs. 45%, p <0.001) (Table S1).These findings are consistent with milder disease in vaccinated individuals.SARS-CoV-2 C T value by qRT-PCR is often used as a rough proxy for viral load, and we found several key factors associated with SARS-CoV-2 C T .Because multiple qRT-PCR assays were used within the Emory Healthcare system during this time, we controlled for assay variability (Table 3).Vaccinated individuals had a slightly lower C T than those who were unvaccinated (-0.70,SE=0.26, p =0.01).Though this finding seems counterintuitive, we attribute it to vaccinated individuals presenting for testing earlier after symptom onset, compared to unvaccinated individuals (Table 2).Symptom duration was significantly inversely associated with C T (Table 3).Specifically, after controlling for other variables, and compared to asymptomatic individuals, those who were tested within 0-3 days after symptom onset had a lower C T value by 3.0 cycles (Standard Error [SE]=0.55,p <0.001) and those who were tested within 4-7 days had a lower C T value by 1.62 cycles (SE=0.61,p = 0.01).Those who were tested 8 or more days from symptom onset did not have a significantly different C T than those who were asymptomatic (p = 0.77) (Table 3).No significant differences in C T were observed between variants, after adjusting for other factors (Table 3).

Variant frequency differs between vaccinated and unvaccinated individuals
We sequenced full SARS-CoV-2 genomes from residual nasopharyngeal swab samples from 1,185 individuals.The minimum genome coverage of the samples was 76% and the median sequencing depth was 1,804 (Supplementary Data File).Sequences were primarily Delta (68%) and the BA.1 sublineage of the Omicron variant (23%), followed by other Omicron sublineages (5.1%), Alpha (2.5%) and less than 1% each of Beta, Gamma, Mu, A.2.5, and B.1 (Table 4).In Georgia, during the time of this study (May 2021-May 2022), Delta accounted for 55% of infections, Omicron for 40%, and Alpha for 2.5% (Figure 1C, Table S2).Therefore, our study included a somewhat higher proportion of Delta and a lower proportion of Omicron than was circulating in the state.The distribution of all other variants aligned with the overall variant distribution observed in Georgia (Table S2, Table 4).
The distribution of SARS-CoV-2 variants in our study was different for vaccinated and unvaccinated individuals (p <0.001) (Table 4).Omicron had a higher frequency in vaccinated individuals (32%) than unvaccinated (25%), whereas Alpha and Delta were more common in unvaccinated compared to vaccinated individuals (Table 4).These differences correspond with the timing of each variant's circulation compared to vaccine rollout, though decreased vaccine effectiveness against Omicron may also contribute 21 .
In addition to the frequency of VOCs among vaccinated and unvaccinated individuals, we investigated the frequency of individual mutations.Within each VOC -Alpha, Delta, and Omicron -no sequence characteristics, including mutations, deletions, and insertions, were different between viruses infecting vaccinated and unvaccinated individuals (Figure 2, Figure S1).However, the number of non-lineage defining mutations was lower in vaccinated compared to unvaccinated individuals (Figure 1D) suggesting that vaccination may have an impact on viral diversity within a host.Phylogenetic analysis demonstrated that sequences from vaccinated individuals were intermixed with sequences from unvaccinated individuals, further confirming no distinct features of post vaccination infections (Figure S2).
In-depth metagenomic analysis of 513 samples did not reveal any viral co-infections (Supplementary Data File).

Age, underlying conditions, and vaccination status are associated with disease severity
We evaluated associations between disease severity and demographic characteristics, underlying health conditions, vaccination status, and SARS-CoV-2 variant.Disease severity was defined according to the WHO clinical progression scale 22 .Mild disease included asymptomatic infection and symptomatic infection without hospitalization.Moderate disease included hospitalized individuals without oxygen therapy or oxygen by mask or nasal prongs.Severe disease included the use of oxygen by noninvasive or high flow, intubation and mechanical ventilation, vasopressors, dialysis, or extracorporeal membrane oxygenation.Death included inhospital deaths directly linked to COVID.
Using an adjusted multinomial logistic regression model that included demographic characteristics, underlying conditions, vaccination status and variant, we found that age, certain underlying medical conditions, and time since most recent vaccination were significantly associated with disease severity (Table 5).Underlying health conditions significantly associated with disease severity included chronic lung disease, renal disease, and the use of systemic immunosuppressive therapy prior to hospitalization.Other conditions showed weak or nonsignificant associations with disease severity, including pregnancy, diabetes, liver disease, and autoimmune disease.The Omicron variant did not show a significant association with disease severity compared to the Delta variant in the adjusted model although an association was observed in an unadjusted model (Table 5, Table S3).Finally, compared to unvaccinated individuals, vaccination within 91-180 days (about 3 -6 months) was associated with lower odds of moderate disease, severe disease, and death (Table 5).There were similar effects across all disease severity outcomes when vaccination occurred between 181 -270 days (about 6 to 9 months).When vaccination occurred more than 270 days ago, however, there were no significant differences in disease severity when compared to the unvaccinated group.Additionally, vaccination that had occurred within the past 90 days (about 3 months) was associated with lower odds of moderate disease relative to mild disease but was not associated with lower odds of severe disease or death.In short, vaccinations were most protective against moderate disease, severe disease, and death when they occurred within the prior 3-9 months.Overall, after controlling for multiple host and viral factors, we found that age, chronic lung disease, renal disease, and immunosuppressive therapy increased the odds of progressively more severe COVID-19 disease, while vaccination decreased the odds and SARS-CoV-2 variant had no effect. .

DISCUSSION
Our comprehensive analysis of host and viral factors associated with SARS-CoV-2 disease severity in a setting with low vaccination rates led to several key findings.First, age and underlying health conditions -especially chronic lung disease, renal disease, and the use of immunosuppressive therapy -were associated with more severe disease and death.Second, SARS-CoV-2 variant and viral mutations were not associated with disease severity in this study population, which was comprised of individuals who sought medical care.And third, vaccination was protective against severe outcomes for both Delta and Omicron variants to a similar degree.
Unique features of our study included the analysis of a large number of SARS-CoV-2 full viral genome sequences linked to extensive clinical and demographic data, and our focus on a relatively under-studied region of the U.S.
Georgia is an important proxy for the southeastern U.S. and other populations with high numbers of vaccine refusals, inequitable access to healthcare, and low insurance coverage [23][24][25] .
Emphasizing the positive impact of SARS-CoV-2 vaccination among this population, similar to others in the U.S. 26,27 , is critical as new variants emerge.It is also important to note that among the individuals in this study who contracted SARS-CoV-2 after being vaccinated, a greater proportion reported milder upper respiratory symptoms like sore throat and runny nose, while a lower percentage experienced more severe symptoms such as nausea/vomiting, fever, and shortness of breath/difficulty breathing, in comparison to unvaccinated individuals.Vaccinations were most protective against severe COVID outcomes when they occurred within the prior 3-9 months.This finding has timely implications on a national level, given persistently low vaccine uptake, especially of the bivalent SARS-CoV-2 vaccine 5 , and the need for ongoing vaccine updates targeting emerging variants 4 .Results from our study will help emphasize the benefits of vaccination to the public as a means of safeguarding against severe COVID outcomes.
Our results also indicated the importance of demographic and clinical factors associated with SARS-CoV-2 disease severity, despite vaccination.Age was a key risk factor; after accounting for vaccination status, demographic factors, health conditions, and SARS-CoV-2 variant, our analysis revealed that for each additional year of age, the odds of experiencing more severe outcomes compared to mild disease increased by 5%.The association between age and disease severity has been consistently observed, particularly among individuals aged 65 and above [7][8][9][10] .Thus, relying solely on vaccination may be insufficient for reducing disease severity and mortality among older individuals.U.S. Census Bureau data indicates that the population is aging, with Georgians aging at an even faster rate 28 , underscoring the need for additional preventative and treatment measures.
In addition to age, we found that chronic lung disease, renal disease, and the use of immunosuppressive therapy also increased the odds of experiencing moderate and/or severe infection and/or death.In previous studies, cardiovascular disease, diabetes, chronic respiratory conditions, obesity, and compromised immune systems have also been found to increase the risk of severe illness [10][11][12][13][14][15][16][17] .Interestingly, we found that cardiovascular disease was only associated with moderate disease (without any association with severe disease or mortality), while diabetes and being overweight were not associated with disease severity in our final multivariate model.One explanation for this discrepancy could be that these two conditions serve as indicators for factors that we controlled in our study.It is noteworthy that when we did not adjust for any factors, these conditions were significantly associated with increased odds of severe disease.
. It is surprising that we found no difference in disease severity between Delta and Omicron variants, since multiple prior studies have found that the risk of hospitalization, ICU admission, and mortality vary by variant 11,18,19,29 .This discrepancy may be due to the fact that all individuals in our study sought medical care, so we did not include individuals with minimal symptoms.Our observation that vaccination was similarly protective for individuals with Delta and Omicron is consistent with results from another recent study among hospitalized patients 30 .
We did not find viral factors associated with post-vaccine infection.Most post-vaccine infections were caused by the predominant lineage of the time.Within each variant (Delta and Omicron), no SARS-CoV-2 SNPs, deletions, or insertions were more common in vaccinated individuals than unvaccinated individuals.These results are different from a prior study of similar size, which found more resistance mutations (e.g.L452* and E484*) in vaccinated compared to unvaccinated individuals in the pre-Omicron era 16 .Our negative finding likely reflects the challenge of identifying the effect of an individual virus mutation in an increasingly complex immune landscape.Interestingly, we found that vaccinated individuals had fewer nonlineage-defining SNPs than unvaccinated individuals, suggesting less diversity and potentially less viral evolution within vaccinated individuals.This is consistent with a recent study investigating within-host genetic diversity of SARS-CoV-2 in unvaccinated and vaccinated individuals. 31study had several limitations: We only included individuals who presented to care, thus skewing our study population towards individuals with more severe disease than the general population.In addition, due to our study design, we could not collect reliable data regarding prior SARS-CoV-2 infection(s) thus, we were unable to account for natural immunity.Given the retrospective study design, there may be residual confounding, however we adjusted for important variables such age, demographics, pre-existing health conditions and vaccination status.
In summary, our findings underscore the critical role of vaccination status, age, and medical comorbidities -especially immunosuppression, chronic kidney disease, and chronic lung disease -in determining disease severity and outcomes among SARS-CoV-2 infected individuals, regardless of virus variant.We contribute valuable insights into the nuanced relationship between these factors, highlighting the importance of considering demographic, clinical, and genetic variables when evaluating disease severity.Ultimately, our results will be valuable in strengthening and reinforcing messaging around SARS-CoV-2 vaccination, especially in settings of low vaccine uptake.

Clinical and demographic data
This study was approved by the institutional review board at Emory University under protocol STUDY00000260, with a waiver of consent.All positive SARS-CoV-2 samples from Emory University Hospital Molecular and Microbiology Laboratories collected between 5/3/21 and 5/31/22 were reviewed for inclusion in this study.In addition to symptomatic testing, SARS-CoV-2 tests were administered before admission or outpatient procedure as part of Emory Healthcare System's universal SARS-CoV-2 screening.Individuals were considered vaccinated if they had received a complete vaccine series (2 doses of the Pfizer-BioNTech or Moderna vaccines or 1 dose of the Janssen vaccine) at least 14 days before their first positive test result.
Individuals were excluded from the study if they were partially vaccinated or reported out-of- illness, hospitalization, and disease outcome were obtained from the electronic medical record (EMR).In constructing symptom categories, systemic symptoms were defined as fatigue, fever, chills, rigors, myalgia, and headache.Gastrointestinal symptoms were defined as nausea/vomiting and diarrhea.Upper respiratory symptoms were defined as sore throat, runny nose, and nasal congestion.Lower respiratory symptoms were defined as cough, and shortness of breath/difficulty breathing.Immunosuppressed was defined as HIV, active cancer, autoimmune disease, or immunosuppressive therapy."Other" underlying conditions were defined as overweight, diabetes, renal, cardiovascular, pregnant, and liver disease.
Disease severity was defined according to the WHO clinical progression scale 22 .The scale includes mild disease-asymptomatic and symptomatic SARS-CoV-2 infection without hospitalization, moderate disease-hospitalization without oxygen therapy or hospitalization with oxygen by mask or nasal prongs, severe disease-hospitalization with use of oxygen by noninvasive or high flow, intubation and mechanical ventilation, vasopressors, dialysis, or extracorporeal membrane oxygenation, and death.Data management and cleaning were conducted in Excel v16.73 and SAS studio v3.81.

SARS-CoV-2 sequencing and analysis
Residual nasopharyngeal (NP) swab samples were obtained from the Emory University Hospital Molecular and Microbiology Laboratories.NP samples underwent RNA extraction, DNase treatment, and cDNA synthesis followed by metagenomic or amplicon-based library construction.For metagenomic sequencing, Nextera XT (Illumina) and Illumina sequencing were performed as previously described 32 .Amplicon-based sequencing was performed using the xGEN SARS-CoV-2 kit (IDT) as previously described 33 .
Consensus-level single nucleotide polymorphisms (SNPs) and insertions/deletions were identified using the Nextstrain web-based mutation calling tool 37 .
For phylogenetic analysis, 411,634 reference sequences, collected between May 1, 2021, and May 31, 2022, were downloaded from NCBI and were aligned with our study sequences to reference strains Wuhan/Hu-1/2019 and Wuhan/WHO/2019 using Nextalign within the Nextstrain v3.2.4 pipeline 7.This dataset was subsampled in Nextstrain using a custom scheme, in which crowd penalty was set to 0.0 to select 1000 sequences most genetically similar to our sequence dataset.Maximum likelihood phylogenetic trees were constructed using default settings of the Nextstrain SARS-CoV-2 Workflow with TreeTime v0.8.6 38 .

Statistical Analysis
Demographics, symptoms, underlying conditions, and outcomes were described using frequency distributions for categorical variables and medians and interquartile ranges for continuous variables.Subgroup differences were evaluated to compare individuals who were vaccinated with individuals who were not vaccinated, using chi-square tests and Fisher's exact tests for categorical variables and Wilcoxon rank-sum test for continuous variables.
Prior to testing, the association between clinical factors and mean C T was compared between qRT-PCR testing platforms using ANOVA.There were significant differences between the platforms indicating that platform is an important covariate to control for while modeling factors associated with C T .
Missing values were imputed using the chained equations algorithm in the MICE R package 39 (R Version 4.1.3)to create 10 imputed data sets.Predictive mean matching, logistic regression imputation, and polytomous regression imputation were used for numerical, binary, and multicategory variables, respectively.Results from the 10 linear regression models ran on the imputed datasets were then pooled according to Rubin's rule to provide an overall estimate for the variables.
Multinomial logistic regressions were used to test the association between demographic characteristics, underlying conditions, vaccine status, and SARS-CoV-2 variant with disease severity (1=Mild [Reference Category], 2=Moderate, 3=Severe, 4=Death).First, unadjusted models tested each variable's association with disease severity separately.Next, multivariable models were constructed in a step-wise fashion, adding variables in blocks of demographic variables, including age (continuous), sex (0=Female[Reference], 1=Male), and race(0=Black[Reference], 1=White), followed by a block of underlying condition variables, all of which were binary (0=No, 1=Yes; pregnant, chronic lung disease, hypertension, overweight,

.
cardiovascular disease, diabetes, renal disease, liver disease, autoimmune disease, immunocompromised, systemic immunosuppressive therapy/medications), and the final model added a block of SARS-CoV-2-related characteristics including vaccination status (1=Unvaccinated [Reference], 2=Vaccinated, 3=Vaccinated and Boosted), Days Since Most Recent Vaccination/Booster (1=Unvaccinated [Reference], 2=Within past 90 Days, 3=91 -180 Days Ago, 4=181 -270 Days Ago, 5=More than 270 Days Ago) and variant (1=Delta[Reference], 2=Omicron).The multivariable models were tested for multicollinearity using variance inflation factor (VIF), though collinearity was not present, so all variables were retained.All analyses were conducted in R Version 4.1.3(R Foundation for Statistical Computing, Vienna, Austria).ACKNOWLEDGEMENTS: The study was supported by Centers for Disease Control and Preventions contract 75D30121C10084 under BAA ERR 20-15-2997 (to A.P) and by the Emory WHSC COVID-19 Urgent Research Engagement (CURE) Center, made possible by generous philanthropic support from the O. Wayne Rollins Foundation and the William Randolph Hearst Foundation (to A.P.).L.R.C. was supported by Award Number T32AI074492 from the National Institute of Allergy and Infectious Diseases.EIP Surveillance of COVID-19 was funded through the Centers for Disease Control and Preventions Emerging Infections Program [U50CK000485].The study was supported by the National Institute of Biomedical Imaging and Bioengineering at the National Institutes of Health under award U54 EB027690 (to W.A.L.) and the National Center for Advancing Translational Sciences of the National Institutes of Health under Award Number UL1TR002378 (to W.A.L.).Additional support was provided by the Georgia Clinical & .Translational Science Alliance of the National Institutes of Health under Award Number UL1TR002378.The study was supported in part by the Emory Integrated Genomics Core (EIGC) and Emory Integrated Computational Core (EICC), which are subsidized by the Emory University School of Medicine and are one of the Emory Integrated Core Facilities.The study was supported by the Data Analytics and Pediatric Biostatistics Core, Department of Pediatrics, Emory University School of Medicine.The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Allergy and Infectious Diseases or the National Institutes of Health.The funders played no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Figure S2 .
Figure S2.Phylogenetic analysis does not reveal differences in SARS-CoV-2 sequences

TABLES : Table 1 . Demographic Characteristics by Vaccination Status
Pearson's Chi-squared test or Fisher's exact test for categorical variables and Wilcoxon rank-sum test for continuous variables

Table 2 . Clinical Characteristics by Vaccination Status
Chi-squared test or Fisher's exact test for categorical variables and Wilcoxon rank-sum test for continuous variables Any Systemic symptoms included fatigue, fever, chills, rigors, myalgia, and headache Any GI symptoms included nausea/vomiting, and diarrhea Any Upper Respiratory symptoms included sore throat, runny nose, and nasal congestion Any Lower Respiratory symptoms included cough, shortness of breath, and difficulty breathing Any Immunosuppressed Underlying Condition included HIV, active cancer, autoimmune disease, immunosuppressed, or immunosuppressive therapy Any Other Underlying Condition included overweight, diabetes, renal, cardiovascular, pregnant and liver disease

Table 5 . Adjusted Multinomial Logistic Regression Models Testing the Association between Demographics, Underlying Health Conditions, and COVID-Related Characteristics with Disease
5 " Mild" is the Reference Category 2 aOR = Adjusted Odds Ratio, CI = Confidence Interval, Ref = Reference Level 3 HIV infection, active cancer, solid organ transplant, hematopoietic stem cell transplant4Compared to unvaccinated individuals5Compared to Delta

Table S2 . Circulation of VOC/VOI in Georgia, USA as measured by available sequences on GISAID VOC/VOI Period variant circulated in GA 1
3earson's Chi-squared test or Fisher's exact test for categorical variables and Wilcoxon rank-sum test for continuous variables3Immunocompromised (e.g., HIV infection, active cancer, solid organ transplant, hematopoietic stem cell transplant)

Total sequences available on GISAID from May 2021 to May 2022 from the state of Georgia 51,075 ---
Period of circulation based on sequences available on GISAID including Emory Healthcare samples. 1