Association between SARS-CoV-2 Transmission Risk, Viral Load, and Age: A Nationwide Study in Danish Households

Aim The objective of this observational study was to investigate the association between SARS-CoV-2 transmission risk, RT-PCR Cycle threshold (Ct) values, and age of infected cases in Danish households. Background The Covid-19 pandemic is one of the most serious global public health threats in recent times. Understanding transmission of SARS-CoV-2 is of utmost importance to be able to respond to outbreaks and take action against the spread of the disease. Viral load is generally thought to correlate with transmission risk. Methods We used comprehensive administrative register data from Denmark, comprising the full population and all SARS-CoV-2 tests (August 25, 2020 to February 10, 2021), to estimate household transmission risk. Results We found that the transmission risk was negatively associated--approximately linear--with the Ct values of the tested primary cases. Also, we found that even for relatively high Ct values, the risk of transmission was not negligible; e.g., for primary cases with a Ct value of 38, we found a transmission risk of 8%. This implies that there is no obvious cut-off for Ct values for risk of transmission. We estimated the transmission risk according to age and found an almost linearly increasing transmission risk with the age of the primary cases for adults ([≥]20 years) and negatively for children (<20 years). Age had a higher impact than Ct value on the risk of transmission. Conclusions Lower Ct values (indicating higher viral load) are associated with higher risk of SARS-CoV-2 transmission. However, even at high Ct values, transmission occurs. In addition, we found a strong association between age and transmission risk, and this dominated the Ct value association.


Introduction
Person-level data, which included information on the test results, dates and times of 81 sampling as well as the times the results were available, were linked to individuals within 82 households. For each household, we identified the first positive test for SARS-CoV-2; 83 called the primary case throughout this paper. We considered all subsequent tests from 84 other members in the same household as tests taken in response to the primary case. We where Ct p,1 is the Ct value (rounded to the nearest integer) of the primary case. β measures the transmission risk for each Ct value. ε p denotes the error term, clustered on 107 the household (event) level. 108 To estimate the association between age and transmission risk, stratified by the median 109 Ct value (28), we estimated the non-parametric regression equation: where Age p,5 is the age (in five-year groups) of the primary case. β measures the trans-111 mission risk for each five-year age group of the primary cases. ε p denotes the error term, 112 clustered on the household (event) level.

113
To estimate the association between Ct value, age, and transmission risk, we estimated 114 the non-parametric regression equation: where Ct p,2 is the Ct value (in bi-value groups) and Age p,10 is the age (in ten-year groups) 116 of the primary case. β measures the transmission risk of the interaction between Ct value 117 and age of the primary case. Age s is the age of the potential secondary cases (s) and α 118 measures the linear association with age of the potential secondary cases (Lyngse et al., 119 2020). ε p denotes the error term, clustered on the household (event) level.

120
To quantify the increased transmission risk across different observable characteris-121 tics, we estimated a univariable and a multivariable logistic regression. In particular, to 122 estimate the odds ratio, we estimated the logistic regression equation: 123 log y p 1 − y p = β × Ct p,2 + γ × Age p,5 + φ × F emale p + δ × HouseholdSize p + ε p , (4) where β measures the non-parametric association with Ct values, γ measures the non-124 parametric association with age of the primary case, φ measures the association with sex, 125 6 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; and δ measures the association with the size of the household. ε p denotes the error term, clustered on the household (event) level. 127 3.4.1 Sensitivity analyses 128 To investigate whether age had an impact on the result, we estimated the association 129 between Ct value and transmission risk, controlling for age of the primary case. Further-130 more, we estimated the age structured transmission risk stratified by sex to see whether 131 there were different patterns across men and women. We then estimated the age struc-132 tured transmission risk stratified by Ct value quartiles to see whether the pattern was 133 independent of the median Ct value cut-off. 134 As Ct values were only available for the primary cases that were identified by being 135 tested in TCDK, only these primary cases were included in the analyses. To address 136 the potential bias from not including primary cases that were identified at hospitals, we 137 performed sensitivity analyses by estimating transmission risk stratified by TCDK and 138 hospitals. 139 We also estimated the transmission risk of the interaction between Ct value and age 140 of the primary case without controlling for age of the potential secondary case to see 141 whether our results were driven by the age of the potential secondary cases. Because 142 people normally live with a partner around their own age and parents with their children, January 25 (73% of all primary cases) (Table S1). Of these primary cases, a Ct value was 157 available for 99.6%.

158
In this study, we had 66,311 primary cases living with 213,576 potential secondary 159 cases, of which 103,389 (48%) tested positive for SARS-CoV-2 and were thus actual 160 secondary cases (Table S2). Approximately half were men and half women. 25% of 161 primary cases had a Ct value ≤25, 50% had a Ct value ≤28, and 75% had a Ct value 162 ≤32 ( Figure S1). The distribution of Ct values was relatively similar across age groups, 163 suggesting that differences in test strategy across age were not driving our results ( Figure   164 S2 and Table S4).  CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 5, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  Notes: A primary case with a Ct value of 18 has a transmission risk of 43%, and a primary case with a Ct value of 38 has a transmission risk of 8%. The estimates come from estimating regression equation 1. The shaded area shows the 95% confidence bands clustered on the household level. Figure 2 shows the cumulative distribution of secondary cases by Ct values, e.g.,

169
primary cases with Ct values ≥30 account for 39% secondary cases. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; https://doi.org/10.1101/2021.02.28.21252608 doi: medRxiv preprint Notes: This figure illustrates the proportion (%) of positive secondary cases from primary cases with a Ct value ≥X. E.g., primary cases with a Ct value ≥30 account for 39% of total secondary cases; primary cases with a Ct value ≥32 account for 27% of total secondary cases. An RT-PCR test is positive if the Ct value is ≤38. Figure 3 shows the age structured transmission risk stratified by the median Ct value.

171
There is an overall positive association between age and transmission risk for adults 172 (≥20 years) and a negative association for children (<20 years), i.e., the transmission risk 173 increased with younger age for children. Across all age groups, we found that primary cases 174 with a lower Ct value (<28, red) had a significantly higher transmission risk compared to 175 primary cases with a higher Ct value (≥28, blue).

176
10 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021.

Ct Value
Notes: A primary case aged 0-5 years with a Ct value <28 has a transmission risk of 26%, while a primary case aged 0-5 years with a Ct value ≥28 has a transmission risk of 15%. The estimates come from estimating regression equation 2. The shaded areas show the 95% confidence bands clustered on the household level. The median Ct value for primary cases is 28. Figure 4 shows the association between age, Ct value and transmission risk. The 177 transmission risk generally increases with higher age and lower Ct value.

11
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; https://doi.org/10.1101/2021.02.28.21252608 doi: medRxiv preprint   12 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;

13
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;

Sensitivity analyses
We found an approximately linear negative association between Ct value and trans-187 mission risk that was also present when controlling for age of the primary case ( Figure   188 S3). We found similar transmission risks for men and women ( Figure S4). The age struc-189 tured transmission risk stratified by Ct value quartiles showed the same picture as when 190 stratifying by the median Ct value ( Figure S5). 191 We also found that primary cases tested at hospital test facilities had a slightly higher 192 transmission risk-across five-year age groups ( Figure S6).

193
When excluding the age of the potential secondary case as a control variable, we found 194 a slightly reduced association between the transmission risk and the interaction between 195 the Ct value age of the primary case ( Figure S7). Furthermore, when stratifying by Ct 196 value quartiles and estimating the transmission risk by the interaction of the age of the 197 primary case and the age of potential secondary cases, we still found the same overall 198 pattern ( Figure S8). To our knowledge, this is the first nationwide study investigating the association be-201 tween transmission risk, age and Ct values. We here exploited the detailed Danish register 202 data comprising the full population and all RT-PCR tests for SARS-CoV-2. 203 We found an approximately linear association between Ct value and transmission risk, 204 implying that cases with a higher viral load are more infectious than cases with a lower CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; https://doi.org/10.1101/2021.02.28.21252608 doi: medRxiv preprint cases with a Ct value above a certain cut-off are not contagious. A Ct value cut-off of 30,  (Table S5).

245
Our estimates here benefit from a large sample size and objective selection of potential 246 secondary cases. We included all household members as potential secondary cases (un-247 conditional on them being contacted by the official contact tracing system), of which 88%

252
Another main result from this study is that age was strongly associated with trans-253 mission risk-even when controlling for viral load (Figure 3 and S5). This result was not 254 driven by the age of the potential secondary cases, as we found roughly the same pattern 255 when adjusting for this ( Figure S7). We found an overall positive association between 256 age and transmission risk for adults (≥20 years), whereas the transmission risk decreased 257 with age for children (<20 years) (Figure 3). This pattern was found independently of the 258 Ct value of the primary case, but the overall the transmission risk increased with lower 259 Ct values ( Figure S8).

260
It is noteworthy, that we found that age dominated viral load in predicting transmis-261 sion risk. For instance, the transmission risk doubles when the Ct value decreases from 262 36-38 to 18-20; similarly, the risk doubled when the age of the primary case increases 263 from 20-25 years to 65-70 years, and triples, when the case is 80-85 years (Table 1). This 264 pattern could be driven by the susceptibility of the potential secondary cases, as people 265 tend to live with their partner, who is around their own age, and parents live with their 266 children (Lyngse et al., 2020;Madewell et al., 2020). We investigated this and did find 267 that primary cases tend to infect other persons around the same age within the household 268 ( Figure S8). However, when we control for age of the potential secondary cases, we still 269 found an association between age and transmission risk (Figure 4). 270 16 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; https://doi.org/10.1101/2021.02.28.21252608 doi: medRxiv preprint Possible explanations for this finding could be that age is associated with increased viral exhalation (Edwards et al., 2021) or that the immunological response is associated  (Table S4). To calculate the distribution of Ct values across age groups, it is 280 necessary to include a relatively large sample comprising all age groups.

281
As Ct values were only available for the primary cases identified by being tested 282 in TCDK, only these primary cases were included in the analyses. We addressed the 283 potential bias from not including primary cases that had been identified at hospitals 284 (27%) by estimating the transmission risk stratified by primary cases identified at TCDK 285 and hospitals ( Figure S6). Primary cases identified in hospitals generally had a higher 286 transmission risk, possibly because they were symptomatic. However, the trend in the 287 age structured transmission risk was approximately the same across TCDK and hospitals, 288 indicating that sample selection did not affect the general results. 289 We defined the primary cases as the first positive test within a household and all 290 other persons living in the same household as potential secondary cases. We defined all 291 secondary cases as those testing positive 1-14 days after the primary case. However, some 292 of these co-primary and secondary cases may be misclassified, e.g., if they were infected 293 earlier but not diagnosed, because they were pre-or asymptomatic. Including secondary 294 cases found >14 days after the primary case could result in misclassification of secondary 295 cases being either tertiary cases or having somewhere else as the source of secondary 296 infections.

297
17 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Optimal contact tracing naturally has to prioritize the order of new cases and their 298 contacts. Our results suggest that contact tracing should prioritize cases according to Ct 299 values, but more so, according to age.

300
In conclusion, we found that lower Ct values (indicating higher viral load) is associated 301 with higher risk of SARS-CoV-2 transmission. However, even at high Ct values, trans-302 mission occurs. In addition, we found a strong association between age and transmission 303 risk that dominated the Ct value association.

304
18 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

22
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ; 23 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

24
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

25
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

26
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;

27
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. 28 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021  29 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

30
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

31
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021. ;

32
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted March 5, 2021.         Notes: This figure shows the transmission risk from the interaction between age of the primary case and age of the potential secondary cases, stratified by Ct value quartiles. Panel (a) shows the transmission risk for primary cases with Ct values in the lowest quartile. Standard errors clustered on the household level in parentheses.

33
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted March 5, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021