A model for COVID-19 transmission in Connecticut

To support public health policymakers in Connecticut as they begin phased lifting of social distancing restrictions, we developed a county-structured compartmental SEIR-type model of SARS-CoV-2 transmission and COVID-19 disease progression. We calibrated this model to the local dynamics of deaths and hospitalizations and the exact timing of state interventions, including school closures and stay-at-home order. In this technical report, we describe the details of the model design, implementation and calibration, and show projections of epidemic development through the Summer of 2020 under different assumptions about the increase in contact rates following partial state reopening. Our model results are consistent with high effectiveness of state lockdown measures, but changes in human interaction patterns during the coming months are unknown. In addition, a lot of uncertainty remains with respect to several key epidemiological parameters and the effectiveness of increased testing and contact tracing capacity. As more information becomes available, we will update the projections presented in this report. Reports in this series are posted to https://crawford-lab.github.io/covid19_ct/.


Introduction
Epidemiological models of disease transmission play an important role in supporting public health decision-making. Trajectories from these models can provide insights into historical trends in epidemic dynamics or future outcomes under hypothetical intervention scenarios. Transmission models are especially useful in situations of high uncertainty, offering a structured way to assess the potential effects of interventions given plausible assumptions about disease transmission. Models cannot predict the future with certainty, but they can be helpful for scenario analysis by bounding the range of plausible future trajectories [1].
In the absence of effective pharmaceutical interventions, many countries, including the US, implemented social distancing measures and stay-at-home orders to slow transmission of SARS-CoV-2. As many states, including Connecticut, begin phased lifting of social distancing restrictions, public officials and public health policymakers are faced with several questions: 1) How effective are public health interventions like school closures and stay-at-home orders in reducing cases, hospitalizations, and deaths? 2) How should public health interventions be implemented in the future to minimize the risk of a resurgence? 3) What will be the effect of phased reopening plans? Multiple transmission models have been developed recently to help answer these and other policy questions . When models are developed with a primary goal to support decision-making, they must balance parsimony and realism.
Projection models should be simple enough to fit to data, and should provide clear outputs in a timely manner with assumptions that can be understood by policymakers. At the same time, for such models to be useful, they need to be flexible enough to accommodate realistic epidemiological features and likely policy scenarios identified by stakeholders.
Many of the nation-wide COVID-19 forecasting models that have been developed in the United States either do not use local data, or make simplifying assumptions not appropriate for localities [2]. These models may capture national-level dynamics, but are less useful for supporting decision-making in individual states or counties. Several nation-wide models have been developed with a goal to provide projections at the state level [24][25][26][27][28][29][30][31][32][33][34]. These models employ varying methods, make different structural and parameter value assumptions, and project potential effects of different future policy interventions. They also vary in terms of model outputs and the ways of handling uncertainty. Most of these models use the same set of assumptions and estimates of key model parameters across all US states, and may not be able to capture important local variation.
In this technical report, we introduce a county-structured transmission model of SARS-CoV-2 transmission and COVID-19 disease progression in Connecticut. The model was developed with a goal to support intervention planning and decision-making in Connecticut, but could be adapted to other states or regions. This report provides an in-depth technical description of the model and data calibration approach, along with projections for Connecticut through the Summer of 2020. Additional COVID-19 reports for Connecticut in this series are available from https://crawford-lab.github.io/covid19_ct/.

Model specification
We developed a deterministic compartmental model of SARS-CoV-2 transmission and COVID-19 disease progression. The model is based on the SEIR (Susceptible, Exposed, Infectious, Recovered) framework [35], which we extend to accommodate geographical variation in Connecticut and distinct features of COVID-19 disease. We calibrate the model to observed dynamics of deaths and hospitalizations in Connecticut, and produce estimates of incidence and prevalence that may be helpful in designing population-level surveys. Similar models have been published recently, offering intervention effect estimates and projections in various locations [2,3,6,7,9,13,14,36,37]. Figure 1 shows a schematic representation of the model structure. A mathematical description of the model, which includes information about how the model accommodates geographic variation across Connecticut counties, appears below. We categorize infections as asymptomatic, mild symptomatic, and severe. Only severe infections may lead to death. Severe infections are defined as those requiring hospitalization. Since one of the key observable features of the model is dynamics of hospitalizations in the community, we use separate compartments for community hospitalizations and for severe cases occurring in institutional settings, such as nursing homes, assisted living facilities, or prisons. The case fatality ratio (CFR) among severe cases in closed communities is assumed to be higher than the community hospital CFR. If hospitalization capacity is overwhelmed, some severe cases in the community are denied hospitalization, and experience a higher probability of death compared to hospitalized cases. Mild symptomatic infections are assumed to self-isolate shortly after they develop symptoms and remain isolated until they recover. During the infectiousness period of symptomatic cases, we assume a period of presymptomatic viral shedding, i.e. the latency period is shorter than incubation period [38][39][40]. In line with other similar models, we assume that individuals with asymptomatic infection exert a lower force of infection, but remain infectious for a longer period of time, since they are less likely to self-isolate in the absence of widespread testing [4,9,37]. The average time that severe cases spend in the infectious state is approximated by the time between onset of infectiousness and hospitalization (or attempted hospitalization in case of hospital overflow). Since severe cases are likely to be isolated at home before hospitalization, and those in nursing homes, assisted living facilities, or prisons are likely to be isolated within a few days after symptom onset, we assume reduced force of infection from severe infections in this setting. The force of infection from hospitalized patients to unhospitalized susceptible individuals is assumed Individuals begin in the susceptible ( ) compartment. Exposed individuals ( ) may develop either asymptomatic ( ), mild ( ), or severe ( ) infection. Asymptomatic and mild infections resolve without hospitalization and do not lead to death. Mild symptomatic cases self-isolate ( ) shortly after development of symptoms, and transition to recovery ( ) when infectiousness ceases. A proportion of severe cases require hospitalization ( ) unless hospitalization capacity is exhausted, in which case they transition to¯ representing hospital overflow, then to recovery ( ) or death ( ). Some severe cases, including nursing homes and assisted living facilities residents, people in prisons, or individuals, who do not have access to hospitalization for other reasons, transition to compartment and may later recover or die. The model assumes a closed population without births and does not capture non-COVID-19 deaths.
to be negligible. We further assume that recovered individuals remain immune to reinfection for the duration of the study period. The model is implemented at the level of individual counties in Connecticut assuming that most contacts are happening within a given county. A small proportion of contacts is allowed to happen between adjacent counties. The analysis was performed using the R statistical computing environment [41]. We used package deSolve to perform numerical integration of the system of ordinary differential equations (ODE) [42].

Compartmental model and parameters
The model divides the population into 11 compartments: susceptible ( ), exposed, latent infections ( ), infectious and asymptomatic ( ), infectious and mild symptomatic ( ), infectious and severe ( ), isolated mild infections removed from the pool of infectious individuals ( ), hospitalized ( ), severe in need of hospitalization, but denied it due to hospital capacity overflow (¯ ), severe in nursing homes, assisted living facilities, prisons or otherwise not having access to hospitalization ( ), recovered ( ), and died ( ). Let be the population size of county and the set of counties adjacent to county . Let ( ) represent hospitalization capacity in county , which may vary over time. Transmission dynamics for county are given by the following ODE system:  Figure 2: County map of Connecticut and county adjacency matrix. The dark gray cells correspond to counties that are adjacent. Contacts between adjacent counties are included in the model in addition to contacts within counties.
The function ( ) = 1 + exp(0.5( ( ) − ( ) )) −1 is a "soft" hospitalization capacity overflow function. Table 1 shows model parameters and their definitions. Some of the parameters from Table 1 are not directly used in the ODE system, but are used as inputs to compute other model parameters. Figure 2 shows the county map of Connecticut along with the county adjacency matrix. The geographic boundary files were obtained from the Connecticut Department of Environmental Protection [43]. We assume that a fraction (1 − ) of all contacts happen within a given county, and the remaining contacts happen between individuals residing in adjacent counties.

Effects of social distancing and testing interventions
Social distancing measures, in particular school closure and lockdown, reduce the value of transmission parameter . The value of the transmission parameter at time is calculated as: where 0 is a value of transmission parameters in the absence of any interventions, school ( ) is the proportion of contact reduction following school closure, and lockdown ( ) is a proportion of contact reduction following lockdown that is independent of school closure effect. school ( ) and lockdown ( ) are zero when these interventions are not in effect, and can either have constant or time-varying values when interventions are in effect, subject to a constraint 0 ≤ school ( ) + lockdown ( ) ≤ 1 at any time .

5
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 16, 2020. . Based on the known dates of school closure and lockdown orders, and assuming that these effects are constant over time once they reach their maximum levels school and lockdown , we calibrate the values of these effects to observed data. To project epidemic dynamics into the future, we replace lockdown ( ) with a step function representing post-lockdown increases in population-level contact. Figures 5 to 10 show examples of assumed post-lockdown increases in contact, starting on May 20, 2020.
Increase in testing capacity and contact-tracing is assumed to result in earlier identification and isolation of mild symptomatic and asymptomatic cases, and leads to increases in (rate of transition from to ) and (rate of transition from to ). The values of these transition rates at time are calculated as: where (0) and (0) are reciprocals of the duration of time until isolation among asymptomatic and mildly symptomatic infections respectively under current testing availability and contact tracing conditions. Similarly to the parametrization of distancing effects, testing, ( ) and testing, ( ) may vary over time. We assume that soon after the lockdown is lifted, testing capacity and contact tracing efforts increase and remain constant at that level. The values of testing, ( ) and testing, ( ) are assumed to be zero prior to May 20, 2020 and increase to testing, and testing, respectively afterwards.

Model calibration and Bayesian posterior inference
We calibrate the distribution of model parameters using the observed dynamics of community hospitalizations and cumulative number of deaths. Due to changing testing patterns and varying case ascertainment proportions over time, we do not use reported case counts in the calibration procedure [1,44]. As is often the case for epidemic models, not all parameters are simultaneously identifiable given the observed data on hospitalizations and deaths.
In particular, different sets of parameters that collectively determine early epidemic growth rate (i.e. transmission parameter and parameters that define force of infection) are observationally equivalent. Given this limitation, we fix a subset of parameters at their point estimates and estimate the remaining parameters using a Bayesian approach.
To generate uncertainty intervals for projections, we sample from the joint posterior over estimated parameters and uncertainty distributions for a subset of fixed parameters, then find pointwise 95% posterior predictive intervals for each time point.
Several model parameters influence epidemic dynamics in the future, beyond the data available for calibration. These parameters include the hospital overflow CFR ( ¯ ), testing capacity and contact tracing effects ( testing, and testing, ), and the extent of release of suppressed contact once lockdown is lifted. These parameters cannot be estimated based on historical data, but may be estimated in the future. For the purpose of projections, we impose a distribution on ¯ , testing, , and testing, independent of one another and of the joint posterior distribution of calibrated parameters. Increases in contact rates following state reopening are parameterized relative to the size of lockdown effect lockdown . The point estimates of models parameters, prior distributions, and assumed distributions are summarized in Table 2.

Continuous transmission model parameters
Let denote the subset of continuous transmission model parameters whose joint distribution is calibrated to observed data. For each individual parameter , we specify a fixed support [ min , max ], and put independent beta priors on 6 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . the transformed parameter, i.e., where the shape parameters and are set to let have mean and standard deviation . Table 2 provides the list of ( , , min , max ) for all parameters in .

Initial conditions
Given a set of transmission parameters, we initialize the compartmental model by specifying the size of exposed ( ) compartment in each county and setting the size of downstream compartments to be zero. Let ( ) 0 be the -th element of vector 0 denoting initial size of exposed compartment in county . The initial size of susceptible compartment in county is given by Elements of vector 0 were pre-specified based on the relative population size and the dates of first registered case and death in each county.
We allow the date of epidemic onset to vary by letting 0 to correspond to day − 0 , where day 0 corresponds to a calendar date of March 1, 2020. The state of the system at any given time is therefore a deterministic function of the initial size of exposed compartment, model parameters, and 0 that determines the date of epidemic onset. We put a uniform prior on 0 over { min , . . . , max }.

Reported hospitalizations and deaths
We accommodate reporting lags in observed hospitalizations and deaths. Reporting lags are correlated with other unknown parameters, including latency period, epidemic onset lag, time between infection and hospitalization, time between infection and death, length of hospital stay, and difference in death reporting lags between hospitals and nursing homes. Following Osthus et al. [45], we model the observed population fractions of hospitalizations ℎ( ) and deaths ( ) at time using a Beta distribution, where is the size of Connecticut population; and ( , 0 , , ) and ( , 0 , , ) are model-projected current hospitalizations and cumulative deaths at time with reporting lags and , and epidemic onset lag 0 under parameter values . We put uniform priors on and over a range of plausible values. Parameters ℎ and control the variance of respective Beta distributions. We put the same independent Gamma prior on ℎ and , i.e.
where and are shape and rate parameters of the Gamma distribution, set to = 4 and = 0.0002.

Posterior inference
Posterior inference is performed using a weighted likelihood that assigns more weight to more recent observations. We let the weight function ( ) take the following form, 7 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. Observation weight qqqqqqqqqqqqqqqqqqq qqqq qq qq qq q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q q qq q qqqqqqqqqq qq qq q qq q q q q q q q q q q q q q q q q q q qq qq qq qq qqq q q qq q q q q q qqq q q q q q qq q q q q qqq q q where ( ) is the weight assigned to observation at time , ∈ { 0 , ..., max } and the correspondence between { 0 , ..., max } and calendar time is set such that = 0 corresponds to = 45 days prior to the most recent observation. Parameter controls the smoothness of logistic function. We set = 0.3, which leads to the weight function in Figure 3.
We construct the posterior distribution over unknown parameters ( , , L) as where = ( , , , , , , , , , school , lockdown ), = ( ℎ , ), and L = ( 0 , , ). Each likelihood term is weighted by ( )/2 because we assign equal weight 1/2 to data contributions from both hospitalizations and deaths. Sampling from the joint posterior distribution of ( , , L) given in (6) is performed using Markov Chain Monte Carlo (MCMC). We implemented a Metropolis-Hastings algorithm with random walk proposals for and and independent proposals for L. We ran the sampler for 2,000,000 iterations and thinned the chain using every 400th iteration.

Parameter values, prior distributions, and data sources
Asymptomatic infections play an important role in transmission of SARS-CoV-2 [38][39][40]46], but estimates of the proportion of infections that do not exhibit symptoms vary [46][47][48][49][50][51]. The true proportion of asymptomatic infections is important for projections and policy planning due to its relationship to evolving herd immunity. In the absence of reliable estimates of cumulative incidence based on sero-prevalence surveys, we consider three scenarios: is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . Princess cruise ship. We applied this estimate to age group 65 plus years old, which is the age group, in which most infections occurred. We assumed 60% for age group 0-19 years old, consistent with findings reported by Russell et al. [52], where 4 out of 6 infections in this age group were asymptomatic among passengers of the Diamond Princess. The average was weighted by the age distribution of Connecticut population. At the time of writing this report, the best estimate of asymptomatic proportion recommended by the CDC was 35% [53]. Based on recent evidence suggesting that the proportion of asymptomatic infections may be higher [46,47,50,51,54,55], we assumed = 0.5 and = 0.7 in medium and high scenarios respectively. Table 2 provides a list of prior mean, standard deviation, and lower and upper bounds of the model parameters along with data sources. For parameters whose values were fixed, only the mean is given. In the three scenarios considered, we fixed the values of asymptomatic proportions ( ). The proportion of severe cases ( ) was allowed to vary with a mean assumed to be 10% of symptomatic infections in line with [14,[56][57][58].
Duration of the latency period (1/ ) was fixed, since reliable estimates of this parameter are available and because it is correlated with parameters that were allowed to vary, including reporting lags and parameters that determine duration of infectiousness and time between infection and recovery or death. We assume an average of 4 days of latency [14,36] and 1.5 days of presymptomatic infectiousness [59] resulting in an average incubation period of 5.5 days in line with [14,47,53,57,[60][61][62].
Parameters ( , , , , ) collectively determine force of infection at a given time. Force of infection and transmission parameter together determine the early growth of the epidemic. Without additional data, these parameters cannot be simultaneously identified. We therefore fixed parameters for which estimates are available or which can be calculated from available estimates ( , , ), and assumed prior distributions and calibrated posterior of those that are unavailable ( , ). Duration of infectiousness of asymptomatic individuals is unknown, but is likely shorter than that of symptomatic individuals [63]. While several studies estimated that viral RNA could be detected in upper respiratory tract for 2-3 weeks [64,65], findings from [66] suggest that live virus can be isolated for a substantially shorter time period, which in this study was a maximum of 7 days. We assume a diffuse prior on , which absorbs additional variations of parameters that determine force of infection.

Initial conditions and reporting lags
Based on the relative population size of the counties and the date of first registered case and death, we assume an initial size of exposed compartment to be 7.5 in Fairfield, 6.5 in Hartford, 0 in Litchfield, 0.25 in Middlesex, 4.5 in New Haven, 0.13 in New London, 0.05 in Tolland and 0.04 in Windham. We allow the starting date of the epidemic to be between February 12th -19th and calibrate a scenario-specific distribution of starting dates to the observed data.
Early in the epidemic, many infections were likely "imported" from New York City, which experienced a major outbreak in the spring of 2020. In particular, many residents of Fairfield county work in New York City. We do not directly model importation events: the assumed number of exposed individuals at time zero in Fairfield county does not include all such events. Infection importation from New York City likely continued for days or even weeks early in the epidemic. This unaccounted force of infection is absorbed in the estimation of transmission parameter .
We specified a wide range of reporting lags for hospitalizations and deaths, resulting in ranges of 5-9 days for hospitalizations and 8-11 days for deaths. It is likely that deaths occurring in hospitals are reported with substantially smaller lag than those occurring in nursing homes, since many of the latter ones need to be confirmed by a medical examiner [77].

Reduction in transmission following school closure and lockdown orders
We calibrated the parameters that determine reduction in contact rates following school closure and lockdown orders to the observed data. We assume that school ( ) equals zero before March 13th, 2020 -the date of school closure order, and equals some constant positive value school afterwards. The lockdown ( ) equals zero before March 20th, 2020 9 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . https://doi.org/10.1101/2020.06.12.20126391 doi: medRxiv preprint Posterior parameter estimates were similar if we allowed a smoother decrease in contact rates. Since the dates of school closure and lockdown orders are very close, it is difficult to simultaneously identify school and lockdown . We therefore put a narrow prior distribution on school and diffuse on lockdown , subject to a constraint 0 ≤ ( school + lockdown ) ≤ 1 .

Observed data
Cumulative deaths and current hospitalizations were obtained from Connecticut Department of Public Health daily reports [76] and the Connecticut Hospital Association [75]. We calculated county-level population and age structure in Connecticut using the 2014-2018 estimates of the American Community Survey [78]. This analysis does not use 10 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
Daily total available hospital beds (including occupied) in each county were obtained from the Connecticut Hospital Association/CHIMEData [75,79] and used as hospitalization capacity values on a given date. The latest available value of hospitalization capacity was used for future projections. Figure 4 shows projections of hospitalizations and deaths, with 95% posterior predictive intervals, under the medium asymptomatic fraction scenario and observed data as dots. Observed data points track with mean projections and largely fall within uncertainty intervals. Calibration results under low and high asymptomatic fraction scenarios demonstrate a similar fit to historical data. Table 3 reports means and 95% credible intervals of marginal posterior distributions of model parameters and derived epidemiological parameters under the three asymptomatic fraction scenarios.

Results
Figures 5 to 7 summarize projections under the low, medium and high asymptomatic fraction and 'slow' reopening. Slow reopening assumes that at one-month intervals, 10% of the contacts suppressed via lockdown is released. Figures  8 to 10 summarize projections for the three asymptomatic fraction scenarios assuming 'fast' reopening. Fast reopening assumes that 10% of the suppressed lockdown contact is released at two-week intervals. The projections include daily COVID-19 incidence, hospital census, deaths, cumulative incidence proportion, and effective reproductive number, eff .
Our estimate of "crude" 0 under the low asymptomatic fraction scenario is 4.69 (95% CI: 4.14 − 5.33), which is on the higher end of available estimates [3,4,13,36,47,53,61,80], but similar to recent modeling studies that estimated 0 to be between 4.7 and 6.3 in Northeastern United States, and around 5 in New York and New Jersey [7,80]. Under the high asymptomatic fraction scenario, our estimate of crude 0 is 5.76 (95% CI: 5.05 − 6.65). We believe that this over-estimates the true 0 as we do not model infection importation events from New York City that likely made a substantial contribution to the force of infection early in the epidemic. Unaccounted force of infection biases estimates of transmission parameter , and hence our "crude" 0 , upwards. A recently published analysis implies that 0 = 3.3 in Connecticut [81]. This suggests that about 30% to 40% of infection events early in the epidemic could be imported, which is a plausible proportion range. Estimates of eff at later stages of the epidemic are less likely to be affected by this source of bias, since lockdown measures have probably resulted in a very low rate of interstate contacts. Our estimates of eff on May 20 are similar under the three asymptomatic proportion scenarios.
Under high asymptomatic fraction scenario, we would expect a higher number of infections, hospitalizations and deaths by the end of summer. Under slow reopening, hospitalization capacity is not expected to be overwhelmed under any of the scenarios. In case of fast reopening, hospitalization capacity would likely be exceeded sooner if asymptomatic fraction is high. The main difference between the asymptomatic fraction scenarios is in the cumulative incidence proportion. By the end of summer, it may be as low as 0.1 under the low asymptomatic fraction and slow reopening, and as high as 0.6 under the high asymptomatic fraction and fast reopening. The latter scenario predicts the second peak of infections to occur in mid-August, while all other scenarios predict that the second peak would not occur until later in the Fall. While higher asymptomatic fraction is expected to result in higher number of deaths in the short-term, the total number of deaths by the end of the epidemic would likely be higher under low asymptomatic fraction. When asymptomatic fraction is low, infections and deaths are delayed, but the proportion of severe cases is higher.

11
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. qqqqqqqqqqqqqqqqqqqqqqq qqq qq qq q qq q q q q q q q q qq q q q q q q q q q q q q q q q q q q q qq qq q q q q qq q q Figure 4: Parameter calibration results under the medium asymptomatic fraction scenario ( = 0.5).   12 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020.   Figure 6: Projections under a "slow" reopening scenario and Scenario 2, medium asymptomatic proportion. 10% of suppressed contact is released every 30 days, starting on May 20, 2020, with 95% uncertainty intervals. The dashed line above hospitalization projections is an estimate of the hospital bed capacity in Connecticut [75]. eff in March includes the effect of imported infections from outside Connecticut, in particular the New York region.

13
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. 14 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . https://doi.org/10.1101/2020.06.12.20126391 doi: medRxiv preprint  Figure 8: Projections under a "fast" reopening scenario and Scenario 1, low asymptomatic proportion. 10% of suppressed contact is released every 2 weeks starting on May 20, 2020, with 95% uncertainty intervals. The dashed line above hospitalization projections is an estimate of the hospital bed capacity in Connecticut [75]. eff in March includes the effect of imported infections from outside Connecticut, in particular the New York region.

15
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . https://doi.org/10.1101/2020.06.12.20126391 doi: medRxiv preprint : Projections under a "fast" reopening scenario and Scenario 2, medium asymptomatic proportion. 10% of suppressed contact is released every 2 weeks, starting on May 20, 2020, with 95% uncertainty intervals. The dashed line above hospitalization projections is an estimate of the hospital bed capacity in Connecticut [75]. eff in March includes the effect of imported infections from outside Connecticut, in particular the New York region.

16
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 17 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . CHR, case hospitalization ratio (proportion of symptomatic cases requiring hospitalization); IHR, infection hospitalization ration (proportion of all infections requiring hospitalization); CFR, case fatality ratio (proportion of symptomatic cases who die); IFR, infection fatality ration (proportion of all infections who die).

Discussion
In this report, we have described technical details of a model of SARS-CoV-2 transmission and COVID-19 disease progression developed to support public health decision-making in Connecticut. The model is calibrated to the observed dynamics of hospitalizations and cumulative deaths in Connecticut; its projections reproduce these dynamics accurately. Many COVID-19 models have been developed and analyzed by the CDC in the attempt to perform ensemble forecasting of the epidemic development in the US [2]. Some of these models offer state-level projections [24][25][26][27][28][29][30][31][32][33][34]. The CDC publishes updates of consolidated summary of cumulative death projections in the next four weeks from these models for each state. Local projections from nation-wide models may offer useful insights, however simplifying or uniform assumptions made in most of these models may not hold in all of the locations.
There is substantial uncertainty about epidemiological parameters that govern aspects of COVID-19 dynamics and have a direct impact on the quality of projections. When local context is not directly taken into account, the effects of parametric uncertainty is exacerbated. State and county-level models are needed to support local decision-making. Our model captures distinct important features of COVID-19 dynamics and the relationship between model features and data reporting in Connecticut. By calibrating model parameters to local data, including changes in hospitalization capacity, and the exact timing of intervention events, we can reduce uncertainty in projections. However, the calibrated posterior distribution of model parameters is not necessarily generalizable to other settings: model projections are tightly linked to the Connecticut reopening plans.
In addition to providing predictions for policymakers, model projections may be useful for prospectively planning epidemiological studies that can inform the state's response. In particular, planning of seroprevalence surveys requires estimates of the proportion of population who have evidence of prior exposure to the virus. Due to limited 18 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 16, 2020. . testing availability and potentially high proportion of asymptomatic individuals, official case counts offer a poor approximation to the true cumulative incidence. Seroprevalence surveys, if properly conducted, can provide an important piece of information that would permit more precise estimates of the fraction of asymptomatic infections.
Our estimates of symptomatic case hospitalization ratio (CHR) and symptomatic CFR are higher than those recommended by the CDC for modeling studies [53]. Epidemic dynamics in Connecticut may be different from other states and from the average dynamics across the US. Our prior estimates are informed by the reports of Connecticut Department of Public Health [76,77], as well as data provided by the Connecticut Hospital Association [75], and by Yale New Haven Hospital. To the extent possible, we use a data-driven approach to parameter estimation, and our posterior parameter estimates are highly consistent with observed epidemic dynamics. One of the important factors that may explain relatively high estimates of CFR in Connecticut is related to high mortality among residents of nursing homes and assisted living facilities. About 50% of official COVID-19 related deaths in Connecticut occurred in this subpopulation [77]. Lower CHR implies a lower proportion of severe cases, and would push estimates of CFR higher in order to reproduce the observed dynamics of hospitalizations and deaths simultaneously.
Symptomatic CHR, symptomatic CFR, and infection fatality ratio (IFR) are related epidemiologic quantities that describe severity of infection. Early estimates of these parameters used in many modeling studies come from China [56,57,82]. A recent review suggests that COVID-19 has higher severity in Europe and in the US compared to China [58] with higher CFR in the same age groups [83]. Estimates of CFR among hospitalized patients range from 15.4% to 29% in different parts of the US [58,68,71], and were reported to be as high as 32% -38.6% in the UK [70,74]. These differences suggest that some epidemiologic parameters may not be generalizable between different settings, and that, to the extent possible, local projections should rely on local parameter estimates.
Estimates of IFR that rely on official COVID-19 related death counts are likely biased downwards [84]. While a recent systematic review reports a pooled estimate of IFR = 0.75% [85], studies that attempt correction of this source of bias report IFR estimates of 1.04% [86] and 1.29% [87], consistent with our model estimates. Epidemiological parameters provided by the CDC are regularly updated and may change in the future.
Prior knowledge and assumptions about plausible ranges of parameter values combined with local data allows us to substantially reduce parametric uncertainty and produce narrow projection intervals. However, several important considerations limit our ability to make reliable long-term projections. First, it is difficult to make predictions about the extent of contact rate increases following lockdown release steps, such as allowing only certain types of businesses to reopen. Second, the effectiveness of widespread testing and contact tracing on timely isolation of infectious individuals, and its subsequent impact on the force of infection is unknown. This effect is a complex function of viral shedding characteristics among symptomatic, presymptomatic and asymptomatic individuals, along with the implementation features of contact tracing, testing, and isolation [88]. Multiple studies conclude that social distancing is highly effective in reducing transmission with effect estimates similar to our estimate of combined school closure and lockdown effect [6,13,14,16,18]. Connecticut officials plan increases in contact tracing and testing capacity to mitigate potential increases in contact rates, and several modeling studies suggest that combination of reduced social distancing with increased testing may prevent a second wave of infections [8,9], but testing coverage should be very high and isolation should happen rapidly [13]. It is, however, unclear how feasible is repeated population-level screening and how successful is traditional "manual" contact tracing. Some researchers argue that while repeated testing is important for health-care workers and high risk groups, it is unlikely to have a substantial impact on transmission [89]. Contact tracing using mobile technology may be a promising intervention [17]. Overall, our estimates suggest that at the end of May, 2020 eff was around 0.65, meaning that a substantial level of social distancing would need to be maintained long-term and that testing and contact tracing alone would likely be insufficient to keep eff below 1.
Third, there may be unequal depletion of susceptible individuals depending on their severity risk profile. If a higher fraction of high-risk individuals have already experienced infection compared to low-risk individuals, then we may be under-estimating current prevalence and over-estimating future number of deaths and the final epidemic size [22,23]. Case counts and mortality data from nursing homes suggest that susceptible individuals in this group are likely depleted faster than the in general population [77], but there may be other high risk groups, which have not yet experienced major outbreaks. While uniform reduction in contact rates following lockdown is consistent with observations, lower risk groups may have reduced their contacts slower than high-risk groups, resulting in higher cumulative incidence with the majority of infections being asymptomatic [51].
Fourth, we assume that all epidemiological parameters are constant over time and are not subject to seasonal forcing. It is hypothesized that temperature may play a role in transmission dynamics beyond its impact on contact patterns [3], in which case we would expect lower transmission intensity in summer all other things being equal. A recent study, however, suggests that the overall impart of warm temperature is small and insufficient to reduce transmission during summer months below the epidemic threshold in the absence of other interventions [90].
In an evolving public health crisis, it is important to keep projections up to date. As more information about the extent of asymptomatic transmission, general epidemiological characteristics of COVID-19, and data on observable model features in Connecticut becomes available, we will update the projections presented in this paper. Reports in this series are posted to https://crawford-lab.github.io/covid19_ct/.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 16, 2020. . . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 16, 2020. . https://doi.org/10.1101/2020.06.12.20126391 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 16, 2020. . . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 16, 2020. . https://doi.org/10.1101/2020.06.12.20126391 doi: medRxiv preprint