Estimates of regional infectivity of COVID-19 in the United Kingdom following imposition of social distancing measures ====================================================================================================================== * Robert Challen * Krasimira Tsaneva-Atanasova * Martin Pitt * Tom Edwards * Luke Gompels * Lucas Lacasa * Ellen Brooks-Pollock * Leon Danon ## Abstract The SARS-CoV-2 reproduction number has become an essential parameter for monitoring disease transmission across settings and guiding interventions. The UK published weekly estimates of the reproduction number in the UK starting in May 2020 which are formed from multiple independent estimates. In this paper, we describe methods used to estimate the time-varying SARS-CoV-2 reproduction number for the UK. We used multiple data sources and estimated a serial interval distribution from published studies. We describe regional variability and how estimates evolved during the early phases of the outbreak, until the relaxing of social distancing measures began to be introduced in early July. Our analysis is able to guide localised control and provides a longitudinal example of applying these methods over long timescales. Keywords * COVID-19 * SARS-CoV-2 * reproduction number * regional variation ## Introduction In late 2019 an outbreak of a novel infectious disease was detected. It manifested principally with severe acute respiratory distress, and pneumonia, [1] although many cases followed a mild course [2]. The pathogen was rapidly identified as a new species of coronavirus (severe acute respiratory syndrome coronavirus 2, SARS-CoV-2), and the disease named COVID-19 [3]. Global transmission of the virus followed and major outbreaks have been observed in Europe, beginning with Italy [4]. On the 31st Jan 2020 the first cases were identified in the UK [5]. This was initially managed using testing of suspected individuals in the community, contact tracing and isolation of affected cases. However this was successful only in delaying the spread of the disease and on 13th March 2020 the UK government moved towards a mitigation strategy reserving testing for hospital inpatients only [6]. Following this, a step wise implementation of social distancing measures were mandated by the government including voluntary self isolation of any symptoms & vulnerable people [7], a ban on non essential travel worldwide [8] and school closures [9]. Finally on 23rd March 2020 the government mandated that everyone apart from essential workers should stay at home and away from others [10], instituting a countrywide “lock-down”. Epidemiological studies conducted during the outbreak in China have provided us with a number of estimates of the parameters describing the virus’s spread through the population including a reproduction number between 2.24 and 3.58 [11] and a median incubation period of 5.1 days (credible interval 4.5 to 5.8) [12]. It is estimated that fewer than 2.5% of people will show signs before 2.2 days and 97.5% of people who will develop symptoms will have done so by 11.2 days after exposure [12]. We investigated the reproduction number of SARS-CoV-2 in the UK to determine whether there are any spatial or temporal patterns beyond those resulting from the imposition of social distancing measures, and to track the progression of the outbreak. This article summarizes the methodology and interpretation of national and regional estimates of the time varying reproduction number (*R**t*) of the SARS-CoV-2 outbreak in the United Kingdom. These estimates were provided to the Scientific Pandemic Influenza Group on Modelling (SPI-M) [13] and formed part of the weekly UK *R**t* estimates [14]. ## Methods This section describes the data sources, their processing and combination and our methodology for estimating the serial interval of SARS-CoV-2 infections, and calculation of *R**t* estimates. ### Data We integrate data from a variety of sources, both publicly available and provided to the Scientific Pandemic Influenza Group - Modelling (SPI-M) [13] by Public Health England (PHE) and the Defence Science and Technology Laboratory (DSTL) [15]. We use these data to estimate *R**t* and the exponential growth rate for the UK as a whole, for the four nations of the UK (CTRY), and for the seven NHS regions in England (NHSER). At the UK level, data are available on cases and deaths through the PHE coronavirus tracker [16]. The tracker publishes an overall number of cases and deaths in the UK on a daily basis. It also provides a regional breakdown of the 4 nations, which exclude tests performed in private laboratories (Pillar 2 tests). At the time of this analysis, historical time series of the UK level data was not made available though the PHE site, however this information was collected prospectively and curated by Tom White’s aggregated COVID 19 UK data github site [17]. Cumulative case counts from both the PHE headline UK figure and the combined sum of the 4 nations are compared as these numbers differ. Hospital admission data are available from NHS trusts across the UK via the DSTL, which in turn aggregate the situation reports (SitReps) provided by NHS hospital trusts. These are aggregated to UK level. Although referred to here as “hospital admission” it includes admissions of patients who are subsequently identified as COVID-19 cases in hospital, but identified in hospital, and patients with known COVID-19 who are then admitted to hospital. For the four devolved nations of the UK we used different data sources for each nation. In England we used data from the SPI-M provided line lists for cases and deaths in England. Cases are restricted to those processed in NHS labs (Pillar 1), and are available by date of specimen collection. For the other three nation states of the UK, as above, both historical cases and death data are aggregated from Public Health Wales [18] and Scotland’s [19] sites, and from Northern Ireland’s HSC site [20] respectively, using a time series retrieved from Tom White’s aggregated COVID 19 UK data github site [17] for Scotland, Wales and Northern Ireland. Death data from the 4 nations are provided via the CHESS data set [21]. Death data are subject to a weekly periodicity due to reporting delay over the weekend which is mitigated by using date of death rather than date of report. At the level of the NHS England regions, we take the case data from anonymised test result line listings. Mortality data for NHS England is provided by PHE in a canonical line list of deaths, and is used alongside the CHESS data set[21]. Admissions data is available from DSTL feeds, and ICU admissions from CHESS data set[21]. For England as a nation and for the NHS England regions we also have data available from triage telephone calls from NHS 111 and 999 services. The data on the calls made to 111 & 999 as well as the outcome of that call are provided as aggregate numbers broken down by age. For all data sources cumulative case figures are converted into daily incidence figures, and any data which is broken down by age, or by gender, are combined. In case and death data, the final 5 days of the time series are discarded to account for possible reporting delay. The resulting time series are analysed for outlying data points, which are more than 5 standard deviations away from the mean of the nearest 14 data points. Outlying or missing data are imputed from a linear interpolation of the logarithm of incidence figures, implemented in the R ‘forecast’ library [22], and the results are truncated to ensure no negative incidence figures. A smoothing function (a linear spline interpolation) is then fitted to the logarithm of incidence applied over a 7 day window [23]. This is needed as all the data sources have some degree of weekly periodicity regardless of source. ### Combination of data sources Our estimates are based on an aggregation of the various data sources described above. We generate four estimates of *R**t* based on single time series from each data stream: 1. deaths (EpiEstim/Deaths), 2. cases (EpiEstim/4NationsCases), 3. telephone triage (EpiEstim/Triage), 4. hospital admissions. The aggregation procedure is kept simple and combines multiple data sources when applicable using the mean. Any possible biases this introduces are consistent throughout the time series so that it does not affect relative changes and hence either estimates of *R**t* or growth rates. The resulting time series are manually inspected for consistency and to check there are not abrupt changes in the data streams. The source of the combined data sets and more information about the processing steps used is shown in Supplemental table 1. View this table: [Table 1:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T1) Table 1: Estimates of the value of *R**t* in the UK on 4th July 2020 ### The serial interval of SARS-CoV-2 infections, *R**t*, and time delay of estimates The serial interval and the generation interval are closely related measures. The generation interval is a measure of the time taken for an infection to pass from one person (infector) to another (infectee) in a chain of transmission. The serial interval, on the other hand, is a measure of the time between the appearance of clinical symptoms in the infector and infectee. The generation interval cannot be observed directly as both infection events are only detectable once the virus has incubated and become symptomatic. The serial interval is often used as an observable proxy for the generation interval. The use of the serial interval as a proxy for the generation interval is known to produce biased estimates for the reproduction number [24,25], but has the advantage of being directly observed through contact tracing. Assumptions about the serial interval of SARS-CoV-2 have an impact on the absolute level of our estimates of *R**t*. We have used multiple approaches to estimate the serial interval. Firs, a UK specific serial interval was calculated from early case tracking data (the FF100 case data provided by DSTL). Second, a literature review was conducted and the serial intervals from a range of sources [4,26–35] pooled [36]. The serial interval can be well described by a truncated empirical distribution, with a mean plus 95% credible interval of 5.59 days (5.09; 6.20), and a standard deviation of 4.15 days (3.94; 4.46). This uncertainty in the serial interval distribution is propagated to our estimates of *R**t* and used to determine confidence intervals. In other work we have analysed the effect of the pragmatic use of serial interval instead of generation interval and find the effect on estimates of Rt to be small (∼5% of the absolute value) when it is close to one. This choice does not influence the estimate of time when Rt transitions from growth to decay [36]. Using the inferred serial interval distribution, we analysed the time series data using forward equation method [37], implemented in the EpiEstim R library [37–39], to estimate the time varying reproduction numbers during the outbreak. The renewal equation method is predicated on a time series of infections, and on the infectivity profile - a measure of the probability that a secondary infection occurred on a specific day after the primary case, given a secondary infection occurred [37]. A Bayesian framework is then used to update a prior probabilistic estimate of *R**t* on any given day with both information gained from the time series of infections in the epidemic to date and the infectivity profile to produce a posterior estimate. In both the original [38] and revised [39] implementations of this method, the authors acknowledge the pragmatic use of the serial interval distribution, as a proxy measure for the infectivity profile, and the incidence of symptom onset or case identification as a proxy for the incidence of infection, with the caveat that these introduce a time lag into the estimates of *R**t*. We make the simplifying assumption that there is negligible mixing of populations between each geographical area and treat each location independently. The method uses a sliding time window during which the instantaneous reproduction number is assumed to be constant. We used both a 7 day and a 28 day sliding window for calculations of the time varying reproduction number which provides two estimates with alternative trade offs between noise and loss of detail. Our *R**t* estimate is calculated using a loosely informed prior estimate of *R**t* as a gamma distribution with mean of 1 and standard deviation 2. This prior distribution is based on an assumption of the approximate value of *R**t* on the conditions following lock-down, rather than reflecting values of *R*** commonly described in the literature [11] as those are based on the situation without social distancing measures in place. Other events in the timeline of an infection also serve as a proxy for observations of infections in the past including positive testing, hospitalization for severe disease, or death. Although best practice is to calculate *R**t* using a generation interval distribution and infection events [25], neither the generation interval distribution nor the infection event data can be directly observed, and inferring them can also introduce potential bias and uncertainty. As a pragmatic initial step, we use the serial interval distribution described above in lieu of the generation interval, and the various observations available to us, including triage contacts, cases, admissions and deaths as a proxy of prior transmission events, and compare those results. There is a time delay between infection, symptom onset, case identification, hospital admission and ultimately death, and this affects the timing of our estimation of *R**t*. In a separate analysis we estimated these time delays (shown in Supplemental table 2) and apply these estimates as a correction to the time of our estimates of *R**t* to align them to the date of presumed infection. View this table: [Table 2:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T2) Table 2: Estimates of Mean *R**t* and 95% confidence intervals for the individual countries in the United Kingdom, based on cases, deaths and hospital admissions on the 4th July 2020 Code and processed data involved in this analysis are available on GitHub [40]. ## Results ### UK Overview Our estimate of the median value of *R**t* based for the United Kingdom based on cases, deaths and hospital admissions from the 4th July 2020 is presented in Table 1. In Figure 1A, we show the incidence per million people of cases, deaths, and hospital admissions due to COVID-19 in the UK over the outbreak. Panel B shows the associated values for *R**t*. The 3 data sources show similar patterns of exponential increase, with admissions and deaths lagging cases followed by a slower phase of exponential decline, beginning 2-3 weeks after the lock-down. In Figure 1B we see the estimates of *R**t* corrected to date of infection. There is a prominent single initial peak in *R**t* from admissions and deaths in late February, followed by a decline over the course of the next few months. *R**t* crosses 1 at the beginning of April, after which it remains below 1 during the lock-down period. Estimates based on cases show a biphasic pattern with an initial peak in mid February and a second smaller peak in early March, follow the same pattern as other estimates. The peak values of *R**t* vary by observation with peak *R**t* by admissions being 8.3, by cases 4.8 and by deaths 3.3. These are within the confidence limits of estimates described in other countries [11]. ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/11/13/2020.04.13.20062760/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/F1) Figure 1: Timeline of cases and estimates of *R**t* based on cases reported by PHE and NHS labs (green), deaths reported in NHS trusts (red) and best available data for hospital admissions (blue). A) Number of cases (Pillar 1), deaths and admissions per million. B) estimates of *R**t* Red points are either missing values or identified as anomalies and replacements imputed. ### Countries in the UK Estimates of *R**t* in the different countries of the UK, based on cases, deaths, hospital admissions or triage calls for the 4th July 2020 are presented in Table 2 and Figure 2. Triage figures based on 111 & 999 coronavirus pathway calls were available for England only; we do not have access to the full time series of all information for all countries. We show estimates based on a 28 day rolling window as cases and deaths in Northern Ireland, and Scotland, have fallen to a level which makes estimates over a shorter window unreliable. With insufficient information the Bayesian method used reverts to the prior value of *R* supplied which we set to 1. These estimates show the median value of *R**t* is below 1 for all four nations, using all data sources. The pattern in all nations is similar with *R**t* rapidly decreasing following lock-down on the 23rd March and becoming less than one in early April. Northern Ireland and Scotland have maintained a lower *R**t* for a longer period of time than England and Wales, with *R**t* values in Northern Ireland and Scotland at or below 0.75 for much of May, June and July compared to those in England and Wales which have been between 0.75 and 1 for the same period. ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/11/13/2020.04.13.20062760/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/F2) Figure 2: The median value of *R**t* and 95% confidence intervals for the individual countries in the United Kingdom, based on cases, deaths and hospital admissions, and a 28 day rolling window. For each data source, red points represent data points that were missing and have been imputed ### *R**t* by England NHS region In Figure 3 (and, for completeness, Supplemental table 3) we present the same results as above but with a focus on the NHS regions in England. On the 4th July 2020 the estimates of the mean of *R**t* were largely below 1 in the individual NHS regions. The observed count of different observations demonstrates a clear regional difference between London and the South West and the rest of the country with lower rates of all indicators than other regions. The time series of regional estimates of *R**t* reflect the overall patterns of England, albeit with more volatility due to the different windowing size between Figure 2 and Figure 3. It is however possible to see higher levels of uncertainty in the South West reflecting the smaller case numbers and to a lesser extent the same in London. View this table: [Table 3:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T3) Table 3: Estimates of the burden of disease before and after lock-down in the different NHS regions. ![Figure 3:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/11/13/2020.04.13.20062760/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/F3) Figure 3: The median value of *R**t* and 95% confidence intervals for the sub-national regions of NHS England, based on cases, deaths and hospital admissions, and a 7 day rolling window. ### *R**t* differences from England baseline In Figure 4 we plot the absolute difference of *R**t* in the seven NHS England administrative regions, from the *R**t* of England overall, as a baseline for each data source. This is based on 28 day window estimates due to the volatility observed in Figure 3. This analysis highlights the regional differences in *R**t* over time. Over the period of the lock-down, the East of England, Midlands and South East regions approximately tracked the England baseline. The South West was seen to initially following the national average but from June demonstrates a trend towards a lower value, although our confidence in these estimates is low. When lock-down started, London was initially well below the England average, by most indicators, and this continued until some point in late May, after which the trend reversed. On the other hand the North West, and less so North East & Yorkshire were consistently above the rest of the country until late May when they eventually reached the England average, and since mid June the *R**t* estimates in the North West have been lower than this average. The impact this variation has had on case loads is seen in Table 3 which shows the pre and post lock-down rates of newly identified cases in each NHS region. ![Figure 4:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/11/13/2020.04.13.20062760/F4.medium.gif) [Figure 4:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/F4) Figure 4: The difference of *R**t* estimates for NHS regions and baseline *R**t* estimates for England, based on cases, deaths and hospital admissions, and a 28 day rolling window. ## Discussion In this paper, we estimated the reproduction number, *R**t*, for COVID-19 in the UK using multiple data sources over a range of geographies and time points, and the same methodology. We find that, after the initial peak, different data sources produce similar results, without evidence of systematic bias for estimates based on more immediate measures (cases), compared to those based on longer term measures (deaths). In the UK, *R**t* peaked in mid February 2020, coinciding with the end of the school holidays. We find that using different data sources affects the estimated size of peak burden although with considerable uncertainty. We find that *R**t* declined rapidly following lock-down on the 23rd March but did not reach the critical threshold (*R**t* less than one) separating growth and decline until early April. This delay is likely to be the combination of ongoing transmission within households, care homes and hospitals, or outbreaks in factories in key industries such as food production. Other factors that may influence the timing are delays in case identification and reporting and limitations in the estimation methods. Subsequent to mid-April we estimate Rt largely remained below one until the end of the lock-down period. The multiple data sources we considered each have their advantages and drawbacks. Counts of test-positive cases and telephone triage calls provide a rapid indication of infection risk and capture a broad representation of age groups, but may be influenced by changes in behaviour and testing policy. In the UK, initial attempts at community tracing were abandoned when case numbers started to outstrip test availability and afterwards testing was only performed on hospital admissions for suspected COVID-19 [6]. Later, test capacity was increased and the policy reversed to include more community cases, again altering the nature of the population being tested. Although a regional breakdown of testing capacity was not available at the time of this analysis, we do know that capacity was exceeded in the early phase of the epidemic, and this is one reason why case based Rt estimates must be interpreted with caution until the middle of April. Hospital admissions and death data are less subject to changes in sampling strategy, although are subject to reporting delays and biases in ascertainment. As COVID-19 mortality is overwhelmingly in the elderly, statistics based on deaths mainly represent older groups. Due to reduced contact in the elderly we propose the outbreak took longer to become established in those age groups. Counts of admissions may be unreliable when there is delay in identifying a COVID-19 case, or when there is significant hospital transmission, as was the case in the early outbreak. We took a pragmatic approach for sub-national analyses, based on data availability. In early April 2020, immediately following lock-down, estimates of *R**t* were lowest in London and highest in the North West and North East of England. At this time, the burden of cases was highest in London at nearly 50 cases per million people per day, but this reduced 25 fold over the lock-down period. In early July 2020, at the relaxation of lock-down, the case rates per capita were lowest in the South West, due to the smaller initial outbreak size, followed by London due to the larger impact of social distancing. However the benefit of lower case rates in London were offset by relative increases in *R**t*. When case numbers are low *R**t* ceases to be a uniform statistic over a geographical area, because significant town to town variation will exist as clusters of infection become apparent. This was seen in South West of England towards the end of the lock-down where increasing variability in regional estimates of *R**t* became less obviously significant. This reinforces the point that *R**t* is a relative measure and should not be interpreted without information about the incidence rate. Our approach has a number of limitations. Our method treats the whole system and each region as independent and isolated. In reality, regions are connected via travel, although this was reduced during early social distancing measures in March and April. Furthermore, in the early phases, importation rates were high [41], and this would lead to our approach overestimating the true *R**t*. ## Conclusion We present a description of the methodology and data sources used in providing estimates of *R**t* in the UK for SPI-M [14]. Our approach is pragmatic and designed to produce timely, useful information to policy makers. Despite this, we find that using a number of data sources and careful interpretation helps elucidate the regional differences in *R**t* and shows they existed from the outset of lock-down and persisted during lock-down. Due to compound nature of *R**t* the result of this variation is higher case loads in Northern regions in the UK exiting lock-down. As we move forward early detection and prevention of spread of emerging clusters SARS-CoV-2 infections is critical to prevent large scale outbreaks. This will be challenging as the long incubation period and high rate of asymptomatic individuals makes undetected rapid spread easy. Prediction at a more localised level is needed to focus both community testing and more targeted social interventions on high risk areas in the future. Critical to this work continues to be rapid access to information about the spread of SARS-CoV-2 in the community both with high spatial resolution, but also with a short time lag from infection to observation. As test and trace activities ramp up we expect to see similar biases in case related data as testing volumes change locally in response to outbreaks. Our existing approaches to estimating *R**t* principally use hospital based metrics and as such may not provide the perspective on the outbreak that is needed in the future. Telephone triage data is one potential source of information about local outbreaks, and an area for future investigation [42]. ## Data Availability All data is public domain and accessible through GitHub [https://github.com/terminological/uk-covid-datatools/vignettes/current-rt](https://github.com/terminological/uk-covid-datatools/vignettes/current-rt) ## Supplementary material View this table: [Supplemental table 1:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T4) Supplemental table 1: The combined data sources View this table: [Supplemental table 2:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T5) Supplemental table 2: Estimated correction of *R**t* offset based on incubation period and observation delay View this table: [Supplemental table 3:](http://medrxiv.org/content/early/2020/11/13/2020.04.13.20062760/T6) Supplemental table 3: Estimates of mean values of *R**t* and 95% confidence intervals for the sub-national regions of NHS England, based on 111 calls, cases, deaths and hospital admissions on the 4th July 2020 ## Footnotes * Competing interests: Support for RC and KTA’s research is provided by the EPSRC via grant EP/N014391/1, RC is also funded by TSFT as part of the NHS Global Digital Exemplar programme (GDE); no financial relationships with any organisations that might have an interest in the submitted work in the previous three years, no other relationships or activities that could appear to have influenced the submitted work. * Funding: RC and KTA gratefully acknowledges the financial support of the EPSRC via grant EP/N014391/1 and NHS England, Global Digital Exemplar programme. LD and KTA gratefully acknowledges the financial support of The Alan Turing Institute under the EPSRC grant EP/N510129/1. LL acknowledges the financial support of the EPSRC via Early Career Fellowship EP/P01660X/1. LD and EBP are supported by Medical Research Council (MRC) (MC/PC/19067). EBP was partly supported by the NIHR Health Protection Research Unit in Behavioural Science and Evaluation at University of Bristol, in partnership with Public Health England (PHE). * Post peer review. * Received April 13, 2020. * Revision received November 13, 2020. * Accepted November 13, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.Huang C et al. 2020 Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395, 497–506. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)30183-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 2. 2.Wu Z, McGoogan JM. 2020 Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention. JAMA 3. 3.In press. Naming the coronavirus disease (COVID-19) and the virus that causes it. See [https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it](https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/naming-the-coronavirus-disease-(covid-2019)-and-the-virus-that-causes-it). 4. 4.Cereda D et al. 2020 The early phase of the COVID-19 outbreak in Lombardy, Italy. See [http://arxiv.org/abs/2003.09320](http://arxiv.org/abs/2003.09320) (accessed on 20 August 2020). 5. 5.Moss P, Barlow G, Easom N, Lillie P, Samson A. 2020 Lessons for managing high-consequence infections from first COVID-19 cases in the UK. Lancet 395, e46. 6. 6.Public Health England. 2020 COVID-19: Investigation and initial clinical management of possible cases. GOV.UK. See [https://www.gov.uk/government/publications/wuhan-novel-coronavirus-initial-investigation-of-possible-cases/investigation-and-initial-clinical-management-of-possible-cases-of-wuhan-novel-coronavirus-wn-cov-infection](https://www.gov.uk/government/publications/wuhan-novel-coronavirus-initial-investigation-of-possible-cases/investigation-and-initial-clinical-management-of-possible-cases-of-wuhan-novel-coronavirus-wn-cov-infection). 7. 7.Public Health England. 2020 COVID-19: Guidance on social distancing and for vulnerable people. GOV.UK. See [https://www.gov.uk/government/publications/covid-19-guidance-on-social-distancing-and-for-vulnerable-people](https://www.gov.uk/government/publications/covid-19-guidance-on-social-distancing-and-for-vulnerable-people). 8. 8.Foreign, Commonwealth Office. 2020 Travel advice: Coronavirus (COVID-19). GOV.UK. See [https://www.gov.uk/guidance/travel-advice-novel-coronavirus](https://www.gov.uk/guidance/travel-advice-novel-coronavirus). 9. 9.Department for Education. 2020 Closure of educational settings: Information for parents and carers. GOV.UK. See [https://www.gov.uk/government/publications/closure-of-educational-settings-information-for-parents-and-carers](https://www.gov.uk/government/publications/closure-of-educational-settings-information-for-parents-and-carers). 10. 10.Cabinet Office. 2020 Full guidance on staying at home and away from others. GOV.UK. See [https://www.gov.uk/government/publications/full-guidance-on-staying-at-home-and-away-from-others](https://www.gov.uk/government/publications/full-guidance-on-staying-at-home-and-away-from-others). 11. 11.Zhao S et al. 2020 Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. Int. J. Infect. Dis. 92, 214–217. (doi:10.1016/j.ijid.2020.01.050) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2020.01.050&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32007643&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 12. 12.Lauer SA, Grantz KH, Bi Q, Jones FK, Zheng Q, Meredith HR, Azman AS, Reich NG, Lessler J. 2020 The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application. Ann. Intern. Med. 13. 13.In press. Scientific Pandemic Influenza Group on Modelling (SPI-M). GOV.UK. See [https://www.gov.uk/government/groups/scientific-pandemic-influenza-subgroup-on-modelling](https://www.gov.uk/government/groups/scientific-pandemic-influenza-subgroup-on-modelling) (accessed on 8 July 2020). 14. 14.In press. The R number and growth rate in the UK. GOV.UK. See [https://www.gov.uk/guidance/the-r-number-in-the-uk](https://www.gov.uk/guidance/the-r-number-in-the-uk) (accessed on 19 August 2020). 15. 15.In press. Defence Science and Technology Laboratory. GOV.UK. See [https://www.gov.uk/government/organisations/defence-science-and-technology-laboratory](https://www.gov.uk/government/organisations/defence-science-and-technology-laboratory) (accessed on 19 August 2020). 16. 16.Public Health England. In press. Total UK COVID-19 Cases Update. See [https://www.arcgis.com/apps/opsdashboard/index.html#/f94c3c90da5b4e9f9a0b19484dd4bb14](https://www.arcgis.com/apps/opsdashboard/index.html#/f94c3c90da5b4e9f9a0b19484dd4bb14). 17. 17.White T. In press. Covid-19-uk-data. Github. 18. 18.In press. Public Health Wales - Latest Statement. See [https://covid19-phwstatement.nhs.wales/](https://covid19-phwstatement.nhs.wales/). 19. 19.In press. Coronavirus in Scotland - gov.scot. See [https://www.gov.scot/coronavirus-covid-19/](https://www.gov.scot/coronavirus-covid-19/). 20. 20.In press. COVID-19 Surveillance Reports | HSC Public Health Agency. See [https://www.publichealth.hscni.net/publications/covid-19-surveillance-reports](https://www.publichealth.hscni.net/publications/covid-19-surveillance-reports). 21. 21.In press. Coronavirus (COVID-19): Using data to track the virus - Public health matters. See [https://publichealthmatters.blog.gov.uk/2020/04/23/coronavirus-covid-19-using-data-to-track-the-virus/](https://publichealthmatters.blog.gov.uk/2020/04/23/coronavirus-covid-19-using-data-to-track-the-virus/) (accessed on 8 July 2020). 22. 22.In press. Forecasting Functions for Time Series and Linear Models. See [https://pkg.robjhyndman.com/forecast/](https://pkg.robjhyndman.com/forecast/) (accessed on 8 July 2020). 23. 23.Loader C, Sun J, Technologies L, Liaw A. 2020 Locfit: Local Regression, Likelihood and Density Estimation. See [https://CRAN.R-project.org/package=locfit](https://CRAN.R-project.org/package=locfit). 24. 24.Britton T, Scalia Tomba G. 2019 Estimation in emerging epidemics: Biases and remedies. J R Soc Interface 16. (doi:10.1098/rsif.2018.0670) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1098/rsif.2018.0670&link_type=DOI) 25. 25.Gostic KM et al. 2020 Practical considerations for measuring the effective reproductive number, Rt. medRxiv, 2020.06.18.20134858. (doi:10.1101/2020.06.18.20134858) [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4wNi4xOC4yMDEzNDg1OHYzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMTEvMTMvMjAyMC4wNC4xMy4yMDA2Mjc2MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 26. 26.Bi Q et al. 2020 Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: A retrospective cohort study. The Lancet Infectious Diseases 20, 911–919. (doi:10.1016/S1473-3099(20)30287-5) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(20)30287-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32353347&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 27. 27.Du Z, Xu X, Wu Y, Wang L, Cowling BJ, Meyers LA. 2020 Serial Interval of COVID-19 among Publicly Reported Confirmed Cases. Emerg Infect Dis 26, 1341–1343. (doi:10.3201/eid2606.200357) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid2606.200357&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32191173&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 28. 28.Ganyani T, Kremer C, Chen D, Torneri A, Faes C, Wallinga J, Hens N. 2020 Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance 25, 2000257. (doi:10.2807/1560-7917.ES.2020.25.17.2000257) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2807/1560-7917.ES.2020.25.17.2000257&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=7201952&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 29. 29.Li Q et al. 2020 Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia. N. Engl. J. Med. 30. 30.Nishiura H, Linton NM, Akhmetzhanov AR. 2020 Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis. 93, 284–286. (doi:10.1016/j.ijid.2020.02.060) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijid.2020.02.060&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32145466&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 31. 31.Tindale LC et al. 2020 Evidence for transmission of COVID-19 prior to symptom onset. eLife 9, e57149. (doi:10.7554/eLife.57149) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7554/eLife.57149&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32568070&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 32. 32.Xu X, Liu X, Wang L, Ali ST D. Z, Bosetti P, Cowling BJ, Wu Y. 2020 Household transmissions of SARS-CoV-2 in the time of unprecedented travel lockdown in China. 33. 33.You C et al. 2020 Estimation of the time-varying reproduction number of COVID-19 outbreak in China. International Journal of Hygiene and Environmental Health 228, 113555. (doi:10.1016/j.ijheh.2020.113555) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijheh.2020.113555&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 34. 34.Zhang J et al. 2020 Evolving epidemiology and transmission dynamics of coronavirus disease 2019 outside Hubei province, China: A descriptive and modelling study. The Lancet Infectious Diseases 20, 793–802. (doi:10.1016/S1473-3099(20)30230-9) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1473-3099(20)30230-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32247326&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 35. 35.Zhao S et al. 2020 Estimating the serial interval of the novel coronavirus disease (COVID-19): A statistical analysis using the public data in Hong Kong from January 16 to February 15, 2020. 36. 36.Challen R, Brooks-Pollock E, Tsaneva-Atanasova K, Danon L. 2020 Impact of uncertainty in serial interval, generation interval, incubation period and delayed observations in estimating the reproduction number for COVID 19. Statistical methods in medical research Pending. 37. 37.Cori A. In press. Estimate Time Varying Reproduction Numbers from Epidemic Curves [R package EpiEstim version 2.2-1]. 38. 38.Cori A, Ferguson NM, Fraser C, Cauchemez S. 2013 A New Framework and Software to Es-timate Time-Varying Reproduction Numbers During Epidemics. Am J Epidemiol 178, 1505–1512. (doi:10.1093/aje/kwt133) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwt133&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24043437&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 39. 39.Thompson RN et al. 2019 Improved inference of time-varying reproduction numbers during infectious disease outbreaks. Epidemics 29, 100356. (doi:10.1016/j.epidem.2019.100356) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.epidem.2019.100356&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F11%2F13%2F2020.04.13.20062760.atom) 40. 40.Challen R. 2020 Terminological/uk-covid-datatools: Zenodo initial release. Zenodo. (doi:10.5281/zenodo.4271338) [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.5281/zenodo.4271338&link_type=DOI) 41. 41.2020 Preliminary analysis of SARS-CoV-2 importation & establishment of UK transmission lineages. Virological. See [https://virological.org/t/preliminary-analysis-of-sars-cov-2-importation-establishment-of-uk-transmission-lineages/507/2](https://virological.org/t/preliminary-analysis-of-sars-cov-2-importation-establishment-of-uk-transmission-lineages/507/2) (accessed on 14 August 2020). 42. 42.Leclerc QJ, Nightingale E, Abbott S, CMMID COVID-19 Working Group, Jombart T. 2020 Analysis of temporal trends in potential COVID-19 cases reported through NHS Pathways England.