Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

SARS-CoV-2 under an elimination strategy in Hong Kong

View ORCID ProfileHaogao Gu, Ruopeng Xie, View ORCID ProfileDillon C. Adam, Joseph L.-H. Tsui, Daniel K. Chu, Lydia D.J. Chang, Sammi S.Y. Cheuk, Shreya Gurung, Pavithra Krishnan, Daisy Y.M. Ng, Gigi Y.Z. Liu, Carrie K.C. Wan, Kimberly M. Edwards, Kathy S.M. Leung, View ORCID ProfileJoseph T. Wu, Dominic N.C. Tsang, View ORCID ProfileGabriel M. Leung, View ORCID ProfileBenjamin J. Cowling, View ORCID ProfileMalik Peiris, View ORCID ProfileTommy T.Y. Lam, View ORCID ProfileVijaykrishna Dhanasekaran, View ORCID ProfileLeo L.M. Poon
doi: https://doi.org/10.1101/2021.06.19.21259169
Haogao Gu
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Haogao Gu
Ruopeng Xie
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
2HKU-Pasteur Research Pole, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dillon C. Adam
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dillon C. Adam
Joseph L.-H. Tsui
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel K. Chu
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lydia D.J. Chang
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sammi S.Y. Cheuk
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shreya Gurung
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pavithra Krishnan
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daisy Y.M. Ng
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gigi Y.Z. Liu
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carrie K.C. Wan
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kimberly M. Edwards
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
2HKU-Pasteur Research Pole, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kathy S.M. Leung
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3Laboratory of Data Discovery for Health, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joseph T. Wu
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3Laboratory of Data Discovery for Health, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Joseph T. Wu
Dominic N.C. Tsang
4Centre for Health Protection, Department of Health, The Government of Hong Kong Special Administrative Region, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gabriel M. Leung
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3Laboratory of Data Discovery for Health, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Gabriel M. Leung
Benjamin J. Cowling
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3Laboratory of Data Discovery for Health, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin J. Cowling
Malik Peiris
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
2HKU-Pasteur Research Pole, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
5Centre for Immunology & Infection, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Malik Peiris
Tommy T.Y. Lam
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
3Laboratory of Data Discovery for Health, Hong Kong Science and Technology Park, Hong Kong, China
5Centre for Immunology & Infection, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tommy T.Y. Lam
Vijaykrishna Dhanasekaran
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
2HKU-Pasteur Research Pole, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Vijaykrishna Dhanasekaran
  • For correspondence: veej@hku.hk llmpoon@hku.hk
Leo L.M. Poon
1School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
2HKU-Pasteur Research Pole, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
5Centre for Immunology & Infection, Hong Kong Science and Technology Park, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Leo L.M. Poon
  • For correspondence: veej@hku.hk llmpoon@hku.hk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Hong Kong utilized an elimination strategy with intermittent use of public health and social measures and increasingly stringent travel regulations to control SARS-CoV-2 transmission. By analyzing >1700 genome sequences representing 17% of confirmed cases from 23-January-2020 to 26-January-2021, we reveal the effects of fluctuating control measures on the evolution and epidemiology of SARS-CoV-2 lineages in Hong Kong. Despite numerous importations, only three introductions were responsible for 90% of locally-acquired cases, two of which circulated cryptically for weeks while less stringent measures were in place. We found that SARS-CoV-2 within-host diversity was most similar among transmission pairs and epidemiological clusters due to a strong transmission bottleneck through which similar genetic background generates similar within-host diversity.

One sentence summary Out of the 170 detected introductions of SARS-CoV-2 in Hong Kong during 2020, three introductions caused 90% of community cases.

Severe acute respiratory coronavirus 2 (SARS-CoV-2) emerged in late 2019 (1) and has caused over 170 million confirmed cases and over three million deaths worldwide (as of 1-July-2021) (2). Heterogeneity in disease severity (3-5) and high virus transmission rates (6, 7) necessitated extensive and diverse control strategies, which achieved varied degrees of success. Hong Kong (population 7.5 million) is among the few regions that have been relatively successful in prevention and control of community transmission using an elimination strategy (8), with intermittent public health and social measures combined with strict isolation of cases and quarantine of close contacts in designed facilities (9, 10). Increasingly stringent travel regulations have been implemented to limit importation of infections (Fig. 1). As of 1-July-2021, 11,855 laboratory confirmed cases have been detected resulting in 210 deaths. Using contact tracing data from January to April 2020 (waves one and two) we have shown that sustained community transmission in Hong Kong was largely driven by superspreading events, with 80% of community infection caused by 20% of cases (10). However, two large outbreaks have occurred in Hong Kong since this period (waves three and four). Here, we show that the three major surges in infections that occurred during waves two to four were the result of only three virus introductions (PANGO lineages (11) B.3, B.1.1.63, and B.1.36.27) out of a total of 170 captured through genome sequencing, resulting in 90.0% of the confirmed cases of SARS-CoV-2 observed in the community in Hong Kong. Using genomic data generated from travel-related (n=186) and community cases (n=1,713) during all four waves of the pandemic, comprising 51.4%, 21.1%, 23.6% and 13.7% of all cases in the corresponding period, respectively (figs. S1, S2 and S3, tables S1 and S2), we discuss the effects of intermittent use of public health and social measures on the evolution and epidemiology of SARS-CoV-2 in Hong Kong.

Introductions, local spread, and delayed detection during waves one and two

During wave one (23-January-2020 to 22-February-2020), laboratory-confirmed cases did not exceed 10 per day (Fig. 1 and table S1) and were comprised of both travel-related (n=17/70, 24.3%) and community-acquired cases (n=53/70, 75.7%). The first recognised introduction was detected on 30-January-2020 among a family cluster where the index case had returned from Wuhan, China on 22-January, while the first community case without a known source (travel history or contact with a confirmed case) was also reported on 30-January. However, among two of the earliest locally circulating lineages, time-scaled phylogenetic analysis estimated a median time to most recent common ancestor (tMRCA) of 30-December-2019 (95% Highest Posterior Density Interval (HPD) 24-December-2019 – 1-January-2020) indicating direct ancestry to cases circulating prior to the earliest recognised introductions (Fig. 2A). Critically, these lineages had no recognised epidemiological or phylogenetic link to all other imported cases identified and sampled at the time, indicating that the source of these lineages initially entered Hong Kong undetected and triggered sustained community transmission during the first wave. Though the tMRCA indicates introduction could have occurred as early as 24-December-2019, which would represent one of earliest examples of transmission outside mainland China, it cannot be proven as the lineage diversity observed may have first accumulated in mainland China and subsequently been imported closer to the detection of the earliest cases.

Fig. 1
  • Download figure
  • Open in new tab
Fig. 1 Epidemiological summary and time-scaled phylogeny of SARS-CoV-2 in Hong Kong.

The number of genomes sequenced in this study is shown below in light red. Time-scaled phylogeny of SARS-CoV-2 genomes from Hong Kong, in which lineages HK-wave3 and HK-wave4A were subsampled to 100 and 65 sequences, showing B.1.1.63* and B.1.36.27*, respectively. *denotes the real clade contains more than one PANGO lineage (see Materials and Methods and table S4). Red shaded bars below the timeline delineate five stringency levels of control measures in Hong Kong based on the Oxford COVID-19 Government response tracker (level 1: <40; level 2 : 40–50; level 3: 50–60; level 4: 60–70; level 5: >70) (12).

Fig. 2
  • Download figure
  • Open in new tab
Fig. 2 Descriptive and temporal dynamics of SARS-CoV-2 lineages in Hong Kong.

(A) Time to most recent common ancestor (tMRCA) among the five earliest locally circulating lineages of SARS-CoV-2 introduced into Hong Kong. (B) Number of SARS-CoV-2 genomic samples per lineage identified over time. Lineage size is ordered on a log10 scale, and plotted by the earliest confirmation date among the lineages. (C) Correlation between the detection lag of locally circulating lineages and the final lineage duration. Points represent a random sample of 1,000 lineages from a Bayesian posterior tree distribution (n=8,000). (D:E) Detection lag over time as a function of tMRCA across three epidemic periods (D) waves one and two, (E) wave three, (F) wave four. Overall, a significant reduction in detection lag was observed over time and across each epidemic wave. Points in each panel represent a random sample of 1,000 lineages from a Bayesian posterior tree distribution (n=8,000).

As SARS-CoV-2 was declared a global pandemic on 11-March-2020, Hong Kong experienced a substantial rise in travel-related cases (n=705/978, 72.1%, Fig. 1 and table S1) concomitant with large epidemics internationally (wave two). Introductions occurred mostly from outside of China (fig. S4), with a moderate increase observed in community cases (n=273/978, 27.9%). Again, the inferred common ancestry of community-linked lineages indicated circulation prior to or during early March 2020 (tMRCA), indicating a pattern of prolonged cryptic transmission that preceded increases in community spread (Fig. 2A). The largest of these local lineages comprised 92 genomes or 38.0% (n=92/242) of all sampled genomes during wave two and was classified as PANGO lineage B.3. This lineage was associated with a superspreading event from which contact tracing identified 106 community cases. Genomic analysis linked an additional 16 sporadic community cases, increasing the total inferred cluster size to 122 or 46.2% (n=122/326) of community cases during waves one and two.

Overall, during waves one and two, there were 38 lineages circulating in the community (out of a total of 61 unique imported clades). The median size for non-singleton local lineages was six cases, and the median duration of circulation was 10.5 days. Among local lineages without traced contact to an imported case, the median delay in lineage detection (time between lineage tMRCA and confirmation of the first detected case) was 25 days, though this was noticeably higher in January (median = 31 days), before significantly improving to eight days by early May 2020, likely related to early delays and subsequent improvements in case detection (Spearman’s test, rho (ρ) = -0.49, p < 0.001, Fig. 2D).

Travel measures and the suppression of overseas introductions

Following a peak of infections during 16–27 March 2020 (wave two; Fig. 1 and table S1), a rapid decline occurred as travel and community restrictions were expanded to include mandatory quarantine on arrival with strengthened testing, restrictions on non-resident entry, school closures, adjusted work arrangements, and bans on public gatherings (13). Travel restrictions began as early as 26-January-2020. First, all non-residents that visited Hubei province within two weeks were barred entry into Hong Kong. This was followed by a mandate for compulsory quarantine of passenger arrivals from regions affected with SARS-CoV-2, extending from mainland China to South Korea, Iran, Italy, and the Schengen region, and culminating in the barring of entry of non-residents during the peak of wave two in March. No community cases were reported from mid-April to 12-May-2020, and community measures were gradually relaxed to allow public gatherings of at most eight (from four) people with restricted opening of leisure venues (Fig. 1). Based on the Oxford COVID-19 Government response tracker (12), stringency of control measures reduced from level 4 to level 2 during this period (Fig. 1).

Reintroductions and local surge during waves three and four

The first large SARS-CoV-2 outbreak in Hong Kong occurred from July to September 2020 (wave three), resulting in >4,000 cases (table S1). With a predominant number of cases in the community (n=3,385/4,032, 84.0%), this third epidemic wave was preceded by a period of increased detection of travel-related cases during July 2020 (similar to waves one and two, Fig. 2B). While importations during wave two were predominantly from Europe, subsequent cases were mostly from Asia (fig. S4, table S3 and data S1). The number of laboratory-confirmed cases continued to rise, reaching a peak of >120 cases per day in late July (Fig. 1). Following the implementation of increasingly stringent public health and social distancing measures, the local epidemic subsided in September. However, beginning in early November a second resurgence (termed wave four) occurred, resulting in >6000 additional cases. Wave four peaked in December 2020 and slowly declined towards zero daily cases by April 2021.

Sequencing identified 170 virus lineages belonging to 71 PANGO lineages in Hong Kong within one year (Fig. 2 and data S2). However, 87.0% of those lineages were detected only in travel-related cases or one community case, and no variants of concern were detected in the community during waves three and four. Notably, a single introduction belonging to lineage B.1.1.63 resulted in over 90.0% of genomes sampled during wave three (92.4%, 881/953), forming a HK-wave3 clade (n=902) (also see fig. S1 and table S4). However, the earliest imported cases of the HK-wave3 clade were not sampled, indicating cryptic transmission prior to detection. In a similar trend, two lineages led to most of the fourth wave cases: over 74% genomes (552/704) formed one clade (HK-wave4A, B.1.36.27, table S4), and 4.7% (33/704) formed another (HK-wave4B, B.1.36).

By comparing the tMRCA of local non-singleton clusters in Hong Kong (n=7, 6.4% (7/109)) (Fig. 2 and data S3), we identified two introductions (HK-wave3 and HK-wave4A) that circulated in the community for 108 and 128 days, respectively. The delay in detection of local non-singleton lineages remained low during waves three and four (mean=2.9 days, 95% HPD 0 - 13 days) (Fig. 2E, F), with a detection delay of 11 and 10 days for the two large clades, HK-wave3 and HK-wave4A, respectively (data S3). A negative correlation between delay over time from wave one to wave four (Spearman’s test, rho (ρ) = -0.72, p < 0.001), indicates improvements in identifying cases during 2020 (Figs. 2D, E and F). Interestingly, the cryptic circulation of HK-wave3 occurred during periods of reduced stringency (level 2), however HK-wave4A introduction occurred during the late stages of wave three when stringency level 4 was enacted, and continued to circulate cryptically when stringency level was reduced to level 2 on presumed suppression of the second wave. Similarly, HK-wave4B introduction occurred during relatively stringent level 4 (Fig. 1), while a significant positive correlation between increasing lag in lineage detection and lineage duration was observed (Spearman’s test, rho (ρ) = 0.70, p < 0.001, Fig. 2C).

Contrasting patterns of epidemiology during waves three and four

To understand the effects of public health and social measures on community transmission, we characterized the two largest clades that circulated during waves three and four and identified contrasting pattens of genomic evolution and underlying transmission dynamics. HK-wave3 emerged during a period of reduced community measures (stringency level 2) and was controlled by increasing stringency to level 4. By contrast, HK-wave4A emerged under stringency level 4 and circulated cryptically during reduced level 3. Although the stringency level was rapidly raised to 5 on detection, HK-wave4A continued to circulate for several months.

The effective reproductive number (Re) estimated using a birth-death skyline serial (BDSS) model (14) for 902 HK-wave3 genomes collected from all 18 districts in Hong Kong (5-July-2020 to 21-October-2020) showed that Re was significantly higher (∼3) from the tMRCA in late June or early July (mean tMRCA = 1-July-2020, 95% HPD, 26-June-2020 to 4-July-2020) until recognition of the lineage on 5-July when stringency level was 2 (Fig. 3), highlighting a period of rapid virus expansion when leisure venues were open and public gatherings of up to 50 people were allowed. Notably, as the control stringency was increased on 15-July (from level 2 to 4), Re subsided to ∼2 and subsequently decreased below ∼1. An increasing number of cases continued to be reported among close contacts in residential care homes for the elderly or disabled, households, hospitals, dormitories and workplaces, whereas just 12.6% of the genomes were attributable to social interactions (table S5). Phylogenies reveal a rapid termination of transmission lineages among close contacts, leading to disappearance of all but one sub-lineage that continued to circulate with Re >1 until extinction in October 2020. These results indicate suppression with level 4 stringency during wave three, complimented by aggressive contact tracing, resulted in the elimination of the majority of wave three transmission chains despite the intermittent rise in cases among close contacts.

Fig. 3
  • Download figure
  • Open in new tab
Fig. 3 Phylodynamics of waves three and four in Hong Kong.

Evolutionary relationships and effective reproduction number (Re(t)) of HK-wave3 (B.1.1.63) clade (purple) and HK-wave4A (B.1.36.27) clade (orange) estimated using tree-heights and sequenced incidence data. Purple and orange bars show the number of genomes by collection date. Black line shows the instantaneous effective reproduction number (Rt), estimated based on infection dates inferred from the reported dates of symptom onset (or dates of confirmation for asymptomatic cases).

HK-wave4A (mean tMRCA = 6-September-2020, 95% HPD 5–7 September-2020) continued to circulate for several months despite level 5 stringency (Fig. 3). In contrast to HK-wave3, a rapid expansion did not occur upon emergence. Instead, the HK-wave4A dynamics resembled the pattern of a propagated epidemic curve, with Re increasing with each subsequent peak during September and November, achieving a high of Re ∼3 during mid to late November; Re fluctuated around 1 following this period. The structure of the phylogenetic tree suggests continual elimination of viral lineages with intermittent expansion coinciding with a superspreading event involving 28 dancing and singing venues in different districts of Hong Kong, resulting in a group of 732 cases epidemiologically linked to this superspreading event. Epidemiological data shows that up to 23.6% of genomes during this wave were related to social clusters (table S5) which is significantly higher than the HK-wave3 counterpart (p < 0.001, chi-square test). Human mobility levels inferred from Octopus, a smart card payment system used by >98% of the Hong Kong population aged 16 to 65 (15), showed adult and elderly mobility within Hong Kong resumed to pre-pandemic levels during the early stage of wave four in early to mid-November 2020 (∼100% of the average level during 1–15 January 2020) (fig. S5). Taken together, our results suggest that community measures reduce Re, however prolonged community measures decrease the probability of lineage termination due to fatigue arising in the population (16, 17).

The instantaneous effective reproductive numbers (Rt) of local cases are consistent with Re of the two major transmission lineages during waves three and four (Fig. 3). During the short period of lineage co-circulation (Fig. 3), the relative reproductive number of HK-wave3 estimated using a relative fitness inference framework (15) was higher (mean = 1.28, credible interval [CrI] 0.88-1.93) when compared with HK-wave4A, though the difference in transmissibility was not significant (CI across 1.0). However, these estimates of comparative transmissibility may be affected by stochasticity, such as the occurrence of superspreading events as revealed through sequencing.

Within-host genetic variation determined by genetic background

By analyzing deep-sequence data from confirmed donor and recipient pairs using a beta-binomial statistical framework (18) we estimated the number of virions required to initiate infection was between one to three (Fig. 4A, Materials and Methods and table S8). This is consistent with data from transmission pairs estimated in UK and Austria (one to eight in UK and one to three in Austria (19, 20)) as well as between cats (two to five virions (21)), showing SARS-CoV-2 transmission bottleneck may be universally small. To understand the effects of strong transmission bottlenecks on within-host diversity, we estimated Jaccard distances between sample pairs and found that minor variants were not significantly different between established transmission pairs and epidemiologically clustered samples (Fig. 4B) but were significantly different between samples without epidemiological links. These results show that the SARS-CoV-2 within-host genetic variation is non-random and determined by genomic differences (i.e. consensus sequence), shedding new light on SARS-CoV-2 evolution.

Fig. 4
  • Download figure
  • Open in new tab
Fig. 4 Transmission bottleneck and mutation profiles between cases.

(A) Proportion of shared mutations and transmission bottleneck size estimations (with 95% confidence intervals) between samples from established transmission pairs. The estimates for transmission pairs fam_562 and fam_730 are not available due to a limited number of SNVs in the recipients’ samples. (B) Jaccard distance of the major and minor SNVs between different types of sample pairs. The distribution is shown in both boxplot and density plot. The between-group differences were tested by Wilcoxon tests, and p values < 0.05 are shown (p values < 0.001 are labelled as ***).

Conclusions

Sequencing allowed us to investigate the introduction and circulation pattens of SARS-CoV-2 transmission lineages under the elimination strategy in Hong Kong. Border control measures averted numerous introductions, and community outbreaks were typically associated with cryptic virus circulation during less stringent periods and expansion through superspreading events. Heightened control measures eliminated most transmission chains in the local community, but the proportion of social transmission during wave four was significantly higher than that of wave three, indicating control was less effective when prolonged. Although public health and social measures were promptly lifted when community cases could be traced and controlled, new waves continued as a result of new introductions rather than resurgence of previously circulating viruses. This shows that contact tracing was efficient, but averting outbreaks from new introductions requires heightened border control and enhanced community surveillance during periods of lower control level stringency.

Data Availability

Hong Kong SARS-CoV-2 genome sequences and associated metadata is deposited at gisaid.org. All anonymized data, sequence accession numbers, code, and analysis files are available in supplementary materials and in GitHub repository (https://github.com/hku-sph-covid-19-genomics-consortium/hk-sars-cov-2-genomic-epidemiology).

https://github.com/hku-sph-covid-19-genomics-consortium/hk-sars-cov-2-genomic-epidemiology

Funding

Health and Medical Research Fund, Food and Health Bureau of the Hong Kong SAR Government COVID190118 (LLMP)

Collaborative Research Fund of the Research Grants Council of the Hong Kong SAR Government C7123-20G (BJC)

National Institutes of Health grant U01AI151810 (LLMP)

National Institutes of Health grant HHSN272201400006C (JSMP, LLMP, BJC, DV)

National Institutes of Health grant 75N93021C00016 (JSMP, LLMP, DV)

Author contributions

Study Design: LLMP, DV

Data curation: HG, RX, DCA, DNCT, KME

Genome sequencing group: LLMP, JSMP, DKC, HG, LDJC, SSYC, SG, PK, DYMN, GYZL, CKCW

Phylodynamic analysis group: DV, TTYL, DCA, HG, RX, KSML, JL-HT Visualization: HG, RX, DCA, TTYL, DV

Supervision: LLMP, DV, BJC, JSMP, TTYL, JTKW

Writing – original draft preparation: DV, RX, DCA, HG, TTYL

Writing – review & editing: LLMP, DV, DCA, TTYL, KME, JSMP, BJC, GML

Competing interests

BJC has consulted for Roche, Sanofi Pasteur, GSK, AstraZeneca and Moderna. The authors declare no other competing interests.

Data and materials availability

Hong Kong SARS-CoV-2 genome sequences and associated metadata is deposited at gisaid.org. All anonymized data, sequence accession numbers, code, and analysis files are available in supplementary materials and in GitHub repository (https://github.com/hku-sph-covid-19-genomics-consortium/hk-sars-cov-2-genomic-epidemiology).

Supplementary Materials

Materials and Methods

SARS-CoV-2 data from Hong Kong

De-identified saliva or nasopharyngeal samples positive for SARS-CoV-2 by real time-polymerase chain reaction (RT-PCR), along with epidemiological information including onset date, report date and contact history for individual cases were obtained from the Centre for Health Protection, Hong Kong.

Genomic sequencing of SARS-CoV-2

A total of 1,753 laboratory-confirmed samples were collected from 1,733 RT-PCR confirmed cases from 22-June-2020 to 26-January-2021. Virus genome was reverse transcribed with primers targeting different regions of the viral genome, published in (22). The synthesized cDNA was then subjected to multiple overlapping 2kb PCRs for full-genome amplification. PCR amplicons obtained from the same specimen were pooled and sequenced using Nova sequencing platform (PE150, Illumina). Sequencing library was prepared by Nextera XT. The base calling of raw read signal and demultiplexing of reads by different samples were performed using Bcl2Fastq (Illumina). A reference-based re-sequencing strategy was applied in analyzing the NGS data. Specifically, the raw FASTQ reads were assembled and mapped to the SARS-CoV-2 reference genome (Wuhan-Hu-1, GenBank: MN908947.3) using BWA mem2 (v.2.0pre2) (23). The consensus sequences for each sample were called as dominant bases at each position by samtools mpileup (v.1.11) (24) with minimum depth of 100 reads. Samples less than 27kb in length (excluding gaps) were excluded from downstream analysis. The head and tail 100nt bases of all generated consensus sequences were masked. We also masked another 10 sites located in PCR primer binding regions and observed to be variant (≥ 3% allele frequency) in 1% or more HK samples (table S9). The same masking strategy was also applied in phylodynamics analysis, variant calling, and bottleneck estimation. The average sequencing depth (number of mapped reads) at each nucleotide position that was retained ranged from ∼10,000 to ∼100,000 (fig. S6).

There were 16 patients from which samples were collected or sequenced at multiple time points. Twelve samples from 6 patients were sequenced in duplicate, and 21 samples from 10 patients were collected sequentially. One representative sample for each of the 16 patients was selected based on genome coverage and average sequencing depth. In total, 1,601 representative samples met quality control standards. All 1,601 consensus sequences from Hong Kong, as well as 298 additional consensus sequences from the first two waves were included in the phylogenetic analysis. Sequences from regions outside Hong Kong were retrieved from the GISAID database (total 399,124 sequences, accessed 16-February-2021, detailed accession numbers and acknowledgement information in data S4).

Phylogenetic analysis of SARS-CoV-2 in Hong Kong

Hong Kong sequences were analysed with a global SARS-CoV-2 genome alignment obtained from GISAID (accessed 16-February-2021). For each Hong Kong sequence, the three most similar global sequences (evaluated by p distance excluding gaps, n = 385), as well as the earliest sampled sequence (n = 1,279) from each PANGO lineage (accessed 07-May-2021) (11) were selected. After removing repetitive sequences and trimming masked sites, data quality was evaluated using a root-to-tip regression analysis in TempEst (v.1.5.3) (25), resulting in a final set of 3,437 sequences. Maximum likelihood (ML) phylogenies were estimated using IQ-TREE (v.2) (26), employing the best-fit nucleotide substitution model with Wuhan-Hu-1 (GenBank: MN908947.3) as the outgroup and dated by least square dating (LSD2) (27). Branch support was estimated using ultrafast bootstrap approximation (UFBoot) and SH-like approximate likelihood ratio test (SH-aLRT), and for nodes of interest with <50% support, we examined their stability through multiple iterative runs using the best-fit nucleotide substitution model. Internal branches with zero-length were preserved for dating by setting parameter l as -1. SARS-CoV-2 sequences from Hong Kong were classified based on the dynamic PANGO nomenclature system (https://github.com/cov-lineages/pangolin, v.2.3.9, 23-April-2021) (11) and confirmed using a ML analysis.

Phylodynamics of Hong Kong waves

To identify monophyletic clusters of SARS-CoV-2 lineages in Hong Kong, Bayesian molecular clock phylogenetic analysis pipeline proposed by Plessis et al. (28) were implemented. In this study, the data tree and starting tree were applied to a ML tree generated by IQ-TREE (v.2) (26) with LSD2 (27). Time-scaled phylogenies were generated using the strict clock model with 0.001 substitutions per site per year which is within 95% credible interval of SARS-CoV-2 temporal signal (29), the Skygrid model (30) with 61 grid points and a Laplace root-height prior with mean equal to the dated-ML tree estimated by IQ-TREE (v.2) (26) and scale is set to 20% of mean. To improve computational efficiency, two largest local monophyletic clades in wave three (HK-wave3, n = 902) and wave four (HK-wave4A, n = 552) from ML tree were subsampled to 100 and 65 sequences by 5 earliest cases, 5 latest cases and 10% of the remaining randomly selected, respectively. We ran nine MCMC chains of 100 million, sampling every 1,000 steps and discarding 10% as burn-in. As there are no collapsed internal branches in this study, only uncertainty in branch durations were estimated by MCMC. From the approach described in Geoghegan et al. (31), we used the R package “NELSI” (32) to identify classify all monophyletic lineages, including singletons, and to estimate the delay in lineage detection following importation as well as the duration of circulation, given a set of 8,000 posterior trees. It’s notable that there are two global sequences from Japan (EPI_ISL_591420 and EPI_ISL_721612) present in the HK-wave3 clade. However, these two Japan cases had travel history to the Philippines (similar to the early HK-wave3 imported cases), and were quarantined when landed in Japan, suggesting that they are unlikely to have caused an introduction in Hong Kong. These two cases were therefore excluded when defining the HK-wave3 clade.

For all samples of HK-wave3 and HK-wave4A, we used the birth-death skyline serial (BDSS) model (14) implemented in BEAST (v.2.6.3) (33) to infer the time of origin (tOrigin), time of most recent common ancestor (tMRCA) and temporal variations (piecewise fashion over 12-15 equidistant intervals) in the effective reproductive number denoted as Rt. A non-informative prior for tOrigin was used with a the lower bound set to 1-January-2020. The HKY + G4 nucleotide substitution model and an uncorrelated relaxed molecular clock model with lognormal rate distribution (UCLN) (34) were used. The sampling proportion was given a uniform distribution as prior with the upper bound at the empirical ratio of the number of sequences to the number of reported cases. MCMC chains were run for 600 million and 800 million steps and sampled every 2,000 and 10,000 steps for the lineages HK-wave3 (B.1.1.63) and HK-wave4A (B.1.36.27) respectively, with the initial 10% discarded as burn-in. This resulted in a final total of 270,000 and 72,000 sampled states. Mixing of the MCMC chain was inspected using Tracer (v1.7.1) (35) to ensure an effective sample size (ESS) of >200 for each parameter. Change in the effective reproductive number over time after the estimated tMRCA was plotted using R package “bdskytools” (https://github.com/laduplessis/bdskytools). Since by definition there are no sequences between tMRCA and the estimated tOrigin, the effective reproductive number (Re) was assumed to remain constant in this period. This assumption was incorporated in the default birth-death model using the package TreeSlicer in BEAST2.

Human mobility in Hong Kong using Octopus data

We used digital transactions made on Octopus cards, ubiquitously used by the Hong Kong population for daily public transport and small retail payments (https://www.octopus.com.hk/tc/consumer/index.html), to obtain changes in mobility during 2020–2021 among cards classified as children, students, adults and elderly (fig. S5).

Analysis of within-host genetic variation and transmission bottleneck

Deep sequencing SARS-CoV-2 samples in the United Kingdom and Austria has shown that the within-host genetic diversity (iSNV) is low with a narrow bottleneck during transmission (19, 20). Within-host genetic variation (iSNV) depends on intra-host virus evolution and transmission bottleneck size. To determine the baseline similarity of within-host genetic diversity, we prepared two sets of samples as controls: (a) six samples were sequenced in duplicate to account for uncertainty arising from sequencing; and (b) twenty-one samples collected from ten individuals. To examine the dynamics of mutations related to transmission events, we identified 13 transmission pairs (donor and recipient) that were directly linked with symptom onset varying by 1 to 7 days.

Single nucleotide variants (SNVs) in deep-sequence data were identified using three different variant callers, freebayes (v.1.3.2) (36), VarDict (v.1.82) (37) and LoFreq (v.2.15) (38). To attain a robust variant calling result, only SNVs detected by at least two different variant callers were analyzed in this study (8). SNVs with a minimum depth of 100 reads, minimum frequency of 3%, and detected by at least two different variant callers were retained for further analysis. Parameters and scripts for this pipeline are described in https://github.com/hku-sph-covid-19-genomics-consortium/hk-sars-cov-2-genomic-epidemiology. Gene annotations of the SNVs were based on table S10. To understand the uncertainty in iSNV due to the sequencing protocol, we sequenced six samples in duplicate. While detection of major variants (consensus) was consistent, detection of iSNVs may vary between sequencing runs. For example, while iSNVs of case 10 were shared between sequencing runs, iSNVs of case 15 were mutually exclusive (fig. S7). Similarly, when comparing iSNV of 21 samples collected from 10 individuals, major variants in samples from the same patient remained identical but their iSNVs varied. A similar pattern was observed in transmission pairs, where most of the major variants are shared but some of the minor variants are unique (Fig. 4A), which can be explained by a narrow transmission bottleneck.

The statistical framework for estimating the transmission bottleneck size between identified transmission pairs was introduced in (18). It was based on a beta-binomial method which models the number of transmitted virions from donor to the recipient. Because of the high sequencing depth of the data, the minimum variant calling threshold was set to 0.03 with a minimum depth of 100 reads. Bottleneck size estimates were calculated by maximum likelihood analysis comparing the allele frequency of variants passing threshold between samples. The 95% confidence intervals were calculated using a likelihood ratio test. To identify similarity of SNVs between samples we used the Jaccard distance, defined as one minus the proportion of intersection between two samples divided by the proportion of their union. Embedded Image SARS-Cov-2 sequences from Hong Kong contained within-patient variation in 12,859 sites of the genome when compared to the SARS-CoV-2 reference strain Wuhan-Hu-1 (GenBank: MN908947.3). 37.2% of the sites (n=4,779) contained mutations in more than one sample. High frequency of variation (in >100 Hong Kong sequences) was observed in 30 sites (table S6). The spectrum of allele frequencies (fig. S8) showed that over 90% of the variants had allele frequency ≥95% or ≤10%. Mutation hotspots were detected in four sites of the genome with a greater frequency of intra-host single nucleotide variation (iSNVs). These mutations were sporadically distributed across the phylogeny and present in ∼20 sequences from the GISAID data, which may suggest potential homoplasy. We observed two variants unique to Hong Kong, C5812T and G25785T, which were detected in separate phylogenetic clusters of local Hong Kong cases. The C5812T mutation was identified in two sperate clusters in the fourth wave. While the C5812T mutation in one cluster likely descended from a local ancestral case, the mutation in the earlier cluster may have been imported. Similarly, G25785T was found in both the third and fourth waves, and mutations in at least one cluster likely originated from local cases. Some of the low frequency SNVs (frequency <5%, shown in the low peaks to the bottom of fig. S8 and table S7) commonly occurred in global context. For example, the G28883C (G205R in nucleocapsid) and C22227T (A665V in spike) mutations were found in 38.09% and 21.49% of the global cases, however they were only seen in 1.56% and 0.7% of the Hong Kong cases respectively.

Estimation of the instantaneous effective reproductive number (Rt)

The instantaneous effective reproduction number Rt is defined as the average number of secondary cases generated by cases on day t. If Rt > 1 the epidemic is expanding at time t, whereas Rt < 1 indicates that the epidemic size is shrinking at time t. The transmissibility of imported and local cases was expected to be very different because intensive non-pharmaceutical interventions had been imposed on travelers arriving from COVID-19 affected regions since January 2020. Hence, we only included cases from the following three categories into the computation of Rt, i.e., local case and epidemiologically linked with local cases defined by the Centre for Health Protection (CHP, https://www.coronavirus.gov.hk/eng/index.html).

Since the epidemic curves provided by CHP were based on the dates of symptom onset or dates of confirmation, we used a deconvolution-based method to reconstruct the COVID-19 epidemic curves by dates of infection (39, 40). We assumed that the incubation period was Gamma with mean and standard deviation of 6.5 and 2.6 days (41), and that the distribution of the time between symptom onset and case confirmation was Gamma with mean and standard deviation of 4.3 and 3.2 days. For asymptomatic cases, we assumed they shared the same distribution of the time between infection and case confirmation with the symptomatic cases. We then computed Rt for local cases only from the respective epidemic curves using the “EpiEstim” (42) R package (Fig. 3).

Estimation of the relative reproductive number of HK-wave3 compared with HK-wave4A

We defined the comparative transmissibility of any two lineages as the relative reproductive number, i.e., the ratio of their basic reproductive numbers. We extended a previous competition transmission model (43, 44) of two viruses and applied the fitness inference framework to the sequence data collected in Hong Kong during the cocirculation period of HK-wave3 and HK-wave4A clades (between 19-September and 21-October-2020, Fig. 3). We assumed the two clades shared the same generation time distribution which can be approximated by the serial interval distribution estimated in Leung et al. (45) (i.e., Gamma distribution with mean and standard deviation of 5.2 and 1.7 days). The inference framework incorporates both incidence and genotype frequency data that reflect the local comparative transmissibility of cocirculating lineages.

Fig. S1.
  • Download figure
  • Open in new tab
Fig. S1.

Time-scaled maximum-likelihood phylogeny of SARS-CoV2 using IQ-TREE (v.2) (26). The tree colored by (A) pandemic waves in Hong Kong, (B) Nextclade, and (C) PANGO classification, respectively. Global sequences are shown in grey.

Fig. S2.
  • Download figure
  • Open in new tab
Fig. S2.

Root-to-tip regression analysis was performed in TempEst v.1.5.3 (25). Colours indicate pandemic waves in Hong Kong as in fig. S1 (A).

Fig. S3.
  • Download figure
  • Open in new tab
Fig. S3.

Geography of SARS-CoV-2 cases in Hong Kong. (A) Map of Hong Kong’s 18 districts shaded by the number of laboratory-confirmed cases of SARS-CoV-2. Pie charts divided by transmission types. Pie chart size reflects the number of laboratory-confirmed cases in this district (ratio of cases and radius: 1/50,000). (B) same as (A) based on second wave (ratio of cases and radius in pie charts: 1/5,000). (C) same as (A) based on third wave (ratio of cases and radius in pie charts: 1/25,000). (D) same as (A) based on fourth wave (ratio of cases and radius in pie charts: 1/30,000).

Fig. S4.
  • Download figure
  • Open in new tab
Fig. S4.

SARS-CoV-2 imported cases colored by continent of origin.

Fig. S5.
  • Download figure
  • Open in new tab
Fig. S5.

Octopus mobility data during pre-pandemic period and early period of wave four in Hong Kong.

Fig. S6.
  • Download figure
  • Open in new tab
Fig. S6.

Sequencing depth of NGS data across the genome. The colored line represents the average read depth at each genomic position.

Fig. S7.
  • Download figure
  • Open in new tab
Fig. S7.

Proportion of shared mutations between samples from same individuals.

Fig. S8.
  • Download figure
  • Open in new tab
Fig. S8.

Frequencies and dynamics of SNVs in Hong Kong. (A) Percentage of samples that share mutations in third and fourth wave epidemics. The color codes for major and minor mutations at different genomic sites. High-frequency variant sites (n = 36, identified in at least 5% of the respective samples) are labeled. (B) Relative mutation frequency (allele frequency) of SNVs.

The SNVs are colored by the frequencies of mutations reported in GISIAD dataset.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S1.

Cases and genome sequences from waves of SARS-CoV-2 in Hong Kong.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S2.

Genome sequencing and population statistics of Hong Kong districts.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S3.

Travel related cases and epidemiologically linked with imported cases within seven Hong Kong monophyletic clades (over 10 community cases).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S4.

Distribution of PANGO lineages across HK-wave3 and HK-wave4A clades.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S5.

Sample numbers associated with different transmission settings in HK-wave3, HK-wave4A and all community cases in wave3 and wave4 based on epidemiological data.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S6.

Major SNVs identified in third and fourth waves in Hong Kong.

View this table:
  • View inline
  • View popup
Table S7.

Minor SNVs identified in Hong Kong cases (frequency in GISAID ≥1%).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S8.

Estimation on bottleneck size of transmission pairs.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S9.

Highly shared variant sites (allele frequency ≥3% and were found in >1% of the HK samples) located within or related to PCR primer binding regions.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table S10.

Gene annotation of SARS-CoV-2 Genome (nucleotide positions base on reference sequence Wuhan-Hu-1, GenBank: MN908947.3).

Data S1. (Data S1.csv)

Origins of imported cases in Hong Kong.

Data S2. (Data S2.csv)

Sample list with waves, NextClade and PANGO lineage designations.

Data S3. (Data S3.csv)

Summary of Hong Kong monophyletic clades.

Data S4. (Data S4.pdf)

Acknowledgements to sequences obtained from GISAID (accessed on 11-June-2021).

Acknowledgments

We gratefully acknowledge the staff from the originating laboratories responsible for obtaining the specimens and from the submitting laboratories where the genome data were generated and shared via GISAID. We thank Octopus Cards Limited for providing aggregate data of passenger numbers by public transportation means and aggregate data of transactions by retail categories for the research. We acknowledge the technical support provided by colleagues from the Centre for PanorOmic Sciences of the University of Hong Kong. We also acknowledge the Centre for Health Protection of the Department of Health for providing epidemiological data for the study. The computations were performed using research computing facilities offered by Information Technology Services, the University of Hong Kong. The funding bodies had no role in the design of the study, the collection, analysis, and interpretation of data, or writing of the manuscript.

References

  1. 1.↵
    F. Wu et al., A new coronavirus associated with human respiratory disease in China. Nature 579, 265–269 (2020).
    OpenUrlCrossRefPubMed
  2. 2.↵
    E. Dong, H. Du, L. Gardner, An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis 20, 533–534 (2020).
    OpenUrlCrossRefPubMed
  3. 3.↵
    R. Verity et al., Estimates of the severity of coronavirus disease 2019: a model-based analysis. Lancet Infect Dis 20, 669–677 (2020).
    OpenUrlCrossRefPubMed
  4. 4.
    T. W. Russell et al., Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Euro Surveill 25, (2020).
  5. 5.↵
    J. T. Wu et al., Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nat Med 26, 506–510 (2020).
    OpenUrlCrossRefPubMed
  6. 6.↵
    K. Mizumoto, K. Kagaya, A. Zarebski, G. Chowell, Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Euro Surveill 25, (2020).
  7. 7.↵
    L. Ferretti et al., Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science 368, (2020).
  8. 8.↵
    M. G. Baker, N. Wilson, T. Blakely, Elimination could be the optimal response strategy for covid-19 and other emerging pandemic diseases. BMJ 371, m4907 (2020).
    OpenUrlFREE Full Text
  9. 9.↵
    B. J. Cowling et al., Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study. Lancet Public Health 5, e279–e288 (2020).
    OpenUrlPubMed
  10. 10.↵
    D. C. Adam et al., Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat Med 26, 1714–1719 (2020).
    OpenUrlPubMed
  11. 11.↵
    A. Rambaut et al., A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol 5, 1403–1407 (2020).
    OpenUrl
  12. 12.↵
    T. Hale et al., A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat Hum Behav, (2021).
  13. 13.↵
    G. M. Leung, B. J. Cowling, J. T. Wu, From a Sprint to a Marathon in Hong Kong. N Engl J Med 382, e45 (2020).
    OpenUrlCrossRefPubMed
  14. 14.↵
    T. Stadler, D. Kuhnert, S. Bonhoeffer, A. J. Drummond, Birth-death skyline plot reveals temporal changes of epidemic spread in HIV and hepatitis C virus (HCV). Proc Natl Acad Sci U S A 110, 228–233 (2013).
    OpenUrlAbstract/FREE Full Text
  15. 15.↵
    K. Leung, M. H. Shum, G. M. Leung, T. T. Lam, J. T. Wu, Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom, October to November 2020. Euro Surveill 26, (2021).
  16. 16.↵
    D. Zhanwei et al., Pandemic fatigue impedes mitigation of COVID-19 in Hong Kong. Research Square, (2021).
  17. 17.↵
    L. Qiuyan et al., Community Psychological And Behavioural Responses To Coronavirus Disease 2019 Over One Year Of The Pandemic In 2020 In Hong Kong. Scientific Reports, (2021).
  18. 18.↵
    A. Sobel Leonard, D. B. Weissman, B. Greenbaum, E. Ghedin, K. Koelle, Transmission Bottleneck Size Estimation from Pathogen Deep-Sequencing Data, with an Application to Human Influenza A Virus. J Virol 91, (2017).
  19. 19.↵
    K. A. Lythgoe et al., SARS-CoV-2 within-host diversity and transmission. Science 372, (2021).
  20. 20.↵
    M. A. Martin, K. Koelle, Reanalysis of deep-sequencing data from Austria points towards a small SARS-COV-2 transmission bottleneck on the order of one to three virions. bioRxiv, 2021.2002.2022.432096 (2021).
  21. 21.↵
    K. M. Braun et al., Transmission of SARS-CoV-2 in domestic cats imposes a narrow bottleneck. PLoS Pathog 17, e1009373 (2021).
    OpenUrl

Supplementary references

  1. 22.↵
    T. H. C. Sit et al., Infection of dogs with SARS-CoV-2. Nature 586, 776–778 (2020).
    OpenUrl
  2. 23.↵
    M. Vasimuddin, S. Misra, H. Li, S. Aluru, in 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). (2019), pp. 314–324.
  3. 24.↵
    H. Li, A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
    OpenUrlCrossRefPubMedWeb of Science
  4. 25.↵
    A. Rambaut, T. T. Lam, L. Max Carvalho, O. G. Pybus, Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol 2, vew007 (2016).
    OpenUrlCrossRefPubMed
  5. 26.↵
    L. T. Nguyen, H. A. Schmidt, A. von Haeseler, B. Q. Minh, IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32, 268–274 (2015).
    OpenUrlCrossRefPubMed
  6. 27.↵
    T. H. To, M. Jung, S. Lycett, O. Gascuel, Fast Dating Using Least-Squares Criteria and Algorithms. Syst Biol 65, 82–97 (2016).
    OpenUrlCrossRefPubMed
  7. 28.↵
    L. du Plessis et al., Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science 371, 708–712 (2021).
    OpenUrlAbstract/FREE Full Text
  8. 29.↵
    S. Duchene et al., Temporal signal and the phylodynamic threshold of SARS-CoV-2. Virus Evol 6, veaa061 (2020).
    OpenUrl
  9. 30.↵
    A. J. Drummond, A. Rambaut, B. Shapiro, O. G. Pybus, Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22, 1185–1192 (2005).
    OpenUrlCrossRefPubMedWeb of Science
  10. 31.↵
    J. L. Geoghegan et al., Genomic epidemiology reveals transmission patterns and dynamics of SARS-CoV-2 in Aotearoa New Zealand. Nat Commun 11, 6351 (2020).
    OpenUrl
  11. 32.↵
    S. Y. Ho, S. Duchene, D. Duchene, Simulating and detecting autocorrelation of molecular evolutionary rates among lineages. Mol Ecol Resour 15, 688–696 (2015).
    OpenUrlCrossRefPubMed
  12. 33.↵
    R. Bouckaert et al., BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput Biol 15, e1006650 (2019).
    OpenUrlCrossRefPubMed
  13. 34.↵
    A. J. Drummond, S. Y. Ho, M. J. Phillips, A. Rambaut, Relaxed phylogenetics and dating with confidence. Plos Biol 4, e88 (2006).
    OpenUrlCrossRefPubMed
  14. 35.↵
    A. Rambaut, A. J. Drummond, D. Xie, G. Baele, M. A. Suchard, Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst Biol 67, 901–904 (2018).
    OpenUrlCrossRefPubMed
  15. 36.↵
    E. P. Garrison, G. Marth, Haplotype-based variant detection from short-read sequencing. arXiv: Genomics, (2012).
  16. 37.↵
    Z. Lai et al., VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research. Nucleic Acids Res 44, e108 (2016).
    OpenUrlCrossRefPubMed
  17. 38.↵
    A. Wilm et al., LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res 40, 11189–11201 (2012).
    OpenUrlCrossRefPubMedWeb of Science
  18. 39.↵
    K. M. Gostic et al., Practical considerations for measuring the effective reproductive number, Rt. PLoS Comput Biol 16, e1008409 (2020).
    OpenUrlCrossRefPubMed
  19. 40.↵
    J. T. Wu et al., Nowcasting epidemics of novel pathogens: lessons from COVID-19. Nat Med 27, 388–395 (2021).
    OpenUrl
  20. 41.↵
    J. A. Backer, D. Klinkenberg, J. Wallinga, Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20-28 January 2020. Euro Surveill 25, (2020).
  21. 42.↵
    R. N. Thompson et al., Improved inference of time-varying reproduction numbers during infectious disease outbreaks. Epidemics 29, 100356 (2019).
    OpenUrlCrossRefPubMed
  22. 43.↵
    K. Leung, M. Lipsitch, K. Y. Yuen, J. T. Wu, Monitoring the fitness of antiviral-resistant influenza strains during an epidemic: a mathematical modelling study. Lancet Infect Dis 17, 339–347 (2017).
    OpenUrl
  23. 44.↵
    K. Leung, Y. Pei, G. M. Leung, T. T. Lam, J. T. Wu, Empirical transmission advantage of the D614G mutant strain of SARS-CoV-2. medRxiv, (2020).
  24. 45.↵
    K. Leung, J. T. Wu, D. Liu, G. M. Leung, First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment. Lancet 395, 1382–1393 (2020).
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted June 23, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
SARS-CoV-2 under an elimination strategy in Hong Kong
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
SARS-CoV-2 under an elimination strategy in Hong Kong
Haogao Gu, Ruopeng Xie, Dillon C. Adam, Joseph L.-H. Tsui, Daniel K. Chu, Lydia D.J. Chang, Sammi S.Y. Cheuk, Shreya Gurung, Pavithra Krishnan, Daisy Y.M. Ng, Gigi Y.Z. Liu, Carrie K.C. Wan, Kimberly M. Edwards, Kathy S.M. Leung, Joseph T. Wu, Dominic N.C. Tsang, Gabriel M. Leung, Benjamin J. Cowling, Malik Peiris, Tommy T.Y. Lam, Vijaykrishna Dhanasekaran, Leo L.M. Poon
medRxiv 2021.06.19.21259169; doi: https://doi.org/10.1101/2021.06.19.21259169
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
SARS-CoV-2 under an elimination strategy in Hong Kong
Haogao Gu, Ruopeng Xie, Dillon C. Adam, Joseph L.-H. Tsui, Daniel K. Chu, Lydia D.J. Chang, Sammi S.Y. Cheuk, Shreya Gurung, Pavithra Krishnan, Daisy Y.M. Ng, Gigi Y.Z. Liu, Carrie K.C. Wan, Kimberly M. Edwards, Kathy S.M. Leung, Joseph T. Wu, Dominic N.C. Tsang, Gabriel M. Leung, Benjamin J. Cowling, Malik Peiris, Tommy T.Y. Lam, Vijaykrishna Dhanasekaran, Leo L.M. Poon
medRxiv 2021.06.19.21259169; doi: https://doi.org/10.1101/2021.06.19.21259169

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (228)
  • Allergy and Immunology (506)
  • Anesthesia (110)
  • Cardiovascular Medicine (1245)
  • Dentistry and Oral Medicine (206)
  • Dermatology (147)
  • Emergency Medicine (282)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (534)
  • Epidemiology (10032)
  • Forensic Medicine (5)
  • Gastroenterology (500)
  • Genetic and Genomic Medicine (2467)
  • Geriatric Medicine (238)
  • Health Economics (480)
  • Health Informatics (1647)
  • Health Policy (754)
  • Health Systems and Quality Improvement (637)
  • Hematology (250)
  • HIV/AIDS (536)
  • Infectious Diseases (except HIV/AIDS) (11872)
  • Intensive Care and Critical Care Medicine (626)
  • Medical Education (253)
  • Medical Ethics (75)
  • Nephrology (268)
  • Neurology (2290)
  • Nursing (139)
  • Nutrition (352)
  • Obstetrics and Gynecology (454)
  • Occupational and Environmental Health (537)
  • Oncology (1249)
  • Ophthalmology (377)
  • Orthopedics (134)
  • Otolaryngology (226)
  • Pain Medicine (158)
  • Palliative Medicine (50)
  • Pathology (325)
  • Pediatrics (734)
  • Pharmacology and Therapeutics (315)
  • Primary Care Research (282)
  • Psychiatry and Clinical Psychology (2281)
  • Public and Global Health (4844)
  • Radiology and Imaging (843)
  • Rehabilitation Medicine and Physical Therapy (492)
  • Respiratory Medicine (652)
  • Rheumatology (286)
  • Sexual and Reproductive Health (241)
  • Sports Medicine (227)
  • Surgery (269)
  • Toxicology (44)
  • Transplantation (125)
  • Urology (99)