Genomic epidemiology of the SARS-CoV-2 epidemic in Zimbabwe: Role of international travel and regional migration in spread

Zimbabwe reported its first case of SARS-Cov-2 infection in March 2020, and case numbers increased to more than 8,099 to 16th October 2020. An understanding of the SARS-Cov-2 outbreak in Zimbabwe will assist in the implementation of effective public health interventions to control transmission. Nasopharyngeal samples from 92,299 suspected and confirmed COVID-19 cases reported in Zimbabwe between 20 March and 16 October 2020 were obtained. Available demographic data associated with those cases identified as positive (8,099) were analysed to describe the national breakdown of positive cases over time in more detail (geographical location, sex, age and travel history). The whole genome sequence (WGS) of one hundred SARS-CoV-2-positive samples from the first 120 days of the epidemic in Zimbabwe was determined to identify their relationship to one another and WGS from global samples. Overall, a greater proportion of infections were in males (55.5%) than females (44.85%), although in older age groups more females were affected than males. Most COVID-19 cases (57 %) were in the 20-40 age group. Eight lineages, from at least 25 separate introductions into the region were found using comparative genomics. Of these, 95% had the D614G mutation on the spike protein which was associated with higher transmissibility than the ancestral strain. Early introductions and spread of SARS-CoV-2 were predominantly associated with genomes common in Europe and the United States of America (USA), and few common in Asia at this time. As the pandemic evolved, travel-associated cases from South Africa and other neighbouring countries were also recorded. Transmission within quarantine centres occurred when travelling nationals returning to Zimbabwe. International and regional migration followed by local transmission were identified as accounting for the development of the SARS-CoV-2 epidemic in Zimbabwe. Based on this, rapid implementation of public health interventions are critical to reduce local transmission of SARS-CoV-2. Impact of the predominant G614 strain on severity of symptoms in COVID-19 cases needs further investigation.


Introduction
'Severe Acute Respiratory Syndrome Coronavirus 2' (SARS-CoV-2) was confirmed as the causative agent of 'Coronavirus Disease 2019'   (Huang et al., 2020). The virus spread to more than 200 countries, led to more than 47 million confirmed infections and more than 1.2 million deaths (as of 4 th of November 2020). The COVID-19 pandemic constitutes the largest global public health crisis in a century, with daunting health and socioeconomic challenges (Khandelwal, 2020). Countries in the African continent have particular challenges with respect to controlling COVID-19 infections and reducing mortality rates; coupled with weak public health delivery systems they have large populations of vulnerable people e.g. those immunocompromised from HIV/AIDS, with non-communicable diseases such as diabetes, hypertension and anaemia, or malnutrition (Kaseje, 2020).
The first COVID-19 case in Africa was recorded in Egypt on the 14th of February, 2020 (Lone and Ahmad, 2020), and on 27 th February, the sub-Saharan African region recorded its first case in Nigeria. Further cases in sub-Saharan Africa were recorded in South Africa and Zimbabwe, on March 5 th and 20 th , respectively (Msomi et al., 2020). The initial cases identified in Africa were mostly introduced from Europe, the Middle East and the United States of America (Hâncean et al., 2020;Lone and Ahmad, 2020;Mehtar et al., 2020).
On 30 th March, after confirmation of the first death (24 th March 2020) Zimbabwe introduced a 21-day self-quarantine for citizens arriving from affected territories. On the 23rd of March 2020, the country banned: non-essential in-bound travel and transportation except for returning residents; entertainment and recreational activities; public gatherings of > 50 people; and limited hospital visits to one relative per patient per day. Other measures implemented included compulsory wearing of face masks in all settings, social distancing and hand sanitization. A national COVID-19 coordinating committee chaired by a Cabinet Minister, with a full time senior public health specialist as its coordinator, was set up. At all levels from province to district, coordinating committees were established. Rapid Response Teams at all levels were set up to investigate presumed COVID-19 cases, follow-up on all contacts, collect samples from contacts and supervise the burial of confirmed COVID-19 deaths. Treatment centres were set up, mainly Harare and Bulawayo city. Due to the lockdowns imposed in other countries, there was an influx of Zimbabwean residents returning home. This led to establishment of Government run quarantine centres. Initially, all cases housed at quarantine centres had follow up laboratory testing on day 1, 7 and 21 to assess infection and recovery rate. After having completed the 21-day quarantine period with negative results, the returnees/travellers were discharged. The intervention meant that even those who were not showing signs and symptoms of COVID-19 were tested. On 30 th . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi. org/10.1101org/10. /2021 March, total shutdown of all services, except essential services like health care centres, grocery stores and government activities was implemented.
The COVID-19 pandemic, and the actions taken in response to it, disrupted healthcare services and, as a result, overwhelmed the major healthcare programmes in Africa focused on controlling HIV, tuberculosis and malaria. Facilities and staff are taken up with attending to the overwhelmingly high demand for care required by patients with severe COVID-19, leaving other patients without the resources they need. This will have far-reaching consequences for other diseases, poverty, food security, and economic growth in Africa .
There is an urgent need to understand the epidemiology of SARS-CoV-2 in Africa, particularly the dynamics of its virulence, transmission, mutation capacity and symptom variation amongst infected people (Kalk and Schultz, 2020). The SARS-CoV-2 mutation rate is estimated to be approximately 2.5 single nucleotide polymorphisms (SNPs) per month (Duchene et al., 2020;Meredith et al., 2020). Genome sequencing of SARS-CoV-2 is important to monitor these changes because they are relevant to vaccine efficacy, can be used to infer transmission dynamics, and inform future public health measures targeting its spread. Genomic surveillance backed up with detailed epidemiological data on SARS-CoV-2 can identifies which lineages of the virus are circulating in the human population. Information on how these change over time can serve as an indicator of the success of control measures, how often new sources of virus are introduced from other geographical areas, and how the virus has evolved in response to interventions. Despite having inadequate health systems, the number of reported cases in many African countries, specifically Zimbabwe, has been rising more slowly than predicted (Nkengasong and Mankoula, 2020).
Zimbabwe took the initiative to sequence SARS-CoV-2 samples from 100 patients collected over the period March to June 2020. The objectives were to use the sequence results to: understand initial transmission events; understand early domestic transmission of the virus in Zimbabwe; add context to the regional and global data; and to evaluate the role of rapid whole genome sequencing for outbreak analysis in this setting.

Material and methods
Epidemiological analysis. Nasopharyngeal samples (92,299 in total including repeats) were collected from suspected COVID-19 cases across Zimbabwe between the 20 th March (first case identified) and the 16 th October 2020 were diagnosed routinely at nine local . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted January https://doi.org/10.1101https://doi.org/10. /2021 laboratories to various regions affected using previously described PCR methods to confirm positive cases (Corman et al., 2020) cycles. Odd and even PCR reactions were pooled and cleaned using a 1x SPRI bead clean . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint using KAPA Pure Beads (Roche Catalogue No. 07983298001) according to manufacturer instructions. PCR products were eluted in 30 µl of 10 mM Tris-HCL buffer, pH 7.5. cDNA was quantified using QuantiFluor ® ONE dsDNA System (Promega, WI, USA). Libraries were then prepared for sequencing on the Illumina or Nanopore platform and sequenced as described previously (Baker et al., 2020). All sequence data are freely available in the GISAID database (Supplementary Table 1 and Supplementary Table 2). Sequence data of SARS-CoV-2 from South Africa was reported previously (Msomi et al., 2020).  (Li, 2013), the ARTIC amplicons were trimmed and a consensus built using iVAR (v.1.2) (Grubaugh et al., 2019).
Quality control of the sequenced genomes. Samples were prepared and sequenced in 96-well plates with one cDNA negative control per plate and one RNA extraction negative control, where applicable. Contaminated samples were removed from analysis. Consensus sequence was defined as passing quality control if greater than 50% of the genome was covered by confident calls or there was at least one contiguous sequence of more than 10,000 bases and no evidence of contamination in the negative control. This is regarded as the minimum amount of data to be phylogenetically useful. A confident call is defined as having 10x depth of coverage. If the coverage falls below these thresholds, the bases are masked with Ns. Low quality variants are also masked with Ns. The QC threshold for inclusion in Global Initiative on Sharing All Influenza Data (GISAID) database is higher, requiring that greater than 90% of the genome is covered by confident calls and that there is no evidence of contamination.
Clustering and phylogenetic analysis. Lineages  assigned to each consensus genome were determined using Pangolin (https://github.com/covlineages/pangolin) from the Rambaut group on MRC CLIMB (Connor et al., 2016) using pangoLEARN training data set from 2020.08.29, with uncertain classifications (low support or discordance from tree) curated by hand. Only samples that passed quality control were considered for genomic analysis. Global sequences (Supplementary Table 2) that were potentially related to those from Zimbabwe were included in the phylogenetic analysis, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Results:
Epidemiology of SARS-CoV-2 infections in Zimbabwe from March to October 2020. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint The majority of cases recorded prior to 20 th of July were travel associated. The number of cases with no recent travel reported remained below 50 during this period of time. People travelling from South Africa accounted for the greatest proportion (83.4%) of travelassociated cases (Fig. 1). From the time the first case was recorded to about 120 days later, there were more travel-associated cases than locally-transmitted cases, and the majority of travel-related cases were asymptomatic. During July the number of cases with no recent travel reported increased rapidly, confirming the start of community transmission. An increase in overall number of daily cases was observed from day 60 after the first case and peaked during the first half of August. Thereafter the number of cases decreased to the middle of October when approximately 20 cases per day were identified.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint Of those infected, 4455 (55%) were men and 3644 (45%) were women. Health workers constituted about 8% of the total number of reported cases. The most affected age group was the 31-40 years group (Fig. 2). The proportion of cases recorded in all age group categories were slightly greater amongst men than women except in the 91-100 years age group (Fig. 2). The greatest skew towards men was in the age range 41-50 while in females it was amongst the 91-100 age group. Overall, most deaths occurred amongst men (59%) and the largest number of deaths were in the 41-50 years age group. The CFR increased from age-group 41-50 to >90 years. The under-1-year CFR was 5% (Fig. 2).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint   genomes were present as a separate lineage (Figure 3). If each of these clusters . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint represented separate introductions then we can infer that 27 was the lower limit for the number of independent introductions evident and for which there was good supporting evidence. However, in the case of four clusters or single genome lineages, the tree topology suggested that the genomes arose by divergence from just two introductions into Zimbabwe.
In each case, two lineages shared a common ancestor with no intermediary nodes formed with genomes from other countries (blue and green lineages in Figure 3). By this more conservative estimation, the minimum number of introductions into Zimbabwe that our data support was 25. The 12 clusters were each comprised between 2 and 30 genomes with 0-3 SNPs between members of the cluster. Seven clusters contained more than one genome that had an identical genome sequence to at least one genome in the same cluster.
Two large clusters within lineage B.1.1 were present on extended branches of the phylogenetic tree composed of genomes that were distinct from all other to any genome present in the GISAID database by 4-5 SNPs. These introductions may have been from geographical locations where sequence of SARS-CoV-2 was not reported or the result of sequence divergence during transmission within Zimbabwe from an earlier case within Zimbabwe not included in our analysis.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

The D614G genotype was the most frequent variant in early spread events in
Zimbabwe. There has been considerable interest in the emergence of a variant lineage of SARS-CoV-2 with a non-synonymous mutation resulting in substitution of an aspartate with a glycine residue at position 614 (D614G) of the spike (S) protein; this variant is linked to increased transmissibility in the upper respiratory tract and increased fitness (Plante et al., 2020;Volz et al., 2020). We therefore determined the distribution and frequency of the D614G variant in SARS-Cov-2 genomes from Zimbabwe. A total of 95 samples out of 100 contained the D614G mutation. Four of the five samples with the ancestral 614D form were . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint from a single cluster of identical genomes from lineage A which was associated with travel from Dubai; the other was from lineage B.2 and associated with travel from USA. All five were from cases presenting in April or May 2020. Samples ZW25 and ZW29 were indistinguishable from one another suggesting a recent common source or direct transmission. However, this genome sequence was extremely common in the GISAID database (Shu and Mccauley, 2017) where identical genomes have been reported from multiple countries, although predominantly Europe, consistent with rapid transmission of this clone internationally (Figure 4).

Genomic epidemiology identifies intercontinental transmission and
Genome ZW70 was from a traveller arriving from Dubai and was identical to three other genomes: ZW168, ZW169 and ZW214. Local epidemiological investigation indicated that these were infections resulting from cohabitation and may represent the earliest cases of local transmission. This interpretation was supported by the genome sequence data. Dubai airport is an important hub for flights from many locations in the world including Asia.
Genomes ZW168, ZW169 and ZW214 all belonged to lineage A, which is most widely distributed in China.
An additional seven and three genomes were associated with arrivals from the UK and the US, respectively, in either late March or April. International travel was suspended following the arrival of these cases. Nine of these genomes belonged to the B1 lineage, while the last from a case associated with travel to the US belonged to the B2 lineage. Although in the same B1 lineage, eight of these travel-associated cases were on different sublineage and, with one exception, had no identical genomes from non-travel associated COVID-19 cases in our dataset. The one exception was sample ZW25BY that was from a case recorded in Bulawayo in April, which was identical to samples ZW-391 and ZW-EC, also from Bulawayo, with no travel-history recorded which, therefore, represents potential domestic transmission.
An increase in cases associated with Zimbabwean residents returning from neighbouring countries, especially South Africa, during May 2020 was reported by local surveillance. At this time SARS-CoV-2 was reported to be spreading rapidly in South Africa and therefore a two-week quarantine in designated accommodation was introduced. Although the majority of returnees were asymptomatic, they were tested and on multiple occasions were positive for . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint SARS-CoV-2. A total of 26 samples from quarantined individuals housed in six of these accommodations were sequenced. In several quarantine accommodation settings, genomes from multiple samples from different individuals resident in the same quarantine location were identical or differed by only a single nucleotide in the genome. For example in quarantine location A, two clusters were identified that consisted of 10 and 14 samples (May 2020), respectively all of which were in the B1.1 lineage with genomes that were indistinguishable from one another; this was consistent with direct transmission. Since returnees from multiple locations were quarantined together, the most likely explanation is that transmission occurred in the quarantine accommodation rather than from a common outside source. Later spread events were primarily linked to returnees from South Africa and potential transmission in quarantine facilities.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Discussion:
Epidemiological analysis identified two phases of the SARS-CoV-2 epidemic in Zimbabwe.
The vast majority of cases in the first 3 months were associated with international or regional travel. This was confirmed by the epidemiological data linking most confirmed cases with a history of having travelled outside of the country exacerbated by rapid influx of Zimbabwean citizens returning from other countries already experiencing the disease. A rapid increase in the number of cases was associated with probable domestic transmission after July 2020.
Zimbabwe is a landlocked country in Southern Africa between the Zambezi and Limpopo rivers and bordering South Africa, Botswana, Zambia and Mozambique. Travel history is essential for understanding the introduction of SARS-Cov-2 into a nation, and this was confirmed by the data from Zimbabwe. Initial cases recorded within the first 120 days of the outbreak were associated with travel from international locations already experiencing the disease. The majority of the initial COVID-19 cases in Zimbabwe (20 th March -30 th June 2020) were Zimbabwean residents returning from neighbouring South Africa, Botswana and Mozambique. Most were associated with travel from South Africa related to informal trade between these two countries by cross-border traders, the majority of which are aged 25-45 years. The combination of informal cross-border traders and formally returning residents may have contributed to the subsequent increased local transmission in Zimbabwe. Other cases, however, were from Europe, Asia and USA, the epicentres of the SARS-Cov-2 epidemic. However, the rise is cases was slower than predicted and mirrored what has been seen in many African countries (Nkengasong and Mankoula, 2020). This slow rise in cases could be attributed to the measures the country instituted which included wearing of face masks, hand sanitization and lockdown. These measures have been shown to minimise and contain the outbreak in other countries, including France (Cauchemez et al., 2020).
The SARS-Cov-2 pandemic is an emerging infectious disease with no known public health interventions except the measures implemented in China, Europe and elsewhere.
Zimbabwe's response was shaped by experiences from other countries, where the epidemic . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January https://doi.org/10.1101https://doi.org/10. /2021 was described first (La Regina et al., 2020;Shekhar et al., 2020;Xu et al., 2020). For the first 120 days following the first case of Covid-19 in Zimbabwe the total number of detected cases remained below 50, suggesting that public health interventions were effective in limiting local transmission to a minimum. The observed delayed local transmission may be as a result of the surveillance systems employed. Limiting movement by traders and cargo to a small manageable number, implementation of preventative equipment (e.g. hand sanitization, face mask wearing) and response measures within the first 30 days after the first case could explain this phenomenon. Though measures to minimise transmission were in place, a lot of irregularities emerged such as residents crossing borders illegally and/or non-compliance of quarantine requirements and relaxation of lockdown measures which could have contributed to an increase in local transmission. This was also evidenced by having the greatest number of cases centred around Bulawayo, Gweru and Harare cities with close proximity to land borders or international airports and densely populations. These factors may explain the propagated picture of the COVID-19 epidemic curve in Zimbabwe.
Nonetheless, current evidence suggests that quarantine of travel-associated cases and early suspension of tourism-associated travel may have been effective based on our genomic analysis of cases in the first 120 days following the first case.
The factors associated with contraction of COVID-19 in Zimbabwe were consistent with factors associated with contraction in high income countries. The overall CFR of SARS Cov-2 was 2.9%, which was comparable with CFRs reported elsewhere in Asia, Europe and USA (Niforatos et al., 2020;Pachetti et al., 2020;Samaddar et al., 2020). Compared with other countries, younger males aged 31-40 years, were infected with SARS-CoV-2 in Zimbabwe during the first 120 days of the epidemic. However, the older population above 60 years was more likely to die from COVID-19 disease compared with the younger age groups, this agrees with what have been observed in other regions (Brandén et al., 2020;Dhama et al., 2020). Outbreak descriptions from Europe, Asia and the United States of America, indicated that older members of the population with pre-existing medical conditions were at increased risk of both becoming infected with SARS-Cov-2 and of dying from infection (Sanyaolu et al., 2020;Schultze et al., 2020). In the United Kingdom (UK) a considerably larger proportion of cases from ages 10-50 were in females, with cases in males being greatest in 0-10 years and 60-80 age categories (Page et al., 2020). These observations may have been due to two demographic characteristics prevalent in Zimbabwe. First, the younger age-group, 31-50 years is the most mobile and some have migrated out of the country in search of employment (Potts, 2010;Crush et al., 2015). Secondly, the same age group are sexually active and contribute the highest prevalence's of HIV/ AIDS infection in the Southern Africa region (Gouws et al., 2008).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint Fourteen out of the 100 SARS-Cov-2 cases sequenced reported that they had travelled from outside the Southern Africa region. Of these cases two resulted from local transmission with a positive epidemiological link. Amongst the remaining twelve cases there was no evidence of onward transmission based on the SARS-CoV-2 genomes sampled and sequenced in the subsequent two months. The sequences of SARS-CoV-2 genomes associated with travellers returning from neighbouring Southern African countries were distinct from those associated with travellers from the US, UK and Dubai recorded earlier in the epidemic; most were from lineage B1.1, which was also the most common lineage amongst 2016 sequences from 18 African countries reported recently (www.afro.who.int). Interestingly, lineage B.1.1.1 which accounted for 16% of the samples reported in South Africa was not found in our analysis. This may be due to the fact that South Africa is a major airline hub in Southern Africa, connecting most of the countries to Europe, Asia and USA.
The D614G genotype was the most frequent variant in early spread events in Zimbabwe, consistent with the reported trend for increased frequency of this genotype in many locations globally, even when the ancestral genotype (614D) was previously established (Korber et al., 2020). There has been considerable interest in the emergence of a variant lineage of SARS-CoV-2 with a non-synonymous mutation resulting in substitution of 614G in which a glycine residue was replaced by an aspartate residue at position 614 (D614G) of the spike (S) protein. The S-protein is thought to be important for entry of the virus into host cells and D614G has been implicated in increased cell entry (Korber et al., 2020); the mechanism for this was via increasing the likelihood of binding the ACE2 receptor on the cell surface (Yurkovetskiy et al., 2020). Consistent with this idea, population genetic modelling studies indicated that the 614G variant increased in frequency in the UK relative to the 614D ancestral variant, suggesting a selective advantage for 614G (Volz et al., 2020). This protein is also the target in anti-viral (remdesivir and favipiravir) treatment trials for COVID-19 (Saxena, 2020). The mutation of SARS-CoV-2 is important because vaccine-escape strains may emerge quickly and there is also the potential for detection-failure if the target used in the RT-PCR tests for SARS-CoV-2 may accumulate mutations affecting the test. A total of 95 of the 100 SARS-CoV-2 genomes used in our analysis contained the D614G mutation. All five of the genomes with the ancestral 614D genotype were from samples from March or April 2020, with none observed in sequences from samples taken in May or June. The dominance of the 614G genotype in Zimbabwe likely reflects the relatively high frequency of the 614G genotype in countries from which the virus was introduced. However, since it has been suggested that the 614G genotype is present in greater numbers in the upper respiratory tract in infected people it may also have been more likely to spread once introduced into the Zimbabwean population.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint A combination of local surveillance and whole genome sequencing revealed the early intercontinental and subsequent local international spread of the SARS-CoV-2 into Zimbabwe. The analysis provides insights into the impact of interventions on transmission and the basis for improved future responses to this and other pandemics. Our whole genome sequence analysis provides a baseline view of the SARS-CoV-2 introduced into Zimbabwe that will place later domestic spread of the virus within a Zimbabwean context. As the national response to the SARS-CoV-2 pandemic continues the capability to test, isolate and track the evolution of SARS-CoV-2, is crucial to assess ongoing mutation of the virus.

Conclusion:
Phylogenomic analysis highlights the dominance of regional migration over intercontinental travel in the spread of SARS-CoV-2 in Zimbabwe. The majority of SARS-CoV-2 genomes from lineages B.1 and B.1.1 were recovered from residents returning from South Africa.
These lineages appear to be predominant in Africa and continuous efforts for widespread sequencing is important to provide insights about their origin, and the evolution of local/imported viral dissemination in Zimbabwe. As the epidemic unfolded most of the more than 8,000 cases were due to local transmission. Risk factors for local transmission have not been described. Continuous surveillance using whole-genome sequencing is required to estimate the presence of mutations that may be increasing transmissibility. In addition to describing the immunological response to SARS-CoV-2 infection, data from SARS-CoV-2 genomic surveillance may contribute to the development of vaccines against this novel virus.
Zimbabwe will need to strengthen health systems so that public health interventions that are effective against SARS-CoV-2 transmission elsewhere in the world, are also effective in Zimbabwe.

Limitations
Samples used for whole genome sequencing where sampled from the first 120 days of the epidemic. Potential subsequent circulation of descendants of these early intercontinental introductions after around 120 days once transmission was increasing exponentially, remains to be determined.

Ethical approval:
Ethical approval for the study was exempted by the Medical Research Council of Zimbabwe as this study was in response to a public health emergency.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted January 6, 2021. ; https://doi.org/10.1101/2021.01.04.20232520 doi: medRxiv preprint