Trends and Intensity of Human Rhinovirus Invasions in Kilifi, Coastal Kenya Over a Twelve-Year Period, 2007-2018

ABSTRACT Background: Human rhinovirus (HRV) is an ubiquitous pathogen and the principal etiologic agent of common cold. Despite the high frequency of HRV infections, data describing its long-term epidemiological patterns in a single population remain limited. Methods: We analysed 1,070 VP4/VP2 genomic region sequences obtained from samples collected between 2007-2018 from hospitalised paediatric patients (< 60 months) with acute respiratory disease in Kilifi County Hospital on the Kenya Coast. Results: Of 7,231 children enrolled, HRV was detected in 1,497 (20.7%) andVP4/VP2 sequences were recovered from 1,070 samples (71.5%). A total of 144 different HRV types were identified (67 HRV-A, 18 HRV-B and 59 HRV-C) and at any time-point, several types co-circulated with alternating predominance. Within types multiple genetically divergent variants were observed. Ongoing HRV infections appeared to be a combination of (i) persistent types (observed up to seven consecutive months), (ii) reintroduced genetically distinct variants and (iii) new invasions (average of 8 new types, annually). Conclusion: Sustained HRV presence in the Kilifi community is mainly due to frequent invasion by new types and variants rather than prolonged circulation of locally established strains.


Introduction 44
Human rhinovirus (HRV) is a highly prevalent pathogen and the principal cause of 45 common cold syndrome globally [1,2]. HRV infections result in a wide range of 46 clinical outcomes spanning from asymptomatic and mild illness in the upper airways 47 to severe illness in the lower airways [3,4]. The infections occur in persons of all 48 ages, with the severe presentation more likely in children under the age of 5 years 49 . The three surface-exposed capsid 56 proteins (VP1, VP2 and VP3) carry the antigenically critical sites [9][10][11]. High genetic 57 variability in the VP4/VP2 and VP1 genomic regions has been instrumental to 58 molecular typing [12,13] and molecular epidemiological investigations [14][15][16]. 59 Currently, 169 HRV types have been described, and classified into three species i.e., 60 HRV-A, HRV-B and HRV-C 61 (https://www.picornaviridae.com/sg3/enterovirus/enterovirus.htm ). 62 HRV infections occur all year-round in most geographical locations, although 63 peaking in the early autumn and late spring in many temperate countries, and in the 64 rainy season in tropical countries [2,17]. Unclear seasonality and year-round 65 infections have been attributed to lack of inter-type cross-protective immunity [18,19], 66 coupled with the high genetic diversity within the three species, each with the ability 67 to spread independently in a population [14,16,20]. 68 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 6 We used the term 'type' to refer to HRV sequences classified by either cross-119 neutralization or genetic comparisons as distinct as described previously [13]. Based 120 on this approach, sequences were assigned into the same HRV type based on 121 >90% nucleotide similarity to rhinovirus prototype sequences (also referred to as 122 reference sequences, http://www.picornaviridae.com/sequences/sequences.htm) 123 and phylogenetic clustering with bootstrap support value >70% [13]. Distributions of 124 pairwise genetic distances were assessed for evaluation of intertype and intra-type 125 divergence [13]. Intra-type 'variant' was defined on the basis of a divergence 126 threshold value determined as the least frequent value between the first and second 127 modes in a pairwise nucleotide difference distribution plot. Here we are implicitly 128 assuming that sequences with pairwise nucleotide difference falling into the 129 distribution with the low (first) mode are members of the same phylogenetic clade, 130 whereas those with pairwise nucleotide difference within the second distribution with 131 higher mode are members of different phylogenetic clades. A group of viruses within 132 the first, lower distribution were classified as belonging to the same HRV type 133

variant. 134
The definitions used to describe the temporal occurrence of HRV types are 135 summarised as follows: is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  Table 1). The monthly frequency of detection of HRV in the study population is 159 shown in Figure 1. HRV infections were occurring year-round frequently peaking 160 between the month of May and September each year ( Figure 1). 161

9
Additionally, several types recurred after considerable periods of absence. For is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. The majority (99%) of our sequences occurred within the proposed 232 divergence thresholds for HRV typing and classification using VP4/VP2 region 233 (10.5% for HRV-A, 9.5% for HRVB, and 10.5% for HRV-C) [13]. This exemplifies 234 significant sequence conservation in the VP4/VP2 region within a type allowing 235 robust genotypic assignment. However, 24 sequences did not fit the classification 236 system for VP4/VP2 region and require further investigation such as by whole 237 genome sequencing to also consider variation in the VP1 region to determine 238 whether these sequence correspond to new types [13]. 239 In total, 144 distinct HRV types were detected over the 12-year period, 240 representing 85% of the currently known types (http://www.picornaviridae.com). The 241 finding of enteroviruses (EV) sequences reflects the cross-reactivity of the HRV 242 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021.
detection PCR for non-rhinovirus EV, due to their close relationship and the 243 nucleotide sequence conservation of the 5'-untranslated region that is targeted by 244 our diagnostic assay [34,35]. Half of the EVs (5/10) were EV-D68, an EV that is also 245 associated with respiratory disease [36]. Two sequences were found to be HPV-1. 246 The children of 23-and 10-months age had probably been immunized (oral polio 247 vaccine) just before the nasopharyngeal swab. The other EVs detected (CV-B3, CV-248 B2 and E-19) are found only occasionally in respiratory samples [37]. genetically identical to older circulating strains, may also be observed in the portion 257 of the population that had not been previously exposed to a HRV type. 258 Some HRV types were seen once or very sporadically and could be 259 associated with mild disease or asymptomatic infections or have reduced 260 transmission rates probably suppressed by pre-existing host population immunity 261 perhaps indicating the period a given study cohort will take to experience every 264 known HRV type or maximum number of types. 265 In some HRV types, VP4/VP2 coding region remained conserved at both 266 nucleotide and amino acid level after periods of quiescence, which probably ensures 267 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ; 12 strain survival by maintaining low-level genetic variation. In a linear strain space, 268 strains interact via cross-immunity to nearby strains with shared epitopes and this 269 interaction tails off with genetic distance [40]. Yet, the VP4/VP2 region might not be 270 primarily antigenic [41] , and genetic changes could have occurred at immunogenic 271 sites located in other capsid proteins (VP1 or VP3). Genome-wide sequence data 272 would therefore be useful to confirm strain conservation and maintenance. The 273 evident intra-type genetic diversity with differential temporal distribution could is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ; https://doi.org/10.1101/2021.08.03.21261536 doi: medRxiv preprint 291 We thank the study participants for providing the study samples. We also thank all 292

members of the Virus Epidemiology and Control (VEC) Research Group in Kilifi who 293
were involved at various stages of this study. We also thank the parents and 294 guardians of the children for accepting to participate in this study. 295 296

Funding 297
This study was supported by The Wellcome Trust, United Kingdom (grant 102975). 298 The funder had no role in other aspects of the study including its design, data 299 collection, data analysis, data interpretation, or writing of this manuscript. This was 300

Conflict of Interests 309
The authors declare that they have no competing interest. 310

Availability of data and materials 311
All data generated and analysis script for this manuscript are available from the Virus is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 20 Tables 445 Table 1 Number of different human rhinovirus types identified in Kilifi, Kenya, 2007-446 2018   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ; Figure 4 (A) Nucleotide variability across the sequenced VP4/VP2 region for types 469 A49 and C2. For each type, the viruses were compared to the earliest sampled 470 sequence. Vertical coloured bars show the nucleotide differences: red is a change to 471 T, orange is a change to A, purple is a change to C and blue is a change to G. (B) 472 Distribution of pairwise nucleotide difference for the VP4/VP2 region of types A49 473 and C2. 474  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021. ; https://doi.org/10.1101/2021.08.03.21261536 doi: medRxiv preprint

22
show the nucleotide differences: red is a change to T, orange is a change to A, 494 purple is a change to C and blue is a change to G. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 4, 2021.