Inferring pandemic growth rates from sequence data

Eric de Silva; Neil M Ferguson; Christophe Fraser

doi:10.1098/rsif.2011.0850

Inferring pandemic growth rates from sequence data

J R Soc Interface. 2012 Aug 7;9(73):1797-808. doi: 10.1098/rsif.2011.0850. Epub 2012 Feb 15.

Authors

Eric de Silva¹, Neil M Ferguson, Christophe Fraser

Affiliation

¹ Department of Infectious Disease Epidemiology, MRC Centre for Outbreak Analysis and Modelling, Imperial College London, London W2 1PG, UK.

Abstract

Using sequence data to infer population dynamics is playing an increasing role in the analysis of outbreaks. The most common methods in use, based on coalescent inference, have been widely used but not extensively tested against simulated epidemics. Here, we use simulated data to test the ability of both parametric and non-parametric methods for inference of effective population size (coded in the popular BEAST package) to reconstruct epidemic dynamics. We consider a range of simulations centred on scenarios considered plausible for pandemic influenza, but our conclusions are generic for any exponentially growing epidemic. We highlight systematic biases in non-parametric effective population size estimation. The most prominent such bias leads to the false inference of slowing of epidemic spread in the recent past even when the real epidemic is growing exponentially. We suggest some sampling strategies that could reduce (but not eliminate) some of the biases. Parametric methods can correct for these biases if the infected population size is large. We also explore how some poor sampling strategies (e.g. that over-represent epidemiologically linked clusters of cases) could dramatically exacerbate bias in an uncontrolled manner. Finally, we present a simple diagnostic indicator, based on coalescent density and which can easily be applied to reconstructed phylogenies, that identifies time-periods for which effective population size estimates are less likely to be biased. We illustrate this with an application to the 2009 H1N1 pandemic.

MeSH terms

Humans
Influenza A Virus, H1N1 Subtype* / genetics
Influenza A Virus, H1N1 Subtype* / pathogenicity
Influenza, Human* / epidemiology
Influenza, Human* / genetics
Models, Biological*
Pandemics*
Phylogeny*
RNA, Viral / genetics*
Sequence Analysis, RNA

Substances

RNA, Viral

Grants and funding

G0800596/MRC_/Medical Research Council/United Kingdom