Combinatorial decomposition of an outbreak signature

https://doi.org/10.1016/j.mbs.2006.03.012Get rights and content

Abstract

We use mathematically rigorous definitions of epidemiological concepts in order to derive a sequential combinatorial model of disease outbreak decomposition. We define the idea of a population specific ‘disease signature’ and use this in order to decompose and further understand outbreaks as incidents of spatial and temporal spread of disease exposure both in, and across, populations. This allows us to differentiate between different disease spread scenarios with a level of sensitivity that previous models were unable to provide. This perspective leads us to propose a new practical definition for ‘outbreak’. In addition, we are able to use this model to understand, estimate, and, in some cases, correct for, the likely instances of reporting error inherent in disease surveillance.

We demonstrate our model first with a hypothetical outbreak scenario and then in an analysis of suspected outbreaks of waterborne diseases in Massachusetts (MA) in 1995.

Introduction

Mathematical models have long been recognized as useful epidemiological tools. They provide a foundation for quantitative predictions, allow for rigorous testing of hypotheses, and necessitate clear definitions of concepts and parameters. The complicated and diverse array of infectious diseases lead to the generation of generalized models that need to be tailored by the use of carefully generated parameters to provide direct insight into the mechanisms of transmission of a particular pathogen, the rate of infection and likelihood of widespread outbreak given certain circumstances, all of which have direct applications for health care management and disease control. Traditionally, these tailored parameters have been mass action transfer terms governing the respective likelihoods of an individual transitioning from susceptible to infected to recovered over time (SIR models, cf. [1]). In cases involving complex circumstances such as multiple distinct populations or repeated, isolated exposures, these models can become complicated, the determination of appropriate mass action terms for each separate population can be difficult [2] and, in some cases, the use of mass action terms themselves can be inappropriate for the focus of the investigation [3]. Additionally, while these mass action transfer terms are mathematically meaningful, they are clinically difficult to measure, creating a disparity between mathematical elegance and usefulness.

By focusing our models on a narrower set of pathogens, those where the link between exposure and infection is clearly defined (as opposed to diseases where there can be multiple and confounding factors), we are able to use the timing of disease incidence and different etiologies specific to the different affected subpopulations to fully understand the dynamics of disease outbreaks. We here propose a method of sequential combinatorial decomposition to accomplish this narrower focus, allowing us to incorporate an understanding of the different temporal distributions governing the transition from susceptible to infected to recovered associated with each population. This method embodies a compromise between the complexity of individual behavior and the broad-brush assumption of mass action, population averages, and is based on a set of clinically measurable parameters. Our system of choice for this study will be waterborne illness due to the clearly defined direct link between exposure and infection. While we have chosen this system for study here, this method may be applied to any system so long as that link is unambiguously understood. All implications of our model are meant to be representative only of this type of system, though the theory may be generalized to others.

Unlike most SIR models that focus solely or primarily on secondary (human-to-human) transmission, we emphasize primary transmission for waterborne illnesses (i.e. an infection from an environmental contaminant/external point source). This is an important aspect because many waterborne diseases have this sort of external transmission that at least sparks an outbreak. We here present a set of rigorous definitions and operational rules that lead to a natural characterization of disease spread. This provides an outbreak signature decomposition model through heterogeneous populations. Additionally, our model will provide a natural, practical, mathematical definition of an ‘outbreak’. We will present our method by analyzing both data from a simulated scenario and actual data from the suspected cryptosporidiosis and giardiasis outbreaks in Massachusetts, USA during 1995 [4].

Section snippets

Motivation and rationale

In contrast to other diseases, the waterborne illnesses giardiasis and cryptosporidiosis are prototypical emerging diseases. Both are caused by microscopic parasites in the intestine and are passed in the stool of those infected, contaminating soil, food, or water. Cryptospordiosis has both a small inoculum for humans [5], [6] and a large animal reservoir [7]. Both cryptosporidiosis and giardiasis have high rates of exposure once contamination is present in a population, and high rates of

Outbreak signature as a composite of component disease signatures

In the interest of providing a more practical metric, every outbreak can be described as a series of separate events in time and space [4], [16]. Each has a set of characteristics that may distinguish it from others, even of the same pathogenic source: duration of the outbreak, magnitude, the overall shape, etc. Together, these traits create an outbreak signature which mathematical models may be used to reproduce or even predict [17], [18]. By using the specific properties of the disease

Basic model formulation

The parameter and variable notation in traditional SIR models lend themselves naturally to mathematical formulation of epidemiological processes, however, these formulations are not always of equal facility in clinical practice or public health surveillance (either for discussion or for practical measurement). Fundamentally, all studies of disease spread rely on composites of underlying etiological and population-specific parameters. In order to foster greater facility in communication among

Discussion

This method of combinatorial modeling allows for a careful understanding of the spread of disease through heterogeneous populations, incorporating spatial and temporal complexity. Existing models look at an outbreak as a single curve. In reality, few outbreaks, no matter how they are defined, are isolated enough to be the result of one instance of pathogen contamination into a single, uniform population. This implies that most outbreaks are, in fact, composites of multiple curves, each

Acknowledgments

We would like to thank the NIH for supporting this research with grant R01 HD038327-04 (N.H.F.), AI03015 (E.N.N., N.H.F.), Dr. J.K. Griffiths, and Dr. J.M. Reed for their thoughtful advice, J. Jagai for help with data abstraction in preparation of the manuscript, Dr. A. DeMaria for providing us with reported case incidence data, and the support of an NSF travel grant to The International Environmetrics Society 2003 annual conference where the contents of this paper, in part, were originally

References (30)

  • E.N. Naumova et al.

    Use of passive surveillance data to study temporal and spatial variation in the incidence of giardiasis and cryptosporidiosis

    Public Health Rep.

    (2000)
  • H.L. DuPont et al.

    The infectivity of Cryptosporidium parvum in healthy volunteers

    N. Engl. J. Med.

    (1995)
  • T.S. Steiner et al.

    Protozoal agents: what are the dangers for the public water supply?

    Annu. Rev. Med.

    (1997)
  • S.E. Majowicz et al.

    Descriptive analysis of endemic cryptosporidiosis cases reported in Ontario, 1996–1997

    Can. J. Public Health Rev.

    (2001)
  • W.R. MacKenzie et al.

    A massive outbreak in Milwaukee of cryptosporidium infection transmitted through the public water supply

    N. Engl. J. Med.

    (1994)
  • Cited by (9)

    • Waterborne disease surveillance

      2019, Encyclopedia of Environmental Health
    • Deviations in influenza seasonality: Odd coincidence or obscure consequence?

      2012, Clinical Microbiology and Infection
      Citation Excerpt :

      However, as illustrated by the dynamic maps, even within a single influenza season, it is possible to trace multiple origins contributing to the overall seasonal curve. A seasonal pattern observed globally is not necessarily a simple sum of patterns observed locally [43]. Although annual epidemics typically begin abruptly, peak within 2-3 weeks and last from 5 to 10 weeks in the continental USA [20], their local behavior might exhibit unusual clusters that percolate during an influenza season.

    • Waterborne Disease Surveillance

      2011, Encyclopedia of Environmental Health
    • Waterborne Disease Surveillance

      2011, Encyclopedia of Environmental Health, Volume 1-5
    View all citing articles on Scopus
    View full text