Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Spatial aggregation choice in the era of digital and administrative surveillance data

View ORCID ProfileElizabeth C. Lee, Ali Arab, Vittoria Colizza, Shweta Bansal
doi: https://doi.org/10.1101/2021.04.22.21255643
Elizabeth C. Lee
1Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elizabeth C. Lee
  • For correspondence: elizabeth.c.lee@jhu.edu
Ali Arab
2Department of Mathematics and Statistics, Georgetown University, Washington, District of Columbia, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vittoria Colizza
3INSERM, Sorbonne Université, Institut Pierre Louis d’Epidémiologie et de Santé Publique, Paris, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shweta Bansal
4Department of Biology, Georgetown University, Washington, District of Columbia, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Traditional disease surveillance is increasingly being complemented by data from non-traditional sources like medical claims, electronic health records, and participatory syndromic data platforms. As non-traditional data are often collected at the individual-level and are convenience samples from a population, choices must be made on the aggregation of these data for epidemiological inference. Our study seeks to understand the influence of spatial aggregation choice on our understanding of disease spread with a case study of influenza-like illness in the United States.

Methods Using U.S. medical claims data from 2002 to 2009, we examined the epidemic source location, onset and peak season timing, and epidemic duration of influenza seasons for data aggregated to the county and state scales. We also compared spatial autocorrelation and tested the relative magnitude of spatial aggregation differences between onset and peak measures of disease burden.

Results We found discrepancies in the inferred epidemic source locations and estimated influenza season onsets and peaks when comparing county and state-level data. Spatial autocorrelation was detected across more expansive geographic ranges during the peak season as compared to the early flu season, and there were greater spatial aggregation differences in early season measures as well.

Conclusions Epidemiological inferences are more sensitive to spatial scale early on during U.S. influenza seasons, when there is greater heterogeneity in timing, intensity, and geographic spread of the epidemics. Users of non-traditional disease surveillance should carefully consider how to extract accurate disease signals from finer-scaled data for early use in disease outbreaks.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

ECL received a dissertation support grant from the Jayne Koskinas Ted Giovanis Foundation for Health and Policy. This work was also supported by the RAPIDD Program of the Science & Technology Directorate, Department of Homeland Security and the Fogarty International Center, National Institutes of Health.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

All analyses were performed with aggregated time series data for influenza-like illness rather than patient-level information. This study was evaluated by the Institutional Review Board of Georgetown University and deemed exempt.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ali.arab{at}georgetown.edu, vittoria.colizza{at}inserm.fr, shweta.bansal{at}georgetown.edu)

Data Availability

The medical claims database is not publicly available; they were obtained from IMS Health, now IQVIA, which may be contacted at https://www.iqvia.com/. All model code is available on GitHub at https://github.com/eclee25/flu-SDI-scales.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted April 22, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Spatial aggregation choice in the era of digital and administrative surveillance data
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Spatial aggregation choice in the era of digital and administrative surveillance data
Elizabeth C. Lee, Ali Arab, Vittoria Colizza, Shweta Bansal
medRxiv 2021.04.22.21255643; doi: https://doi.org/10.1101/2021.04.22.21255643
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Spatial aggregation choice in the era of digital and administrative surveillance data
Elizabeth C. Lee, Ali Arab, Vittoria Colizza, Shweta Bansal
medRxiv 2021.04.22.21255643; doi: https://doi.org/10.1101/2021.04.22.21255643

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (179)
  • Allergy and Immunology (434)
  • Anesthesia (99)
  • Cardiovascular Medicine (948)
  • Dentistry and Oral Medicine (178)
  • Dermatology (110)
  • Emergency Medicine (260)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (422)
  • Epidemiology (8989)
  • Forensic Medicine (4)
  • Gastroenterology (420)
  • Genetic and Genomic Medicine (1959)
  • Geriatric Medicine (190)
  • Health Economics (402)
  • Health Informatics (1329)
  • Health Policy (660)
  • Health Systems and Quality Improvement (519)
  • Hematology (212)
  • HIV/AIDS (420)
  • Infectious Diseases (except HIV/AIDS) (10809)
  • Intensive Care and Critical Care Medicine (575)
  • Medical Education (200)
  • Medical Ethics (54)
  • Nephrology (222)
  • Neurology (1830)
  • Nursing (110)
  • Nutrition (274)
  • Obstetrics and Gynecology (353)
  • Occupational and Environmental Health (470)
  • Oncology (999)
  • Ophthalmology (298)
  • Orthopedics (111)
  • Otolaryngology (182)
  • Pain Medicine (126)
  • Palliative Medicine (44)
  • Pathology (265)
  • Pediatrics (580)
  • Pharmacology and Therapeutics (276)
  • Primary Care Research (234)
  • Psychiatry and Clinical Psychology (1904)
  • Public and Global Health (4123)
  • Radiology and Imaging (676)
  • Rehabilitation Medicine and Physical Therapy (368)
  • Respiratory Medicine (549)
  • Rheumatology (225)
  • Sexual and Reproductive Health (191)
  • Sports Medicine (177)
  • Surgery (207)
  • Toxicology (39)
  • Transplantation (109)
  • Urology (81)