Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Multi-country and intersectoral assessment of cluster congruence between different bioinformatics pipelines for genomics surveillance of foodborne bacterial pathogens

View ORCID ProfileVerónica Mixão, View ORCID ProfileMiguel Pinto, View ORCID ProfileHolger Brendebach, View ORCID ProfileDaniel Sobral, João Dourado Santos, Nicolas Radomski, Anne Sophie Majgaard Uldall, Arkadiusz Bomba, Michael Pietsch, Andrea Bucciacchio, Andrea de Ruvo, Pierluigi Castelli, Ewelina Iwan, Sandra Simon, View ORCID ProfileClaudia E. Coipan, View ORCID ProfileJörg Linde, View ORCID ProfileLiljana Petrovska, View ORCID ProfileRolf Sommer Kaas, View ORCID ProfileKatrine Grimstrup Joensen, View ORCID ProfileSofie Holtsmark Nielsen, Kristoffer Kiil, View ORCID ProfileKarin Lagesen, View ORCID ProfileAdriano Di Pasquale, View ORCID ProfileJoão Paulo Gomes, View ORCID ProfileCarlus Deneke, View ORCID ProfileSimon H. Tausch, View ORCID ProfileVítor Borges
doi: https://doi.org/10.1101/2024.07.24.24310933
Verónica Mixão
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Verónica Mixão
Miguel Pinto
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Miguel Pinto
Holger Brendebach
2National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Holger Brendebach
Daniel Sobral
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Daniel Sobral
João Dourado Santos
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicolas Radomski
3National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anne Sophie Majgaard Uldall
4Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arkadiusz Bomba
5Department of Omics Analyses, National Veterinary Research Institute (PIWet), Puławy, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Pietsch
6Unit of Enteropathogenic Bacteria and Legionella, Robert Koch Institute (RKI), Wernigerode, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrea Bucciacchio
3National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrea de Ruvo
3National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pierluigi Castelli
3National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ewelina Iwan
5Department of Omics Analyses, National Veterinary Research Institute (PIWet), Puławy, Poland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sandra Simon
6Unit of Enteropathogenic Bacteria and Legionella, Robert Koch Institute (RKI), Wernigerode, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claudia E. Coipan
7Department for Infectious Diseases, Epidemiology and Surveillance, National Institute for Public Health and the Environment (RIVM), Bilthoven, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Claudia E. Coipan
Jörg Linde
8Institute of Bacterial Infections and Zoonoses, Friedrich-Loeffler-Institute (FLI), Jena, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jörg Linde
Liljana Petrovska
9Animal and Plant Health Agency (APHA), Addlestone, Surrey, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Liljana Petrovska
Rolf Sommer Kaas
10National Food Institute, Technical University of Denmark (DTU), Lyngby, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rolf Sommer Kaas
Katrine Grimstrup Joensen
4Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Katrine Grimstrup Joensen
Sofie Holtsmark Nielsen
4Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sofie Holtsmark Nielsen
Kristoffer Kiil
4Department of Bacteria, Parasites & Fungi, Statens Serum Institut (SSI), Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Karin Lagesen
11Section for Epidemiology, Norwegian Veterinary Institute (NVI), Norway
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Karin Lagesen
Adriano Di Pasquale
3National Reference Centre (NRC) for Whole Genome Sequencing of microbial pathogens: database and bioinformatics analysis (GENPAT), Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Adriano Di Pasquale
João Paulo Gomes
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
12Veterinary and Animal Research Center (CECAV), Faculty of Veterinary Medicine, Lusófona University, Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for João Paulo Gomes
Carlus Deneke
2National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Carlus Deneke
Simon H. Tausch
2National Study Center for Sequencing, Department of Biological Safety, German Federal Institute for Risk Assessment (BfR), Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Simon H. Tausch
Vítor Borges
1Genomics and Bioinformatics Unit, Department of Infectious Diseases, National Institute of Health Doutor Ricardo Jorge (INSA), Lisbon, Portugal
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Vítor Borges
  • For correspondence: vitor.borges{at}insa.min-saude.pt
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Food and waterborne disease (FWD) surveillance requires Whole-Genome Sequencing (WGS)-based systems following a One Health approach. However, different laboratories employ different WGS pipelines in their routine surveillance activities, casting doubt on the comparability of their results and hindering optimal communication at intersectoral and international levels. Through a collaborative effort involving eleven European institutes across seven countries and spanning the food, animal and human health sectors, we aimed to assess the inter-laboratory comparability of WGS clustering results for four important foodborne pathogens: Listeria monocytogenes, Salmonella enterica, Escherichia coli and Campylobacter jejuni. Each participating institute (n=9) applied its surveillance pipeline over the same WGS datasets (>2000 isolates per species), and, for each pipeline, genetic clusters were identified at each possible allele/SNP distance threshold. Inter-pipeline clustering congruence was assessed by calculating a “Congruence Score” (relying on Adjusted Wallace and Adjusted Rand coefficients) across all resolution levels, followed by an in-depth comparative analysis of cluster composition at outbreak level. An additional cluster congruence assessment was performed between WGS and traditional typing, which, depending on the species, included Sequence Type (ST), Clonal Complex (CC) and/or serotype. Our results revealed a general high concordance between allele-based pipelines at all resolution levels for all species, except for C. jejuni, where the different resolution power of available allele-based schemas led to marked discrepancies. Still, this study identified non-negligible differences in allele-based pipeline performance for outbreak cluster detection, suggesting that a threshold flexibilization is important for the detection of similar outbreak signals by different laboratories. These results, together with the observation that different STs, CCs and serotypes exhibit remarkably different genetic diversity, should inform future threshold selections for outbreak case definitions. In conclusion, this study provides valuable insights into the comparability of pipelines commonly used for routine genomics surveillance, and reinforces the need, while demonstrating the feasibility, of conducting continuous and comprehensive WGS pipeline comparability assessments. Ultimately, it opens good perspectives for a smoother international and intersectoral cooperation and communication towards a sustainable and efficient One Health FWD surveillance.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by co-funding from the European Union's Horizon 2020 Research and Innovation program under grant agreement No 773830: One Health European Joint Programme (2020 to 2022) (https://onehealthejp.eu/projects/foodborne-zoonoses/jrp-beone) and by the ISIDORe project (funding from the European Union's Horizon Europe Research & Innovation Programme, Grant Agreement no. 101046133). VM contribution was funded by national funds through FCT - Foundation for Science and Technology, I.P., in the frame of Individual CEEC 2022.00851.CEECIND/CP1748/CT0001 (2023 onwards). JDS contribution was supported by the project "Sustainable use and integration of enhanced infrastructure into routine genome-based surveillance and outbreak investigation activities in Portugal" (GENEO, https://www.insa.min-saude.pt/category/projectos/geneo/) on behalf of the EU4H programme (EU4H-2022-DGA-MS-IBA-1). Research at the National Veterinary Research Institute (PIWet) Poland was supported by the Polish Ministry of Education and Science from the funds for science in the years 2018-2022 allocated for the implementation of a co-financed international project.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

Anonymized sequencing reads of the BeONE dataset are deposited in the European Nucleotide Archive (ENA) database under the BioProjects PRJEB57166, PRJEB57179, PRJEB57098 and PRJEB57119. Genome assemblies are deposited in the Zenodo repository (L. monocytogenes: 10.5281/ZENODO.7267486; S. enterica: 10.5281/ZENODO.7267785; E. coli: 10.5281/ZENODO.7267844; C. jejuni: 10.5281/ZENODO.7267879). The public dataset data was retrieved from Zenodo (L. monocytogenes: 10.5281/ZENODO.7116878; S. enterica: 10.5281/ZENODO.7119735; E. coli: 10.5281/ZENODO.7120057; C. jejuni: 10.5281/ZENODO.7120166). The collection of scripts used to conduct these analyses are available at the github repository https://github.com/insapathogenomics/WGS_cluster_congruence. Supplementary data are available in the Zenodo repository (https://doi.org/10.5281/zenodo.12805750).

https://doi.org/10.5281/zenodo.12805750

https://doi.org/10.5281/ZENODO.7267486

https://doi.org/10.5281/ZENODO.7116878

https://doi.org/10.5281/ZENODO.7267785

https://doi.org/10.5281/ZENODO.7119735

https://doi.org/10.5281/ZENODO.7267844

https://doi.org/10.5281/ZENODO.7120057

https://doi.org/10.5281/ZENODO.7267879

https://doi.org/10.5281/ZENODO.7120166

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted July 25, 2024.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Multi-country and intersectoral assessment of cluster congruence between different bioinformatics pipelines for genomics surveillance of foodborne bacterial pathogens
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Multi-country and intersectoral assessment of cluster congruence between different bioinformatics pipelines for genomics surveillance of foodborne bacterial pathogens
Verónica Mixão, Miguel Pinto, Holger Brendebach, Daniel Sobral, João Dourado Santos, Nicolas Radomski, Anne Sophie Majgaard Uldall, Arkadiusz Bomba, Michael Pietsch, Andrea Bucciacchio, Andrea de Ruvo, Pierluigi Castelli, Ewelina Iwan, Sandra Simon, Claudia E. Coipan, Jörg Linde, Liljana Petrovska, Rolf Sommer Kaas, Katrine Grimstrup Joensen, Sofie Holtsmark Nielsen, Kristoffer Kiil, Karin Lagesen, Adriano Di Pasquale, João Paulo Gomes, Carlus Deneke, Simon H. Tausch, Vítor Borges
medRxiv 2024.07.24.24310933; doi: https://doi.org/10.1101/2024.07.24.24310933
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Multi-country and intersectoral assessment of cluster congruence between different bioinformatics pipelines for genomics surveillance of foodborne bacterial pathogens
Verónica Mixão, Miguel Pinto, Holger Brendebach, Daniel Sobral, João Dourado Santos, Nicolas Radomski, Anne Sophie Majgaard Uldall, Arkadiusz Bomba, Michael Pietsch, Andrea Bucciacchio, Andrea de Ruvo, Pierluigi Castelli, Ewelina Iwan, Sandra Simon, Claudia E. Coipan, Jörg Linde, Liljana Petrovska, Rolf Sommer Kaas, Katrine Grimstrup Joensen, Sofie Holtsmark Nielsen, Kristoffer Kiil, Karin Lagesen, Adriano Di Pasquale, João Paulo Gomes, Carlus Deneke, Simon H. Tausch, Vítor Borges
medRxiv 2024.07.24.24310933; doi: https://doi.org/10.1101/2024.07.24.24310933

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Public and Global Health
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (867)
  • Anesthesia (306)
  • Cardiovascular Medicine (4480)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (614)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15276)
  • Forensic Medicine (31)
  • Gastroenterology (1133)
  • Genetic and Genomic Medicine (6643)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4602)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1622)
  • Hematology (544)
  • HIV/AIDS (1275)
  • Infectious Diseases (except HIV/AIDS) (15959)
  • Intensive Care and Critical Care Medicine (1110)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (674)
  • Neurology (6692)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1152)
  • Occupational and Environmental Health (961)
  • Oncology (3369)
  • Ophthalmology (988)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (668)
  • Pediatrics (1703)
  • Pharmacology and Therapeutics (699)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5494)
  • Public and Global Health (9284)
  • Radiology and Imaging (2223)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1201)
  • Rheumatology (598)
  • Sexual and Reproductive Health (720)
  • Sports Medicine (535)
  • Surgery (720)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (266)