Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

De novo identification and visualization of important cell populations for classic Hodgkin lymphoma using flow cytometry and machine learning

View ORCID ProfilePaul D. Simonson, Yue Wu, David Wu, Jonathan R. Fromm, Aaron Y. Lee
doi: https://doi.org/10.1101/2020.12.18.20248526
Paul D. Simonson
1Department of Pathology and Laboratory Medicine, Weill Cornell Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Paul D. Simonson
  • For correspondence: pds9003@med.cornell.edu
Yue Wu
2Department of Ophthalmology, University of Washington
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Wu
3Department of Laboratory Medicine and Pathology, University of Washington
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan R. Fromm
3Department of Laboratory Medicine and Pathology, University of Washington
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aaron Y. Lee
2Department of Ophthalmology, University of Washington
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Objectives Automated classification of flow cytometry data has the potential to reduce errors and accelerate flow cytometry interpretation. We desired a machine learning approach that is accurate, intuitively easy to understand, and highlights the cells that are most important in the algorithm’s prediction for a given case.

Methods We developed an ensemble of convolutional neural networks (CNNs) for classification and visualization of impactful cell populations in detecting classic Hodgkin lymphoma, using two-dimensional (2D) histograms. Data from 977 and 245 clinical flow cytometry cases were used for training and testing, respectively. 78 non-gated 2D histograms were created per flow cytometry file. SHAP values were calculated to determine the most impactful 2D histograms and regions within the histograms. The SHAP values from all 78 histograms were then projected back to the original cells data for gating and visualization using standard flow cytometry software.

Results The algorithm achieved 67.7% recall (sensitivity), 82.4 % precision, and 0.92 AUROC. Visualization of the important cell populations in making individual predictions demonstrated correlations with known biology.

Conclusions The method presented enables model explainability while highlighting important cell populations in individual flow cytometry specimens, with potential applications in both diagnosis and discovery of previously overlooked key cell populations.

Competing Interest Statement

The authors have declared no competing interest.

Clinical Protocols

http://github.com/SimonsonLab/EnsembleCNN

Funding Statement

This work was supported by the National Eye Institute [K23EY029246] (AYL), an unrestricted grant from Research to Prevent Blindness (AYL), the University of Washington Department of Laboratory Medicine, a Roger Moe Fellowship (PDS), and the donation of a GPU by NVIDIA Corporation (PDS).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

In total, 1222 samples were analyzed using a nine-color classic Hodgkin lymphoma-specific flow cytometry panel and used in accordance with the University of Washington's institutional review board (IRB) approval. Given the retrospective use of the data is for clinical laboratory test quality and operations improvement and there is minimal risk for patient harm, patient written consent was deemed unnecessary and was therefore waived by the IRB of the University. All methods were carried out in accordance with relevant guidelines, regulations, and approval by the IRB of the University.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Algorithm code and updates will be made available at http://github.com/SimonsonLab/EnsembleCNN.

http://github.com/SimonsonLab/EnsembleCNN

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted December 22, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
De novo identification and visualization of important cell populations for classic Hodgkin lymphoma using flow cytometry and machine learning
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
De novo identification and visualization of important cell populations for classic Hodgkin lymphoma using flow cytometry and machine learning
Paul D. Simonson, Yue Wu, David Wu, Jonathan R. Fromm, Aaron Y. Lee
medRxiv 2020.12.18.20248526; doi: https://doi.org/10.1101/2020.12.18.20248526
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
De novo identification and visualization of important cell populations for classic Hodgkin lymphoma using flow cytometry and machine learning
Paul D. Simonson, Yue Wu, David Wu, Jonathan R. Fromm, Aaron Y. Lee
medRxiv 2020.12.18.20248526; doi: https://doi.org/10.1101/2020.12.18.20248526

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (70)
  • Allergy and Immunology (168)
  • Anesthesia (50)
  • Cardiovascular Medicine (451)
  • Dentistry and Oral Medicine (83)
  • Dermatology (55)
  • Emergency Medicine (157)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (191)
  • Epidemiology (5258)
  • Forensic Medicine (3)
  • Gastroenterology (195)
  • Genetic and Genomic Medicine (757)
  • Geriatric Medicine (80)
  • Health Economics (213)
  • Health Informatics (698)
  • Health Policy (358)
  • Health Systems and Quality Improvement (223)
  • Hematology (99)
  • HIV/AIDS (163)
  • Infectious Diseases (except HIV/AIDS) (5867)
  • Intensive Care and Critical Care Medicine (361)
  • Medical Education (104)
  • Medical Ethics (25)
  • Nephrology (83)
  • Neurology (764)
  • Nursing (43)
  • Nutrition (130)
  • Obstetrics and Gynecology (142)
  • Occupational and Environmental Health (231)
  • Oncology (479)
  • Ophthalmology (152)
  • Orthopedics (38)
  • Otolaryngology (95)
  • Pain Medicine (39)
  • Palliative Medicine (20)
  • Pathology (141)
  • Pediatrics (223)
  • Pharmacology and Therapeutics (136)
  • Primary Care Research (96)
  • Psychiatry and Clinical Psychology (862)
  • Public and Global Health (2011)
  • Radiology and Imaging (348)
  • Rehabilitation Medicine and Physical Therapy (158)
  • Respiratory Medicine (285)
  • Rheumatology (94)
  • Sexual and Reproductive Health (74)
  • Sports Medicine (76)
  • Surgery (109)
  • Toxicology (25)
  • Transplantation (29)
  • Urology (39)