TY - JOUR T1 - <em>De novo</em> identification and visualization of important cell populations for classic Hodgkin lymphoma using flow cytometry and machine learning JF - medRxiv DO - 10.1101/2020.12.18.20248526 SP - 2020.12.18.20248526 AU - Paul D. Simonson AU - Yue Wu AU - David Wu AU - Jonathan R. Fromm AU - Aaron Y. Lee Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/12/22/2020.12.18.20248526.abstract N2 - Objectives Automated classification of flow cytometry data has the potential to reduce errors and accelerate flow cytometry interpretation. We desired a machine learning approach that is accurate, intuitively easy to understand, and highlights the cells that are most important in the algorithm’s prediction for a given case.Methods We developed an ensemble of convolutional neural networks (CNNs) for classification and visualization of impactful cell populations in detecting classic Hodgkin lymphoma, using two-dimensional (2D) histograms. Data from 977 and 245 clinical flow cytometry cases were used for training and testing, respectively. 78 non-gated 2D histograms were created per flow cytometry file. SHAP values were calculated to determine the most impactful 2D histograms and regions within the histograms. The SHAP values from all 78 histograms were then projected back to the original cells data for gating and visualization using standard flow cytometry software.Results The algorithm achieved 67.7% recall (sensitivity), 82.4 % precision, and 0.92 AUROC. Visualization of the important cell populations in making individual predictions demonstrated correlations with known biology.Conclusions The method presented enables model explainability while highlighting important cell populations in individual flow cytometry specimens, with potential applications in both diagnosis and discovery of previously overlooked key cell populations.Competing Interest StatementThe authors have declared no competing interest.Clinical Protocols http://github.com/SimonsonLab/EnsembleCNN Funding StatementThis work was supported by the National Eye Institute [K23EY029246] (AYL), an unrestricted grant from Research to Prevent Blindness (AYL), the University of Washington Department of Laboratory Medicine, a Roger Moe Fellowship (PDS), and the donation of a GPU by NVIDIA Corporation (PDS).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:In total, 1222 samples were analyzed using a nine-color classic Hodgkin lymphoma-specific flow cytometry panel and used in accordance with the University of Washington's institutional review board (IRB) approval. Given the retrospective use of the data is for clinical laboratory test quality and operations improvement and there is minimal risk for patient harm, patient written consent was deemed unnecessary and was therefore waived by the IRB of the University. All methods were carried out in accordance with relevant guidelines, regulations, and approval by the IRB of the University.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAlgorithm code and updates will be made available at http://github.com/SimonsonLab/EnsembleCNN. http://github.com/SimonsonLab/EnsembleCNN ER -