Abstract
The wealth of information captured by multiparameter flow cytometry (MFC) can be analyzed by recent methods of computer vision when represented as a single image file. We therefore transformed MFC raw data into a multicolor 2D image by a self-organizing map (SOM) and classified this representation using a convolutional neural network (CNN). By this means, we built an artificial intelligence that is not only able to distinguish diseased from healthy samples, but that can also differentiate seven subtypes of mature B-cell neoplasm (B-NHL). We trained our model with 18,274 cases including chronic lymphocytic leukemia (CLL) and its precursor monoclonal B-cell lymphocytosis (MBL), marginal zone lymphoma (MZL), mantle cell lymphoma (MCL), prolymphocytic leukemia (PL), follicular lymphoma (FL), hairy cell leukemia (HCL), lymphoplasmacytic lymphoma (LPL) and achieved a weighted F1 score of 0.94 on a separate test set of 2,348 cases. Furthermore, we estimated the trustworthiness of a classification and could classify 70% of all cases with a confidence of 0.95 and higher. Our performance analyses indicate that particularly for rare subtypes further improvement can be expected when even more samples are available for training.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
M.Z. (primary author) received a PhD grant from Berlin Institute of Health. No additional funding from third parties were received.
Author Declarations
All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.
Yes
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All data are available upon reasonable request. Please send request to P.M.K pkrawitz@uni-bonn.de