TY - JOUR T1 - Detection of Colorectal Adenocarcinoma and Grading Dysplasia on Histopathologic Slides Using Deep Learning JF - medRxiv DO - 10.1101/2022.09.19.22280112 SP - 2022.09.19.22280112 AU - June Kim AU - Naofumi Tomita AU - Arief A. Suriawinata AU - Saeed Hassanpour Y1 - 2022/01/01 UR - http://medrxiv.org/content/early/2022/09/22/2022.09.19.22280112.abstract N2 - Colorectal cancer is one of the most common types of cancer among men and women. The grading of dysplasia and the detection of adenocarcinoma are important clinical tasks in the diagnosis of colorectal cancer and shape the patients’ follow-up plans. This study evaluates the feasibility of deep learning models for the classification of colorectal lesions into four classes: benign, low-grade dysplasia, high-grade dysplasia, and adenocarcinoma. To this end, we develop a deep neural network on a training set of 655 whole-slide images of digitized colorectal resection slides from a tertiary medical institution and evaluate it on an internal test set of 234 slides, as well as on an external test set of 606 adenocarcinoma slides from The Cancer Genome Atlas database. Our model achieves an overall accuracy, sensitivity, and specificity of 95.5%, 91.0%, and 97.1% on the internal test set and an accuracy and sensitivity of 98.5% for adenocarcinoma detection task on the external test set. Our results suggest that such deep learning models can potentially assist pathologists in grading colorectal dysplasia, detecting adenocarcinoma, prescreening, and prioritizing the reviewing of suspicious cases to improve the turnaround time for patients with a high risk of colorectal cancer. Furthermore, the high sensitivity on the external test set suggests our model’s generalizability in detecting colorectal adenocarcinoma on whole slide images across different institutions.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis research was supported in part by grants from the US National Library of Medicine (R01LM012837 and R01LM013833) and the US National Cancer Institute (R01CA249758).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study and the use of human participant data in this project were approved by the Dartmouth-Hitchcock Health Institutional Review Board (IRB) with a waiver of informed consent. The conducted research reported in this study is in accordance with this approved Dartmouth-Hitchcock Health IRB protocol and the World Medical Association Declaration of Helsinki on Ethical Principles for Medical Research involving Human Subjects.I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesTCGA data can be downloaded from the websites: https://portal.gdc.cancer.gov/projects/TCGA-COAD and https://portal.gdc.cancer.gov/projects/TCGA-READ. The DHMC dataset used in this study is not publicly available due to patient privacy constraints. An anonymized version of this dataset can be generated and shared upon reasonable request from the corresponding author. ER -