TY  - JOUR
T1  - Digital re-classification of equivocal dysplastic urothelial lesions using morphologic and immunohistologic analysis
JF  - medRxiv
DO  - 10.1101/2020.10.04.20206524
SP  - 2020.10.04.20206524
AU  - Camelia D Vrabie
AU  - Marius Gangal
Y1  - 2020/01/01
UR  - http://medrxiv.org/content/early/2020/10/06/2020.10.04.20206524.abstract
N2  - A precise diagnosis of precursor dysplastic urothelial lesions is critical for patients, but it can be a challenge for pathologists. Multiple immunohistologic markers (a panel) improve ambiguous diagnoses, but results are subjective, with a high degree of observer variability. Our research objective was to evaluate how a classification algorithm may support morphologic diagnosis. Data from 45 unequivocal cases of flat urothelial lesions (“training set”: 20 carcinomas in situ, 8 dysplastic and 17 reactive lesions) were used as ground truth to train a random tree classification algorithm. Fifty cases diagnosed as “atypia of unknown significance” (“diagnostic set”) were digitally re-classified, based on morphological and immunohistochemical features, as possible carcinoma in situ (20), dysplastic (17) and reactive atypia cases (13). The main sorting criterion was morphologic (nuclear area). A four-marker panel was used for a precise classification (74% correctly classified, 93% accuracy, 76% precision, averaged ROC = 0.828). Three cases were “false negatives”. The performance of the immunohistologic panel was evaluated based on a stain index, calculated for CD20, p53 and Ki67 and observed for CD44. Within the training set, the immunohistologic performance was high. In the diagnostic set, both the percentage of high stain index values for each marker and the percentage of cases with 2-3 strong markers were low, explaining the initially high number of equivocal cases. In conclusion, digital analysis of morphologic and immunohistologic features may help clarify the classification of equivocal urothelial lesions.
Computational pathology supports the diagnostic process, as it can measure features and handle data in a precise, reproducible and objective way. In our proof-of-concept study, the low number of cases and the (deliberate) absence of clinical data were the main limitations. Validation of the method on a larger number of cases and the use of genomic and clinical data are essential for improving the reliability of machine learning classification. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: None. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: Comisia locală de etică a Spitalului de Urgență Sfantul Ioan Bucuresti, number 1123, Jan 16, 2020. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. All information is provided in the article and in the Annex.
ER  - 