PT - JOURNAL ARTICLE
AU - Aswolinskiy, Witali
AU - Wong, John K.L.
AU - Zapukhlyak, Myroslav
AU - Kindruk, Yulia
AU - Paulikat, Martin
AU - Aichmüller, Christian
TI - Fast Organ-of-Origin Classification for Digital Pathology Quality Control
AID - 10.64898/2026.02.03.26345443
DP - 2026 Jan 01
TA - medRxiv
PG - 2026.02.03.26345443
4099 - http://medrxiv.org/content/early/2026/02/04/2026.02.03.26345443.short
4100 - http://medrxiv.org/content/early/2026/02/04/2026.02.03.26345443.full
AB - Digitizing large histopathology archives requires processing millions of scanned whole slide images that must be validated rapidly. Automated organ-of-origin classification can accelerate quality control and enable early detection of mislabeled specimens. We developed a deep learning model that classifies the organ of origin from H&E-stained slides using a single low-resolution thumbnail per slide in under one second. For training, we used thumbnails from 16,624 slides from the TCGA and CPTAC archives, which contain mostly primary tumor resections. The images were categorized into 14 classes based on the most common primary sites in TCGA: Bladder, Brain, Breast, Colorectal, Kidney, Liver, Lung, Pancreas, Prostate, Skin, Stomach, Thyroid gland, Uterus, and Other (encompassing the remaining tissue types). We evaluated our approach on two independent external cohorts: a 5-class cohort with 2,857 slides (Colorectal, Kidney, Liver, Pancreas, Prostate) and a comprehensive 14-class cohort (12,348 slides). The model achieved 90% balanced accuracy for the 5-class cohort and 62% for the full 14-class cohort. Notably, when considering only the predictions with high confidence, 53% of the large cohort could be classified with 74% balanced accuracy. Manual review of high-confidence misclassifications suggested that some may reflect errors in the ground truth rather than model error. Mean model inference time was 0.2 s per slide on an NVIDIA L4 GPU.
Our deep learning approach demonstrates high classification performance with very low inference time, indicating its potential for real-time and cost-effective quality control in digital pathology.
Competing Interest Statement: All authors except M.P. are affiliated with PAICON GmbH. M.P. declares no competing interests.
Funding Statement: No external funding was received. The work was conducted as part of regular employment at PAICON GmbH.
Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes.
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: This study is a retrospective analysis of de-identified whole-slide images and associated metadata from publicly available datasets. No new patient data were collected and no patient contact occurred. Ethics approval and informed consent were not required for this study. The TCGA slides are available at https://portal.gdc.cancer.gov. The CPTAC slides are available at https://www.cancerimagingarchive.net. The PAIP slides are available at https://www.wisepaip.org. The VML slides are available at https://wirtualnymikroskop.mostwiedzy.pl.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes.
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes.
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes.
The TCGA slides are available at https://portal.gdc.cancer.gov. The CPTAC slides are available at https://www.cancerimagingarchive.net. The PAIP slides are available at https://www.wisepaip.org. The VML slides are available at https://wirtualnymikroskop.mostwiedzy.pl.
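
The abstract reports balanced accuracy both on a full cohort and on the subset of predictions made with high confidence. The following is a minimal sketch of how such a selective evaluation could be computed. It is not the authors' code. The record does not state how confidence was defined or which cutoff was used, so the sketch assumes confidence is the largest class probability and that predictions below a chosen `threshold` are set aside. The function names, the toy labels, and the probability values are all hypothetical.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Mean per-class recall over the classes that occur in y_true."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    if not total:
        raise ValueError("balanced accuracy is undefined for zero samples")
    return sum(correct[c] / total[c] for c in total) / len(total)


def selective_evaluation(y_true, probs, classes, threshold):
    """Evaluate only predictions whose top probability reaches `threshold`.

    Assumption: "confidence" is the maximum class probability.
    Returns (coverage, balanced accuracy on the retained slides).
    """
    kept_true, kept_pred = [], []
    for t, p in zip(y_true, probs):
        best = max(range(len(p)), key=p.__getitem__)  # index of the top class
        if p[best] >= threshold:
            kept_true.append(t)
            kept_pred.append(classes[best])
    coverage = len(kept_true) / len(y_true)
    bal_acc = balanced_accuracy(kept_true, kept_pred) if kept_true else float("nan")
    return coverage, bal_acc


# Hypothetical toy data: three classes, four slides.
classes = ["Kidney", "Liver", "Lung"]
y_true = ["Kidney", "Kidney", "Liver", "Lung"]
probs = [
    [0.90, 0.05, 0.05],
    [0.40, 0.50, 0.10],
    [0.10, 0.80, 0.10],
    [0.30, 0.30, 0.40],
]

coverage, bal_acc = selective_evaluation(y_true, probs, classes, threshold=0.7)
print(f"coverage={coverage:.2f}, balanced accuracy={bal_acc:.2f}")
```

Raising the threshold lowers coverage and usually raises balanced accuracy on the retained slides, which is the trade-off the abstract describes for the 14-class cohort. Note that balanced accuracy on the retained subset averages only over the classes that still have at least one slide after filtering.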