Weakly supervised and transfer learning for adenocarcinoma classification in transurethral resection of the prostate whole slide images

Masayuki Tsuneki; Makoto Abe; Fahdi Kanavati

doi:10.1101/2022.04.20.22274062

Abstract

The transurethral resection of the prostate (TUR-P) is generally considered an option for benign prostatic diseases especially nodular hyperplasia patients who have moderate to severe urinary problems that have not responded to medication. Importantly, incidental prostate cancer are diagnosed at the time of TUR-P for benign prostatic disease. Since diagnosing a large number of cases containing TUR-P specimens which are characterized by a very large volume of tissue fragments by pathologists using a conventional microscope is time-consuming and limited in terms of human resources. Thus, it is necessary to develop new techniques which can rapidly and accurately screen large numbers of TUR-P specimens. Computational pathology applications which can assist pathologists in detecting prostate adenocarcinoma from TUR-P whole slide images (WSIs) would be of great benefit for routine histopathological workflow. In this study, we trained deep learning models to classify TUR-P WSIs into prostate adenocarcinoma and benign (non-neoplastic) lesions using transfer and weakly supervised learning. We evaluated the models on TUR-P, needle biopsy, and The Cancer Genome Atlas (TCGA) public dataset test sets, achieving an ROC-AUC up to 0.984 in TUR-P test sets for adenocarcinoma. The results demonstrate the high promising potential of deployment in a practical TUR-P histopathological diagnostic workflow system.

1 Introduction

According to the Global Cancer Statistics 2020, prostate cancer is the most frequently diagnosed cancer in men in over one-half (112 of 185) of the countries in the world and the second-most-frequent cancer and the fifth leading cause of cancer death among men in 2020 with an estimated 1,414,259 new cases and 375,304 deaths worldwide Sung et al (2021). Histopathological confirmation is necessary because early carcinomas are not able to be distinguished with assurance from foci of nodular hyperplasia and inflammatory lesions.

Nodular hyperplasia (benign prostatic hyperplasia) is a common benign disorder of the prostate which represents a nodular enlargement of the gland caused by hyperplasia of both glandular and stromal components, resulting in an increase in the weight of the prostate. The conventional treatment for nodular hyperplasia is surgical and the transurethral resection of the prostate (TUR-P) is one of the widely practiced surgical procedures, which is estimated to perform about 7,864 cases annually in 2014 in Japan Takamori et al (2017). TUR-P can be both diagnostic and therapeutic when patients have obstructive symptoms, high PSA, and negative prostate needle biopsies Dellavedova et al (2010). In TUR-P, an electrical loop of resectoscope excises hyperplastic prostate tissues to improve urine flow, resulting to produce many tiny tissue fragments during the procedure. Therefore, there are many wide range of tissue fragments in a single slide glass to be observed by pathologists. As compared to conventional biopsy specimens (e.g., endoscopic biopsy from gastrointestinal tracts), TUR-P specimens are characterized by a very large volume of tissues and a large number of glass slides; therefore, histopathological diagnosis for TUR-P specimen is one of the most tedious and error-prone tasks because it is difficult to determine the orientation of the specimen as well as there are strong tissue artifacts. Importantly, cancers especially prostate adenocarcinoma are detected incidentally around 5-17% of TUR-P specimens Jones et al (2009); Zigeuner et al (2003); Yoo et al (2012); Sakamoto et al (2014); Trpkov et al (2008); Dellavedova et al (2010); Otto et al (2014). Conventional active treatment (surgery or radiotherapy) is indicated in T1a patients with life expectancy longer than 10 years, and in the majority of T1b patients Dellavedova et al (2010). Precised histopathological evaluation of cancer (adenocarcinoma) in TUR-P specimen is very important because the presence of cancer in more than 5% of the tissue fragments Epstein et al (1986); Van Andel et al (1995) or high-grade cancer Andrén et al (2006); Egevad et al (2002) may affect the choice of treatment. Thus, for TUR-P specimens, reporting both the number of microscopic foci of carcinoma and the percentage of carcinomatous involvement is recommended. All these factors and burdens mentioned above highlight the benefit of establishing a histopathological screening system to detect prostate adenocarcinoma based on TUR-P specimens. Conventional glass slides of TUR-P specimens can be digitized as whole slide images (WSIs) which could great benefit from the application of computational histopathology algorithms especially deep learning models to aid pathologists allowing the potential of reducing the burden of time-consuming diagnosis and increasing the appropriate detection rate of prostate adenocarcinoma in TUR-P WSIs as part of a screening system.

In computational pathology, deep learning models have been widely applied in histopathological cancer classification on WSIs, cancer cell detection and segmentation, and the stratification of patient outcomes Yu et al (2016); Hou et al (2016); Madabhushi and Lee (2016); Litjens et al (2016); Kraus et al (2016); Korbar et al (2017); Luo et al (2017); Coudray et al (2018); Wei et al (2019); Gertych et al (2019); Bejnordi et al (2017); Saltz et al (2018); Campanella et al (2019); Iizuka et al (2020). Previous works have looked into applying deep learning models for adenocarcinoma classification in stomach Iizuka et al (2020); Kanavati and Tsuneki (2021b); Kanavati et al (2021a), colon Iizuka et al (2020); Tsuneki and Kanavati (2021), lung Kanavati and Tsuneki (2021b); Kanavati et al (2021b), and breast Kanavati and Tsuneki (2021a); Kanavati et al (2022) histopathological specimen WSIs. In a previous study, we trained a prostate adenocarcinoma classification model on needle biopsy WSIs Tsuneki et al (2022) and evaluated the models on both needle biopsy and TUR-P WSI test sets to confirm their application on different types of specimens, achieving an ROC-AUC up to 0.978 in needle biopsy test sets; however, the model under-performed on TUR-P WSIs. Therefore, in this study, we trained deep learning models specifically for TUR-P WSIs. We evaluated the trained models on TUR-P, needle biopsy, and TCGA (The Cancer Genome Atlas) public dataset test sets, achieving an ROC-AUC up to 0.984 in TUR-P test sets, 0.913 in needle biopsy test sets, and 0.947 in TCGA public dataset test sets. These findings suggest that deep learning models might be very useful as routine histopathological diagnostic aids for inspecting TUR-P WSIs to detect prostate adenocarcinoma precisely.

2 Materials and methods

2.1 Clinical cases and pathological records

Retrospectively, a total of 2,060 H&E (hematoxylin & eosin) stained histopathological specimen slides of human prostate adenocarcinoma and benign (non-neoplastic) lesions –1,560 TUR-P and 500 needle biopsy– were collected from the surgical pathology files of total three hospitals: Shinyukuhashi, Wajiro, and Shinkuki hospitals (Kamachi Group Hospitals, Fukuoka, Japan), after histopathological review of all specimens by surgical pathologists. The histopathological specimens were selected randomly to reflect a real clinical settings as much as possible. Prior to the experimental procedures, each WSI diagnosis was observed by at least two pathologists with the final checking and verification performed by senior pathologists. All WSIs were scanned at a magnification of x20 using the same Leica Aperio AT2 Digital Whole Slide Scanner (Leica Biosystems, Tokyo, Japan) and were saved as SVS file format with JPEG2000 compression.

2.2 Dataset

Hospitals which provided histopathological specimen slides in the present study were anonymised (e.g., Hospital-A, B, and C). Table 1 breaks down the distribution of training and validation sets of TUR-P WSIs from two domestic hospitals (Hospital-A and B). Validation sets were selected randomly from the training sets (Table 1). The test sets consisted of TUR-P, needle biopsy, and TCGA (The Cancer Genome Atlas) public dataset WSIs (Table 2). The distribution of test sets from three domestic hospitals (Hospital-A, B, and C) and TCGA public dataset was summarized in Table 2. The patients’ pathological records were used to extract the WSIs’ pathological diagnoses and to assign WSI labels. Training set WSIs were not annotated, and the training algorithm only used the WSI diagnosis labels, meaning that the only information available for the training was whether the WSI contained adenocarcinoma or was benign (non-neoplastic lesion), but no information about the location of the cancerous tissue lesions. The external prostate TCGA datasets are publicly available through the Genomic Data Commons (GDC) Data Portal (https://portal.gdc.cancer.gov/). We have confirmed that surgical pathologists were able to diagnose test sets in Table 2 from visual inspection of the H&E stained slide WSIs alone.

View this table:

Table 1:

Distribution of transuretheral resection of the prostate (TUR-P) whole-slide images (WSIs) in the training and validation sets obtained from two hospitals (A and B)

View this table:

Table 2:

Distribution of whole slide images (WSIs) in the transuretheral resection of the prostate (TUR-P), public dataset (TCGA), and needle biopsy test sets obtained from three hospitals (A-C)

2.3 Deep learning models

We trained the models via transfer learning using the partial fine-tuning approachKanavati and Tsuneki (2021c). This is an efficient fine-tuning approach that consists of using the weights of an existing pre-trained model and only fine-tuning the affine parameters of the batch normalization layers and the final classification layer. For the model architecture, we used EfficientNetB1 Tan and Le (2019) starting with pre-trained weights on ImageNet. Figure 1 shows an overview of the training method. The training methodology that we used in the present study was exactly the same as reported in our previous studies Kanavati and Tsuneki (2021b); Tsuneki et al (2022). For completeness we repeat the methodology here.

Fig. 1:

Schematic diagrams of training method overview. (A) shows a representative zoomed-in example of a histopathological patch from a transurethral resection of the prostate (TUR-P) whole slide image (WSI). During training (B), we iteratively alternated between inference and training. During the inference step, the model weights were frozen and the model was used to select tiles with the highest probability after applying it on the entire tissue regions of each WSI. The top k tiles with the highest probabilities were then selected from each WSI and placed into a queue. During training, the selected tiles from multiple WSIs formed a training batch and were used to train the model.

We performed slide tiling by extracting square tiles from tissue regions of the WSIs. We started by detecting the tissue regions in order to eliminate most of the white background. We did this by performing a thresholding on a grayscale version of the WSIs using Otsu’s method Otsu (1979). During prediction, we performed the tiling of the tissue regions in a sliding window fashion, using a fixed-size stride. During training, we initially performed random balanced sampling of tiles extracted from the tissue regions, where we tried to maintain an equal balance of each label in the training batch. To do so, we placed the WSIs in a shuffled queue such that we looped over the labels in succession (i.e., we alternated between picking a WSI with a positive label and a negative label). Once a WSI was selected, we randomly sampled tiles from each WSI to form a balanced batch.

To maintain the balance on the WSI, we oversampled from the WSIs to ensure the model trained on tiles from all of the WSIs in each epoch. We then switched to hard mining of tiles. To perform the hard mining, we alternated between training and inference. During inference, the CNN was applied in a sliding window fashion on all of the tissue regions in the WSI, and we then selected the k tiles with the highest probability for being positive. This step effectively selects the tiles that are most likely to be false positives when the WSI is negative. The selected tiles were placed in a training subset, and once that subset contained N tiles, the training was run. We used k = 8, N = 256, and a batch size of 32.

To obtain a single prediction for the WSIs from the the tile predictions, we took the maximum probability from all of the tiles. We used the Adam optimizer Kingma and Ba (2014), with the binary cross-entropy as the loss function, with the following parameters: beta₁ = 0.9, beta₂ = 0.999, a batch size of 32, and a learning rate of 0.001 when fine-tuning. We used early stopping by tracking the performance of the model on a validation set, and training was stopped automatically when there was no further improvement on the validation loss for 10 epochs. We chose the model with the lowest validation loss as the final model.

2.4 Software and statistical analysis

The deep learning models were implemented and trained using TensorFlow Abadi et al (2015). AUCs were calculated in python using the scikit-learn package Pedregosa et al (2011) and plotted using matplotlib Hunter (2007). The 95% CIs of the AUCs were estimated using the bootstrap method Efron and Tibshirani (1994) with 1000 iterations.

The true positive rate (TPR) was computed as and the false positive rate (FPR) was computed as Where TP, FP, and TN represent true positive, false positive, and true negative, respectively. The ROC curve was computed by varying the probability threshold from 0.0 to 1.0 and computing both the TPR and FPR at the given threshold.

2.5 Availability of data and material

The datasets generated during and/or analysed during the current study are not publicly available due to specific institutional requirements governing privacy protection but are available from the corresponding author on reasonable request. The datasets that support the findings of this study are available from Kamachi Group Hospitals (Fukuoka, Japan), but restrictions apply to the availability of these data, which were used under a data use agreement which was made according to the Ethical Guidelines for Medical and Health Research Involving Human Subjects as set by the Japanese Ministry of Health, Labour and Welfare (Tokyo, Japan), and so are not publicly available. However, the data are available from the authors upon reasonable request for private viewing and with permission from the corresponding medical institutions within the terms of the data use agreement and if compliant with the ethical and legal requirements as stipulated by the Japanese Ministry of Health, Labour and Welfare.

2.6 Code availability

To train the classification model in this study we adapted the publicly available TensorFlow training script available at https://github.com/tensorflow/models/tree/master/official/vision/image_classification.

3 Results

3.1 Insufficient AUC performance of WSI prostate adenocarcinoma evaluation on TUR-P WSIs using existing series of adenocarcinoma classification models

Prior to the training of prostate adenocarcinoma model using TUR-P WSIs, we have demonstrated the existing adenocarcinoma classification models AUC performances on TUR-P test sets (Table 2). Existing adenocarcinoma classification models were summarized in Table 3: (1) breast invasive ductal carcinoma (IDC) classification model (Breast IDC (x10, 512)) Kanavati and Tsuneki (2021a), (2) breast invasive ductal carcinoma and ductal carcinoma in-situ (DCIS) classification model (Breast IDC, DCIS (x10, 224)) Kanavati et al (2022), (3) colon adenocarcinoma (ADC) and adenoma (AD) classification model (Colon ADC, AD (x10, 512)) Iizuka et al (2020), (4) colon poorly differentiated adenocarcinoma classification model (transfer learning model from stomach poorly differentiated adenocarcinoma classification model) (Colon poorly ADC-1 (x20, 512)) Tsuneki and Kanavati (2021), (5) colon poorly differentiated adenocarcinoma classification model (EfficientNetB1 trained model) (Colon poorly ADC-2 (x20, 512)) Tsuneki and Kanavati (2021), (6) stomach adenocarcinoma and adenoma classification model (Stomach ADC, AD (x10, 512)) Iizuka et al (2020), (7) stomach poorly differentiated adenocarcinoma classification model (Stomach poorly ADC (x20, 224)) Kanavati and Tsuneki (2021b), (8) stomach signet ring cell carcinoma (SRCC) classification model (Stomach SRCC (x10, 224)) Kanavati et al (2021a), (9) pancreas endoscopic ultrasound guided fine needle aspiration (EUS-FNA) biopsy adenocarcinoma classification model (Pancreas EUS-FNA ADC (x10, 224)) Naito et al (2021), and (10) lung carcinoma classification model (Lung Carcinoma (x10, 512)) Kanavati et al (2020). Table 3 shows that Colon poorly ADC-2 (x20, 512) and Lung Carcinoma (x10, 512) models exhibited both high ROC-AUC and low log loss values as compared to other models. Thus, we have trained the models based on the Colon poorly ADC-2 (x20, 512) and Lung Carcinoma (x10, 512) models using TUR-P training sets (Table 1).

View this table:

Table 3:

ROC-AUC and log loss results for adenocarcinoma classification on transuretheral resection of the prostate (TUR-P) test sets (Hospital-A-B) using existing adenocarcinoma classification models

3.2 High AUC performance of TUR-P WSI evaluation of prostate adenocarcinoma histopathology images

We have trained models using transfer learning (TL) and weakly supervised learning approaches which could be used with weak labels (WSI labels) Kanavati et al (2020); Tsuneki et al (2022). At the same time, we trained using the EfficientNetB1 convolutional neural network (CNN) architecture at magnification x10 and x20. The models were applied in a sliding window fashion with input tiles of 224×224 and 512×512 pixels and a stride of 256 (Fig. 1). To train the deep learning models, we used a total of 79 adenocarcinoma and 941 benign training set WSIs and 20 adenocarcinoma and 20 benign validation set WSIs (Table 1). This resulted in four different models: (1) TL-colon poorly ADC-2 (x20, 512), (2) TL-lung carcinoma (x10, 512), (3) EfficientNetB1 (x10, 224), and (4) EfficientNetB1 (x20, 512). We have evaluated four different trained deep learning models on test sets from three different hospitals (Hospital-A-C) and TCGA public datasets (Table 2). For each test set (TUR-P: Hospital-A-B, TUR-P: Hospital-A, TUR-P: Hospital-B, public dataset: TCGA, and needle biopsy: Hospital-A-C), we computed the ROC-AUC, log loss, accuracy, sensitivity, and specificity and summarized in Table 4 & 5 and Fig. 2. The transfer learning model (TL-colon poorly ADC-2 (x20, 512)) (Fig. 2A) from existing colon poorly differentiated adenocarcinoma classification model (Colon poorly ADC-2 (x20, 512)) Tsuneki and Kanavati (2021) trained using TUR-P training sets have higher ROC-AUCs and lower log losses compared to the other models (TL-lung carcinoma (x10, 512), EfficientNetB1 (x10, 224), and EfficientNetB1 (x20, 512)) (Table 4, Fig. 2). On the other hand, on the TUR-P hospital-B test sets, both EfficientNetB1 (x10, 224) and EfficientNetB1 (x20, 512) models exhibited very high ROC-AUCs (0.924 - 0.973) and low log-losses (0.126 - 0.251) as compared to the other test sets (TUR-P Hospital-A, public dataset, and needle biopsy) (Table 4 and Fig. 2C, D). Looking at heatmap images of the same TUR-P WSI which were correctly predicted as prostate adenocarcinoma using four different trained models, both EfficientNetB1 (x10, 224) and EfficientNetB1 (x20, 512) models falsely predicted adenocarcinoma on the marked blue-dots which pathologists marked when they performed diagnosis (Fig. 3C, D). In contrast, both TL-colon poorly ADC-2 (x20, 512) and TL-lung carcinoma (x10, 512) models precisely predicted adenocarcinoma (Fig. 3A, B). The model (TL-colon poorly ADC-2 (x20, 512)) achieved highest ROC-AUC of 0.984 (CI: 0.956 - 1.000) and lowest log loss of 0.127 (CI: 0.076 - 0.205) for prostate adenocarcinoma classification in TUR-P hospital-A test sets and also achieved high ROC-AUCs in public dataset (0.947, CI: 0.922 - 0.972) and needle biopsy test sets (0.913, CI: 0.887 - 0.939) (Table 4). In all test sets, the model ((TL-colon poorly ADC-2 (x20, 512)) achieved very high accuracy (0.821 - 0.969), sensitivity (0.764 - 0.900), and specificity (0.884 - 0.992) (Table 5). As shown in Fig. 2 & 3 and Table 4 & 5, the model (TL-colon poorly ADC-2 (x20, 512)) is fully applicable for prostate adenocarcinoma classification in TUR-P WSIs as well as TCGA public WSI dataset and even needle biopsy WSIs. Figures 4, 5, 6, 7, and 8 show representative WSIs of true-positive, true-negative, false-positive, and false-negative, respectively from using the model (TL-colon poorly ADC-2 (x20, 512)).

View this table:

Table 4:

ROC-AUC and log loss results for adenocarcinoma classification on the transuretheral resection of the prostate (TUR-P), public dataset (TCGA), and needle biopsy test sets using trained models

View this table:

Table 5:

Scores of accuracy, sensitivity, and specificity on the transuretheral resection of the prostate (TUR-P), public dataset (TCGA), and needle biopsy test sets using the best model (TL-colon poorly ADC-2 (x20, 512))

Fig. 2:

ROC curves with AUCs from four different trained deep learning models (A-D) on the test sets: (A) transfer learning (TL) model from existing colon poorly differentiated adenocarcinoma (ADC) classification model with tile size 224 px and magnification at x20, (B) TL model from existing lung carcinoma classification model with tile size 512 px and magnification at x10, (C) EfficientNetB1 model with tile size 224 px and magnification at x10, and (D) EfficientNetB1 model with tile size 512 px and magnification at x20.

Fig. 3:

Comparison of adenocarcinoma prediction in transurethral resection of the prostate (TUR-P) whole slide image (WSI) of four trained deep learning models (A-D). In transfer learning (TL) models from colon poorly differentiated adenocarcinoma (A) and lung carcinoma (B), the heatmap images show true positive prediction of adenocarcinoma where pathologists marked surrounding with blue-dots. In EfficientNetB1 models (C, D), the heatmap images show false-positive prediction of adenocarcinoma on the marked blue-dots. The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

Fig. 4:

A representative example of prostate adenocarcinoma true positive prediction outputs on a whole slide image (WSI) from transurethral resection of the prostate (TUR-P) test sets using the model (TL-colon poorly ADC-2 (x20, 512)). In the prostate adenocarcinoma WSI of TUR-P specimen (A), according to the histopathological diagnostic report, adenocarcinoma cells infiltrated in the three tissue fragments highlighted with yellow-triangles. The heatmap image (B) shows true positive predictions of prostate adenocarcinoma cells (D, G, J) which correspond respectively to H&E histopathology (C-E, F-H, and I-K). The heatmap image (B) also shows no positive predictions (true negative predictions) in the tissue fragments without evidence of adenocarcinoma infiltration (A). The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

Fig. 5:

Representative histopathological examples of prostate adenocarcinoma true positive prediction outputs on whole slide images (WSIs) from transurethral resection of the prostate (TUR-P) test sets using the model (TL-colon poorly ADC-2 (x20, 512)). Depiction of prostate adenocarcinoma histopathologies and corresponding heatmap images of adenocarcinoma prediction outputs: (A, B) medium-sized, discrete and distinct neoplastic glands (Gleason pattern 3), (C, D) medium-sized discrete and distinct glands with illformed glands (Gleason score 3+4), (E, F) ill-formed glands (Gleason pattern 4), (G, H) cribriform pattern (Gleason pattern 4). The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

Fig. 6:

Representative true negative prostate adenocarcinoma prediction outputs on a whole slide image (WSI) from transurethral resection of the prostate (TUR-P) test sets using the model (TL-colon poorly ADC-2 (x20, 512)). Histopathologically, in (A), there were nodular hyperplasia (benign prostatic hyperplasia) with chronic inflammation without any evidence of malignancy (C-E). The heatmap image (B, C, E) shows true negative prediction of prostate adenocarcinoma. The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

Fig. 7:

Representative examples of prostate adenocarcinoma false positive prediction outputs on whole slide images (WSIs) from transurethral resection of the prostate (TUR-P) test sets using the model (TL-colon poorly ADC-2 (x20, 512)). Histopathologically, (A, E, and H) have no evidence of adenocarcinoma infiltration. The heatmap images (B, F, and I) exhibit false positive predictions of prostate adenocarcinoma (C, D, G, and J) where the tissues consist of xanthogranulomatous inflammation (C, D), macrophagic infiltration (G), and squamous metaplasia with pseudo-koilocytosis (J), which are most likely the primary cause of the false positive prediction due to its morphological similarity to prostate adenocarcinoma cells. The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

Fig. 8:

A representative example of prostate adenocarcinoma false negative prediction output on a whole slide image (WSI) from transurethral resection of the prostate (TUR-P) test sets using the model (TL-colon poorly ADC-2 (x20, 512)). According to the histopathological diagnostic report, this case (A) has very small number of adenocarcinoma foci (cells) in (C) where pathologists marked surrounding with blue-dots but not in other areas which consist of nodular hyperplasia. The heatmap image (B) exhibited no positive adenocarcinoma prediction (C). The heatmap uses the jet color map where blue indicates low probability and red indicates high probability.

3.3 True positive prostate adenocarcinoma prediction of TUR-P WSIs

Our model (TL-colon poorly ADC-2 (x20, 512)) satisfactorily predicted adenocarcinoma in TUR-P WSIs (Fig. 4A, B). According to the histopathological report and additional pathologist’s reviewing, in this WSI (Fig. 4A), there were three tissue fragments (highlighted with yellow-triangles) with prostate adenocarcinoma cell infiltration (Fig. 4C, E, F, H, I, K). The heatmap image (Fig. 4B) shows true positive predictions in these fragments (yellow-triangles) (Fig. 4D, E, G, H, J, K) without false-positive predictions in other tissue fragments which were histopathologically evaluated as nodular hyperplasia (benign prostatic hyperplasia) without evidence of malignancy (Fig. 4A, B). Not only this representative WSI (Fig. 4), our model (TL-colon poorly ADC-2 (x20, 512)) precisely predicted wide variety of prostate adenocarcinoma histopathological features (Fig. 5): medium-sized, discrete and distinct neoplastic glands (Gleason pattern 3) (Fig. 5A, B); medium-sized discrete and distinct glands with ill-formed glands (Gleason score 3+4) (Fig. 5C, D); ill-formed glands (Gleason pattern 4) (Fig. 5E, F); and cribriform pattern (Gleason pattern 4) (Fig. 5G, H).

3.4 True negative prostate adenocarcinoma prediction of TUR-P WSIs

Our model (TL-colon poorly ADC-2 (x20, 512)) showed true negative predictions of prostate adenocarcinoma in TUR-P WSIs (Fig. 6A, B). In Fig. 6A, histopathologically, there were nodular hyperplasia (benign prostatic hyperplasia) with chronic inflammation in all tissue fragments without evidence of malignancy (Fig. 6A, C-F) which were not predicted as prostate adenocarcinoma (Fig. 6B, C, E).

3.5 False positive prostate adenocarcinoma prediction of TUR-P WSIs

According to the histopathological reports and additional pathologist’s reviewing, there were no prostate adenocarcinoma in these TUR-P WSIs (Fig. 7A, E, H). Our model (TL-colon poorly ADC-2 (x20, 512)) showed false positive predictions of prostate adenocarcinoma (Fig. 7B-D, F-G, I-J). These false positive tissue areas (Fig. 7B-D, F-G, I-J) showed xanthogranulomatous inflammation (Fig. 7A, C, D), macrophagic infiltration (Fig. 7E, G), and squamous metaplasia with pseudo-koilocytosis (Fig. 7H, J), which could be the primary cause of false positives due to its morphological similarity in adenocarcinoma cells.

3.6 False negative prostate adenocarcinoma prediction of TUR-P WSIs

According to the histopathological report and additional pathologist’s reviewing, in this TUR-P WSI (Fig. 8A), there were very small number of adenocarcinoma cells infiltrating in a tissue fragment (Fig. 8C) where pathologists marked with blue-dots. However, our model (TL-colon poorly ADC-2 (x20, 512)) did not predict any prostate adenocarcinoma cells (Fig. 8B, C).

4 Discussion

In this study, we trained deep learning models for the classification of prostate adenocarcinoma in TUR-P WSIs. Of the four models we trained (Table 4), the best model (TL-colon poorly ADC-2 (x20, 512)) achieved ROC-AUCs in the range of 0.896 - 0.984 on the TUR-P test sets. The best model (TL-colon poorly ADC-2 (x20, 512)) also achieved high ROC-AUCs on needle biopsy (0.913) and TCGA public dataset (0.947) test sets. The model (TL-lung carcinoma (x10, 512)) also achieved high ROC-AUCs in all test sets but lower than the best one (TL-colon poorly ADC-2 (x20, 512)). The other two models were trained using the EfficientNetB1 Tan and Le (2019) models starting with pre-trained weights on ImageNet at different magnifications (x10 and x20) and tile sizes (224×224 px, 512×512 px). The models based on EfficientNetB1 (EfficientNetB1 (x10, 224) & EfficientNetB1 (x20, 512)) achieved robust high ROC-AUC values on TUR-P Hospital-B test sets as compared to other test sets (Table 4). Based on the prediction heatmap images of prostate adenocarcinoma, it was obvious that the models based on EfficientNetB1 (EfficientNetB1 (x10, 224) & EfficientNetB1 (x20, 512)) incorrectly predicted blue ink dots, which pathologists had marked during diagnosis, as prostate adenocarcinoma (Fig. 3). Based on this finding, we have looked over WSIs in TUR-P Hospital-B test sets (Table 2) and most of adenocarcinoma positive WSIs (28 out of 30 WSIs) had ink dots on WSIs which were falsely predicted as adenocarcinoma. On the other hand, transfer learning models (TL-colon poorly ADC-2 (x20, 512) & TL-lung carcinoma (x10, 512)) revealed no false positive predictions on ink dots (Fig. 3); this is because those models had been trained on WSIs with ink labelled as non-neoplastic. The best model (TL-colon poorly ADC-2 (x20, 512)) and the second best model (TL-lung carcinoma (x10, 512)) were trained by the transfer learning approach from our existing colon poorly differentiated adenocarcinoma classification model Tsuneki and Kanavati (2021) and lung carcinoma classification model Kanavati et al (2020) based on the findings of ROC-AUC and log loss values on TUR-P test sets (TUR-P Hospital-A-B) using existing adenocarcinoma classification models (Table 3). We used the partial fine-tuning approach Kanavati and Tsuneki (2021c) to train the models faster, as there are less weights involved to tune. We used only 1,020 TUR-P WSIs (adenocarcinoma: 79 WSIs, benign: 941 WSIs) (Table 1) without manual annotations by pathologists Iizuka et al (2020); Naito et al (2021); Kanavati et al (2022). We see that by specifically training on TUR-P WSIs, the models significantly improved prediction performance on TUR-P test set (Table 4) compared to the previous study Tsuneki et al (2022) that had lower ROC-AUC (0.737 - 0.909) and higher log loss (3.269 - 4.672) values. The combination of both models would be able to provide accurate prostate adenocarcinoma classification on both needle biopsy Tsuneki et al (2022) and TUR-P WSIs in routine histopathological diagnostic workflow.

Nodular hyperplasia (benign prostatic hyperplasia) is a common benign disorder of the prostate as a histopathological diagnosis referring to the nodular enlargement of the gland caused by hyperplasia of both glandular and stromal components within the prostatic transition zone and results in varying degrees of urinary obstruction, sometimes requiring surgical intervention including TUR-P Lokeshwar et al (2019). Importantly, incidental prostate cancers are diagnosed at the time of TUR-P for benign prostatic disease Otto et al (2014). According to the literature search, cancers especially prostate adenocarcinoma are detected incidentally around 5-17% of TUR-P specimens Jones et al (2009); Zigeuner et al (2003); Yoo et al (2012); Sakamoto et al (2014); Trpkov et al (2008); Dellavedova et al (2010); Otto et al (2014), meaning around 83-95% of TUR-P specimens are benign lesions; which is nearly identical to the ratio of adenocarcinoma in the TUR-P test sets (Table 2). Therefore, the high values of specificity (0.884 - 0.992) in the best model is noteworthy (Table 5). Moreover, heatmap images revealed true-negative prediction perfectly on each non-neoplastic fragment in both adenocarcinoma (Fig. 4) and benign (non-neoplastic) (Fig. 6) WSIs. Thus, the heatmap images predicted by the best model would provide great benefits for pathologists who have to report the detail descriptions of many TUR-P specimens in routine clinical practices.

The best deep learning model established in the present study offers promising results that indicate it could be beneficial as a screening aid for pathologists prior to observing histopathology on glass slides or WSIs. At the same time, the model could be used as a double-check tool to reduce the risk of missed cancer foci (incidental adenocarcinoma in TUR-P specimens). The most important advantage of using a fully automated computational tool is that it can systematically handle large amounts of WSIs without potential bias due to the fatigue commonly experienced by pathologists, which could drastically alleviate the heavy clinical burden of practical pathology diagnosis using conventional microscopes.

Data Availability

6 Compliance with Ethical Standards

The experimental protocol was approved by the ethical board of Kamachi Group Hospitals (No. 173). All research activities complied with all relevant ethical regulations and were performed in accordance with relevant guidelines and regulations in the all hospitals mentioned above. Informed consent to use histopathological samples and pathological diagnostic reports for research purposes had previously been obtained from all patients prior to the surgical procedures at all hospitals, and the opportunity for refusal to participate in research had been guaranteed by an opt-out manner.

7 Funding

The authors received no financial supports for the research, authorship, and publication of this study.

8 Conflict of Interest

M.T. and F.K. are employees of Medmain Inc. All authors declare no competing interests.

9 Contributions

M.T. designed the studies, performed experiments and analyzed the data; M.T. and F.K. performed the computational studies; M.A. performed the histopathological diagnoses and reviewed the cases; M.T. and F.K. wrote the manuscript; M.T. supervised the project. All authors reviewed and approved the final manuscript.

5 Acknowledgements

We are grateful for the support provided by Dr. Shigeo Nakano at Kamachi Group Hospitals (Fukuoka, Japan). We thank pathologists who have been engaged in reviewing cases and clinicopathological discussion for this study.

References

↵
Abadi M, Agarwal A, Barham P, et al. (2015) TensorFlow: Large-scale machine learning on heterogeneous systems. URL https://www.tensorflow.org/, software available from tensorflow.org
↵
Andrén O, Fall K, Franzén L, et al. (2006) How well does the gleason score predict prostate cancer death? a 20-year followup of a population based cohort in sweden. The Journal of urology 175(4):1337–1340
OpenUrl CrossRef PubMed Web of Science
↵
Bejnordi BE, Veta M, Van Diest PJ, et al. (2017) Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama 318(22):2199–2210
OpenUrl CrossRef PubMed
↵
Campanella G, Hanna MG, Geneslaw L, et al. (2019) Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nature medicine 25(8):1301–1309
OpenUrl PubMed
↵
Coudray N, Ocampo PS, Sakellaropoulos T, et al. (2018) Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nature medicine 24(10):1559–1567
OpenUrl CrossRef PubMed
↵
Dellavedova T, Ponzano R, Racca L, et al. (2010) Prostate cancer as incidental finding in transurethral resection. Archivos Espanoles de Urologia 63(10):855–861
OpenUrl PubMed
↵
Efron B, Tibshirani RJ (1994) An introduction to the bootstrap. CRC press
↵
Egevad L, Granfors T, Karlberg L, et al. (2002) Percent gleason grade 4/5 as prognostic factor in prostate cancer diagnosed at transurethral resection. The Journal of urology 168(2):509–513
OpenUrl CrossRef PubMed
↵
Epstein JI, Paull G, Eggleston JC, et al. (1986) Prognosis of untreated stage a1 prostatic carcinoma: a study of 94 cases with extended followup. The Journal of urology 136(4):837–839
OpenUrl PubMed Web of Science
↵
Gertych A, Swiderska-Chadaj Z, Ma Z, et al. (2019) Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides. Scientific reports 9(1):1483
OpenUrl
↵
Hou L, Samaras D, Kurc TM, et al. (2016) Patch-based convolutional neural network for whole slide tissue image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2424– 2433
↵
Hunter JD (2007) Matplotlib: A 2d graphics environment. Computing in Science & Engineering 9(3):90–95. https://doi.org/10.1109/MCSE.2007.55
OpenUrl
↵
Iizuka O, Kanavati F, Kato K, et al. (2020) Deep learning models for histopathological classification of gastric and colonic epithelial tumours. Scientific reports 10(1):1–11
OpenUrl
↵
Jones J, Follis H, Johnson J (2009) Probability of finding t1a and t1b (incidental) prostate cancer during turp has decreased in the psa era. Prostate cancer and prostatic diseases 12(1):57–60
OpenUrl PubMed
↵
Kanavati F, Tsuneki M (2021a) Breast invasive ductal carcinoma classification on whole slide images with weakly-supervised and transfer learning. bioRxiv
↵
Kanavati F, Tsuneki M (2021b) A deep learning model for gastric diffusetype adenocarcinoma classification in whole slide images. arXiv preprint arXiv:210412478
↵
Kanavati F, Tsuneki M (2021c) Partial transfusion: on the expressive influence of trainable batch norm parameters for transfer learning. In: Medical Imaging with Deep Learning, PMLR, pp 338–353
↵
Kanavati F, Toyokawa G, Momosaki S, et al. (2020) Weakly-supervised learning for lung carcinoma classification using deep learning. Scientific reports 10(1):1–11
OpenUrl
↵
Kanavati F, Ichihara S, Rambeau M, et al. (2021a) Deep learning models for gastric signet ring cell carcinoma classification in whole slide images. Technology in Cancer Research & Treatment 20:15330338211027,901
OpenUrl
↵
Kanavati F, Toyokawa G, Momosaki S, et al. (2021b) A deep learning model for the classification of indeterminate lung carcinoma in biopsy whole slide images. Scientific Reports 11(1):1–14
OpenUrl
↵
Kanavati F, Ichihara S, Tsuneki M (2022) A deep learning model for breast ductal carcinoma in situ classification in whole slide images. Virchows Archiv pp 1–14
↵
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arxiv:14126980
↵
Korbar B, Olofson AM, Miraflor AP, et al. (2017) Deep learning for classification of colorectal polyps on whole-slide images. Journal of pathology informatics 8
↵
Kraus OZ, Ba JL, Frey BJ (2016) Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics 32(12):i52–i59
OpenUrl CrossRef PubMed
↵
Litjens G, Sánchez CI, Timofeeva N, et al. (2016) Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Scientific reports 6:26,286
OpenUrl
↵
Lokeshwar SD, Harper BT, Webb E, et al. (2019) Epidemiology and treatment modalities for the management of benign prostatic hyperplasia. Translational andrology and urology 8(5):529
OpenUrl
↵
Luo X, Zang X, Yang L, et al. (2017) Comprehensive computational pathological image analysis predicts lung cancer prognosis. Journal of Thoracic Oncology 12(3):501–509
OpenUrl
↵
Madabhushi A, Lee G (2016) Image analysis and machine learning in digital pathology: Challenges and opportunities. Medical Image Analysis 33:170– 175
OpenUrl PubMed
↵
Naito Y, Tsuneki M, Fukushima N, et al. (2021) A deep learning model to detect pancreatic ductal adenocarcinoma on endoscopic ultrasound-guided fine-needle biopsy. Scientific reports 11(1):1–8
OpenUrl
↵
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE transactions on systems, man, and cybernetics 9(1):62–66
OpenUrl CrossRef PubMed Web of Science
↵
Otto B, Barbieri C, Lee R, et al. (2014) Incidental prostate cancer in transurethral resection of the prostate specimens in the modern era. Advances in Urology 2014
↵
Pedregosa F, Varoquaux G, Gramfort A, et al. (2011) Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12:2825–2830
OpenUrl CrossRef
↵
Sakamoto H, Matsumoto K, Hayakawa N, et al. (2014) Preoperative parameters to predict incidental (t1a and t1b) prostate cancer. Canadian Urological Association Journal 8(11-12):E815
OpenUrl
↵
Saltz J, Gupta R, Hou L, et al. (2018) Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell reports 23(1):181–193
OpenUrl
↵
Sung H, Ferlay J, Siegel RL, et al. (2021) Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a cancer journal for clinicians 71(3):209–249
OpenUrl CrossRef PubMed
↵
Takamori H, Masumori N, Kamoto T (2017) Surgical procedures for benign prostatic hyperplasia: a nationwide survey in japan, 2014 update. International journal of urology: official journal of the Japanese Urological Association 24(6):476–477
OpenUrl
↵
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, PMLR, pp 6105–6114
↵
Trpkov K, Thompson J, Kulaga A, et al. (2008) How much tissue sampling is required when unsuspected minimal prostate carcinoma is identified on transurethral resection? Archives of pathology & laboratory medicine 132(8):1313–1316
OpenUrl
↵
Tsuneki M, Kanavati F (2021) Deep learning models for poorly differentiated colorectal adenocarcinoma classification in whole slide images using transfer learning. Diagnostics 11(11):2074
OpenUrl
↵
Tsuneki M, Abe M, Kanavati F (2022) A deep learning model for prostate adenocarcinoma classification in needle biopsy whole-slide images using transfer learning. Diagnostics 12(3):768
OpenUrl
↵
Van Andel G, Vleeming R, Kurth K, et al. (1995) Incidental carcinoma of the prostate. In: Seminars in Surgical Oncology, Wiley Online Library, pp 36–45
↵
Wei JW, Tafe LJ, Linnik YA, et al. (2019) Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Scientific reports 9(1):1–8
OpenUrl
↵
Yoo C, Oh CY, Kim SJ, et al. (2012) Preoperative clinical factors for diagnosis of incidental prostate cancer in the era of tissue-ablative surgery for benign prostatic hyperplasia: a korean multi-center review. Korean journal of urology 53(6):391–395
OpenUrl
↵
Yu KH, Zhang C, Berry GJ, et al. (2016) Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nature communications 7:12,474
OpenUrl
↵
Zigeuner RE, Lipsky K, Riedler I, et al. (2003) Did the rate of incidental prostate cancer change in the era of psa testing? a retrospective study of 1127 patients. Urology 62(3):451–455
OpenUrl PubMed

View the discussion thread.

Posted April 21, 2022.

Download PDF

Data/Code

Citation Tools

Subject Area

Pathology

Subject Areas

All Articles

Addiction Medicine (322)
Allergy and Immunology (623)
Anesthesia (162)
Cardiovascular Medicine (2335)
Dentistry and Oral Medicine (283)
Dermatology (205)
Emergency Medicine (373)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (828)
Epidemiology (11694)
Forensic Medicine (10)
Gastroenterology (693)
Genetic and Genomic Medicine (3679)
Geriatric Medicine (345)
Health Economics (630)
Health Informatics (2361)
Health Policy (925)
Health Systems and Quality Improvement (881)
Hematology (339)
HIV/AIDS (772)
Infectious Diseases (except HIV/AIDS) (13256)
Intensive Care and Critical Care Medicine (765)
Medical Education (363)
Medical Ethics (104)
Nephrology (396)
Neurology (3436)
Nursing (194)
Nutrition (519)
Obstetrics and Gynecology (665)
Occupational and Environmental Health (659)
Oncology (1800)
Ophthalmology (533)
Orthopedics (216)
Otolaryngology (285)
Pain Medicine (230)
Palliative Medicine (66)
Pathology (444)
Pediatrics (1021)
Pharmacology and Therapeutics (424)
Primary Care Research (415)
Psychiatry and Clinical Psychology (3142)
Public and Global Health (6084)
Radiology and Imaging (1254)
Rehabilitation Medicine and Physical Therapy (731)
Respiratory Medicine (821)
Rheumatology (375)
Sexual and Reproductive Health (365)
Sports Medicine (320)
Surgery (396)
Toxicology (50)
Transplantation (171)
Urology (144)

[1] ↵
Abadi M, Agarwal A, Barham P, et al. (2015) TensorFlow: Large-scale machine learning on heterogeneous systems. URL https://www.tensorflow.org/, software available from tensorflow.org

[2] ↵
Andrén O, Fall K, Franzén L, et al. (2006) How well does the gleason score predict prostate cancer death? a 20-year followup of a population based cohort in sweden. The Journal of urology 175(4):1337–1340
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Bejnordi BE, Veta M, Van Diest PJ, et al. (2017) Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama 318(22):2199–2210
OpenUrl CrossRef PubMed

[4] ↵
Campanella G, Hanna MG, Geneslaw L, et al. (2019) Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nature medicine 25(8):1301–1309
OpenUrl PubMed

[5] ↵
Coudray N, Ocampo PS, Sakellaropoulos T, et al. (2018) Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nature medicine 24(10):1559–1567
OpenUrl CrossRef PubMed

[6] ↵
Dellavedova T, Ponzano R, Racca L, et al. (2010) Prostate cancer as incidental finding in transurethral resection. Archivos Espanoles de Urologia 63(10):855–861
OpenUrl PubMed

[7] ↵
Efron B, Tibshirani RJ (1994) An introduction to the bootstrap. CRC press

[8] ↵
Egevad L, Granfors T, Karlberg L, et al. (2002) Percent gleason grade 4/5 as prognostic factor in prostate cancer diagnosed at transurethral resection. The Journal of urology 168(2):509–513
OpenUrl CrossRef PubMed

[9] ↵
Epstein JI, Paull G, Eggleston JC, et al. (1986) Prognosis of untreated stage a1 prostatic carcinoma: a study of 94 cases with extended followup. The Journal of urology 136(4):837–839
OpenUrl PubMed Web of Science

[10] ↵
Gertych A, Swiderska-Chadaj Z, Ma Z, et al. (2019) Convolutional neural networks can accurately distinguish four histologic growth patterns of lung adenocarcinoma in digital slides. Scientific reports 9(1):1483
OpenUrl

[11] ↵
Hou L, Samaras D, Kurc TM, et al. (2016) Patch-based convolutional neural network for whole slide tissue image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2424– 2433

[12] ↵
Hunter JD (2007) Matplotlib: A 2d graphics environment. Computing in Science & Engineering 9(3):90–95. https://doi.org/10.1109/MCSE.2007.55
OpenUrl

[13] ↵
Iizuka O, Kanavati F, Kato K, et al. (2020) Deep learning models for histopathological classification of gastric and colonic epithelial tumours. Scientific reports 10(1):1–11
OpenUrl

[14] ↵
Jones J, Follis H, Johnson J (2009) Probability of finding t1a and t1b (incidental) prostate cancer during turp has decreased in the psa era. Prostate cancer and prostatic diseases 12(1):57–60
OpenUrl PubMed

[15] ↵
Kanavati F, Tsuneki M (2021a) Breast invasive ductal carcinoma classification on whole slide images with weakly-supervised and transfer learning. bioRxiv

[16] ↵
Kanavati F, Tsuneki M (2021b) A deep learning model for gastric diffusetype adenocarcinoma classification in whole slide images. arXiv preprint arXiv:210412478

[17] ↵
Kanavati F, Tsuneki M (2021c) Partial transfusion: on the expressive influence of trainable batch norm parameters for transfer learning. In: Medical Imaging with Deep Learning, PMLR, pp 338–353

[18] ↵
Kanavati F, Toyokawa G, Momosaki S, et al. (2020) Weakly-supervised learning for lung carcinoma classification using deep learning. Scientific reports 10(1):1–11
OpenUrl

[19] ↵
Kanavati F, Ichihara S, Rambeau M, et al. (2021a) Deep learning models for gastric signet ring cell carcinoma classification in whole slide images. Technology in Cancer Research & Treatment 20:15330338211027,901
OpenUrl

[20] ↵
Kanavati F, Toyokawa G, Momosaki S, et al. (2021b) A deep learning model for the classification of indeterminate lung carcinoma in biopsy whole slide images. Scientific Reports 11(1):1–14
OpenUrl

[21] ↵
Kanavati F, Ichihara S, Tsuneki M (2022) A deep learning model for breast ductal carcinoma in situ classification in whole slide images. Virchows Archiv pp 1–14

[22] ↵
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arxiv:14126980

[23] ↵
Korbar B, Olofson AM, Miraflor AP, et al. (2017) Deep learning for classification of colorectal polyps on whole-slide images. Journal of pathology informatics 8

[24] ↵
Kraus OZ, Ba JL, Frey BJ (2016) Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics 32(12):i52–i59
OpenUrl CrossRef PubMed

[25] ↵
Litjens G, Sánchez CI, Timofeeva N, et al. (2016) Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis. Scientific reports 6:26,286
OpenUrl

[26] ↵
Lokeshwar SD, Harper BT, Webb E, et al. (2019) Epidemiology and treatment modalities for the management of benign prostatic hyperplasia. Translational andrology and urology 8(5):529
OpenUrl

[27] ↵
Luo X, Zang X, Yang L, et al. (2017) Comprehensive computational pathological image analysis predicts lung cancer prognosis. Journal of Thoracic Oncology 12(3):501–509
OpenUrl

[28] ↵
Madabhushi A, Lee G (2016) Image analysis and machine learning in digital pathology: Challenges and opportunities. Medical Image Analysis 33:170– 175
OpenUrl PubMed

[29] ↵
Naito Y, Tsuneki M, Fukushima N, et al. (2021) A deep learning model to detect pancreatic ductal adenocarcinoma on endoscopic ultrasound-guided fine-needle biopsy. Scientific reports 11(1):1–8
OpenUrl

[30] ↵
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE transactions on systems, man, and cybernetics 9(1):62–66
OpenUrl CrossRef PubMed Web of Science

[31] ↵
Otto B, Barbieri C, Lee R, et al. (2014) Incidental prostate cancer in transurethral resection of the prostate specimens in the modern era. Advances in Urology 2014

[32] ↵
Pedregosa F, Varoquaux G, Gramfort A, et al. (2011) Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12:2825–2830
OpenUrl CrossRef

[33] ↵
Sakamoto H, Matsumoto K, Hayakawa N, et al. (2014) Preoperative parameters to predict incidental (t1a and t1b) prostate cancer. Canadian Urological Association Journal 8(11-12):E815
OpenUrl

[34] ↵
Saltz J, Gupta R, Hou L, et al. (2018) Spatial organization and molecular correlation of tumor-infiltrating lymphocytes using deep learning on pathology images. Cell reports 23(1):181–193
OpenUrl

[35] ↵
Sung H, Ferlay J, Siegel RL, et al. (2021) Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a cancer journal for clinicians 71(3):209–249
OpenUrl CrossRef PubMed

[36] ↵
Takamori H, Masumori N, Kamoto T (2017) Surgical procedures for benign prostatic hyperplasia: a nationwide survey in japan, 2014 update. International journal of urology: official journal of the Japanese Urological Association 24(6):476–477
OpenUrl

[37] ↵
Tan M, Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, PMLR, pp 6105–6114

[38] ↵
Trpkov K, Thompson J, Kulaga A, et al. (2008) How much tissue sampling is required when unsuspected minimal prostate carcinoma is identified on transurethral resection? Archives of pathology & laboratory medicine 132(8):1313–1316
OpenUrl

[39] ↵
Tsuneki M, Kanavati F (2021) Deep learning models for poorly differentiated colorectal adenocarcinoma classification in whole slide images using transfer learning. Diagnostics 11(11):2074
OpenUrl

[40] ↵
Tsuneki M, Abe M, Kanavati F (2022) A deep learning model for prostate adenocarcinoma classification in needle biopsy whole-slide images using transfer learning. Diagnostics 12(3):768
OpenUrl

[41] ↵
Van Andel G, Vleeming R, Kurth K, et al. (1995) Incidental carcinoma of the prostate. In: Seminars in Surgical Oncology, Wiley Online Library, pp 36–45

[42] ↵
Wei JW, Tafe LJ, Linnik YA, et al. (2019) Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Scientific reports 9(1):1–8
OpenUrl

[43] ↵
Yoo C, Oh CY, Kim SJ, et al. (2012) Preoperative clinical factors for diagnosis of incidental prostate cancer in the era of tissue-ablative surgery for benign prostatic hyperplasia: a korean multi-center review. Korean journal of urology 53(6):391–395
OpenUrl

[44] ↵
Yu KH, Zhang C, Berry GJ, et al. (2016) Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nature communications 7:12,474
OpenUrl

[45] ↵
Zigeuner RE, Lipsky K, Riedler I, et al. (2003) Did the rate of incidental prostate cancer change in the era of psa testing? a retrospective study of 1127 patients. Urology 62(3):451–455
OpenUrl PubMed

Weakly supervised and transfer learning for adenocarcinoma classification in transurethral resection of the prostate whole slide images

Abstract

1 Introduction

2 Materials and methods

2.1 Clinical cases and pathological records

2.2 Dataset

2.3 Deep learning models

2.4 Software and statistical analysis

2.5 Availability of data and material

2.6 Code availability

3 Results

3.1 Insufficient AUC performance of WSI prostate adenocarcinoma evaluation on TUR-P WSIs using existing series of adenocarcinoma classification models

3.2 High AUC performance of TUR-P WSI evaluation of prostate adenocarcinoma histopathology images

3.3 True positive prostate adenocarcinoma prediction of TUR-P WSIs

3.4 True negative prostate adenocarcinoma prediction of TUR-P WSIs

3.5 False positive prostate adenocarcinoma prediction of TUR-P WSIs

3.6 False negative prostate adenocarcinoma prediction of TUR-P WSIs

4 Discussion

Data Availability

6 Compliance with Ethical Standards

7 Funding

8 Conflict of Interest

9 Contributions

5 Acknowledgements

References

Citation Manager Formats

Subject Area