Prediction of Sex and Age from Macular Optical Coherence Tomography Images and Feature Analysis Using Deep Learning

Kuan-Ming Chueh; Yi-Ting Hsieh; Homer H. Chen; I-Hsin Ma; Sheng-Lung Huang

doi:10.1101/2020.12.23.20248805

Abstract

The prevalence of certain macular diseases differs between male and female. However, the actual difference in macular structure between male and female was barely understood. Previous studies reported the mean retinal thickness of macula was thinner for female, but here it was observed that the difference is not statistically large enough for sex distinction. Similarly, the age-related non-pathological change of macular structure was also hardly known. It has been found that the thickness of choroid decreases with age. In this study, deep learning was applied to distinguish sex and age from macular optical coherence tomography (OCT) images of 3134 persons and achieved a sex prediction accuracy of 85.6 ± 2.1% and an age prediction error of 5.78 ± 0.29 years. A thorough analysis of the prediction accuracy and the Grad-CAM showed that 1) the foveal contour leads to a better sex distinction than the macular thickness, 2) B-scan macular OCT images contain more sex-related information than en face fundus images, and 3) the age-related characteristics of the macula are on the whole layers of the retina, not just the choroid. These novel findings reported in this study are useful to ophthalmologists for further investigation in the pathogenesis of sex and age-related macular structural diseases.

Introduction

The macula is located at the center of the retina. It is the area with the densest photoreceptor cone cells and is responsible for the central vision¹. In the past, sexual difference in the structure of the macula was not known, except few studies have shown that the average thickness of the macula in men was thicker than in women^2-7. In 2018, Poplin et al.⁸ reported that their deep learning model could distinguish the sex of the subject from color fundus photography, with 97% accuracy. This means that some differences do exist between male and female at the posterior pole including the macula and optic disc, but such differences cannot be easily identified by ophthalmologists. The fovea is a pitted structure at the center of the macula. The shape of the foveal pit could differ a lot among normal populations, and such variation has no special clinical significance⁹. Recently, a study by our team has found that a wide-based foveal pit, which was demonstrated on optical coherence tomography (OCT), occurred five times as frequently in female as in male. Eyes with a wide-based foveal pit tended to have epiretinal membrane formation, and their contralateral eyes also had an extremely high proportion of epiretinal membrane and macular hole¹⁰. Interestingly, these macular structural diseases including epiretinal membrane and macular hole, also occur more frequently in women^11,12. In the past, the ophthalmologists had no special hypothesis about the sexual differences in these macular structural diseases. According to the above-mentioned observations, we speculate some inherent differences in the anatomical structure of the macula between male and female, which result in their differences in the prevalence of structure-related maculopathy. Idiopathic epiretinal membrane and macular hole also have age predilection; with higher prevalence noted in middle-aged and elderly people^13-15. In addition to vitreous degeneration and cell aging, it is still not clear whether the macula itself has age-related structural changes that make the macula predisposes to these lesions. Therefore, it is clinically important to study the sex and age-related differences in the structure of the macula.

Deep learning allows an algorithm to learn the appropriate features by multiple computation layers rather than requiring features to be hand engineered¹⁶. Some study groups have used deep learning to do the structural segmentation for macular OCT and classification of macular diseases using macular OCT^17-22. However, deep learning has never been used to predict sex and age through macular OCT images. Recently, gradient-weighted class activation mapping (Grad-CAM) has been widely adopted which uses the gradient information flowing into the final convolutional layer of a convolutional neural network (CNN) to produce a localization map of the important regions in the image²³. In this study, deep learning to train CNN was used to predict sex and age according to macular OCT images, and then Grad-CAM was used to explore the sex and age-related features from different layers of CNNs in order to offer a new understanding of the macular structure.

Results

Deep-learning models were developed by using the 6-mm cross-sectional (B-scan) OCT images of the macula. These images were extracted from the macula volume scan using Heidelberg Spectralis (Heidelberg Engineering, Heidelberg, Germany). Each volume scan contained 25 horizontal B-scans that covered the 6 x 6 mm² area of the central macula. Since the most inferior one could not be obtained for the dataset, 24 macular OCT B-scan images were obtained from each set of volume scan (Figure 1). The sex-prediction models were developed using 4866 sets of OCT images (50% of them were female) from the eyes of 2288 persons, and the age-prediction models were developed using 6147 sets of OCT images (average age: 55.2 ± 15.0 years) from the eyes of 3134 persons. Except for using the total 24 images from the volume scan as the input, the 24 images were also divided into 7 groups to analyze from different position of the macula, which included: (1) the 11^th image only, which passed through the center of the fovea, (2) the 10^th and 12^th images, which passed through the inferior and superior margins of the 500 μm-diameter circle of the fovea, (3) the 9^th and 13^th images, which passed through the inferior and superior margins of the 1000 μm-diameter circle of the fovea, (4) the 0^th - 8^th and 14^th-23^th images, (5) the 0^th and 22^th images, (6) the 11^th and 12^th images, and (7) the total 24 images.

Figure 1 24 images (1 volumes) scanned by Heidelberg Spectralis OCT.

The numbers defined in a single volume are shown on the top of each image. Each image contains 496 512 pixels, and the pixel resolution are 3.9 and 11 in the axial and lateral directions, respectively.

Sex prediction by deep learning models using macular OCT images

The deep-learning models were trained and validated by the 10-fold cross-validation with a variety of input data. The test accuracies are shown in Table 1. The accuracy of sex prediction among different positions of the macula were compared among four models that contained the same amount of input data from different scanning positions, which were (A) 11^th and 12^th, (B) 10^th and 12 ^th, (C) 9 ^th and 13 ^th, (D) 0 ^th and 22 ^th. These data groups contained OCT images from two different scanning positions that were symmetric to central fovea except for (A), which contained one image in sets passing through the central fovea and the other image in sets passing though the superior juxtafovea. The distance to fovea was the shortest in (A) and the longest in (E). The average accuracy in sex prediction for models(A), (B), (C) and (D) were 75.9 ± 2.3%, 74.8 ± 3.1%, 73.7 ± 2.5% and 70.5 ± 3.0%, respectively. The accuracy decreased gradually when the scanning positions were getting away from the central fovea. When using the 11^th images only for training and prediction in (E), the accuracy was only 73.1 ± 1.6%, which was less than that in (A). When using all 19 extrafoveal images in (F) or mixed the fovea-containing and extrafoveal images (24 images in sets) together in (G), the accuracy increased to 77.1 ± 2.1% and 77.4 ± 2.2%, respectively.

View this table:

Table 1 Accuracy for sex prediction using different dataset with 10-fold cross validation.

Hard vote & soft vote for deep learning models

Hard vote and soft vote were used to improve the prediction performance by merging the prediction results from different areas at the macula. Table 1 shows the results of a hard vote and soft vote using different model combinations. By using hard vote, the sex prediction accuracy could be up to 85.6 ± 2.1%. Similarly, by using a soft vote, the sex prediction accuracy could be up to 85.5 ± 1.9%.

Feature analysis from Grad-CAM and guided Grad-CAM

Next, Grad-CAM overlaid on original OCT images was used to identify the regions that the CNN might have been using to make its predictions. Five representative examples of macular OCT images with their Grad-CAM are shown in Figure 2. To understand more detailed features in the fovea, the Grad-CAM of different layers of ResNet18 were analyzed. In Layer 1, some Grad-CAM focused on the whole retina while some focused on the vitreous, implying that the model’s focus was bounded by the inner retinal surface of the macula. In Layer 2, Grad-CAM focused mainly on the inner limiting membrane and the retinal pigment epithelium layers, which defined both the inner and outer contours of the retina. In Layers 3 and 4, Grad-CAM focused mainly on the fovea. Layer 4 is the last convolutional layer of ResNet18, so the Grad-CAM of Layer 4 should have the highest correlation with the prediction results. According to these results, it is proposed that the deep-learning model used the information of fovea mostly to make the prediction of sex.

Figure 2 Grad-CAM of different input OCT images.

OCT column shows five typical macular OCT images (). These images are predicted correctly by the model. Layer 1, Layer 2, Layer 3, and Layer 4 column show the Grad-CAM in 4 layers of ResNet18, which have a receptive field of 43×43, 99×99, 211×211, 435×435, respectively. Guided Backprop column shows guided backpropagation of OCT. Guided Grad-CAM column is arrived from pointwise multiplying the Grad-CAM by guided backpropagation.

The extracted features by the deep learning models were further analyzed from the guided Grad-CAM. Guided Grad-CAM combines Grad-CAM with existing fine-grained visualization (guided backpropagation) to create a high-resolution class-discriminative visualization. According to the guided Grad-CAM, the inner contour of the fovea was clearly shown, suggesting that the foveal contour is the key point for differentiation between male and female.

Figure 3 shows the Grad-CAM from the 10-fold cross-validation models of one single macular OCT image. Although the 10-fold cross-validation models were trained using different input data as described in Methods, all the 10 Grad-CAMs and guided Grad-CAM had similar presentations as those shown in Figure 2, indicating that the fovea and its contour as the key point for sex prediction is promising. Supplementary 1 shows more Grad-CAM from the 10-fold cross-validation models of one single macular OCT image.

Figure 3 Grad-CAM on one single OCT image with 10-fold cross validation.

OCT column shows one macular OCT image (). This image is predicted correctly by all ten cross-validation models. Layer 1, Layer 2, Layer 3, and Layer 4 columns show the Grad-CAM in 4 layers of ResNet18 arrived from ten cross-validation models, which have a receptive field of 43×43, 99×99, 211×211, 435×435, respectively. Guided Backprop column shows guided backpropagation calculated from ten cross-validation models. Guided Grad-CAM column is arrived from pointwise multiplying the Grad-CAM by guided backpropagation.

Sex prediction using macular thickness

To predict sex by using the macular thickness, 110 persons (56 female, 54 male) with the normal macula in bilateral eyes were collected and the retinal thickness in different areas of the macula were calculated from OCT. A cut-point value of macular thickness in order to differentiate male from female was planned. The best accuracy in differentiating sex was 61.9%, which was obtained using the central 1-mm area of the central fovea (Figure 4a). Furthermore, the data of macular thickness by Melissa et al.² was used to differentiate male from female. The best accuracy was also 61.9%, which was obtained using the temporal perifoveal area (Figure 4b). Both accuracies were poorer than those obtained from deep learning models using macular OCT.

Figure 4 Accuracy for sex prediction using macular thickness.

a. The accuracy for sex prediction using the macular thickness of 56 female and 54 male using the volume data calculated by the Heidelberg Eye Explorer software. The circles are defined by the ETDRS grid with concentric circles of 1-, 3-, and 6-mm diameters at macula. b. The accuracy for sex prediction from the thickness of macula calculated from the data published by Melissa et al.² The details for approach is described in Supplementary 2.

Sex prediction by deep learning models using infrared fundus photography

The en face image of the macula contains different structural information from the cross-sectional image of the macula. We also used 30° x 30° infrared fundus photography taken during the OCT volume scan to train deep learning models and perform 10-fold cross-validation. The infrared fundus images used were exactly the same individuals as those used for OCT image analysis. Table 2 shows the comparison between infrared fundus photography (384 x 384 pixels, grayscale) and B-scan macular OCT images (256 x 512 pixels) in sex prediction using the images from the same persons. It was found that the accuracy of sex prediction using infrared fundus photography was lower than that using B-scan OCT images; this indicates that B-scan macular OCT images contained more sex-related information than en face infrared fundus photography. Figure 5 is the Grad-CAM models for sex prediction from infrared fundus photography. It shows that the models mainly focused on the optic disc and arcade vessels, which is similar to the results reported by Poplin et al.⁸

View this table:

Table 2 Comparison of sex prediction accuracy between infrared fundus photography and B-scan macular OCT images

Figure 5 Grad-CAM on 3 infrared fundus images with 10-fold cross validation.

The left column shows the infrared fundus photography from three normal eyes. The Grad-CAM of Layer 4 in ResNet18 from 10-fold cross-validation models shows that the hot spots focused mostly on the optic disc and the main arcades of retinal vessels.

Age prediction using macular OCT images

The deep-learning models were trained to predict the age of patients using the macular OCT images (11^th and 12^th). The age of patients ranged from 20 to 90 years old. With the 10-fold cross-validation, the average test mean absolute error (MAE) is 5.78 ± 0.29 years (Table 3).

View this table:

Table 3 Mean absolute error for age prediction

Figure 6 are images of Grad-CAM from patients of different ages. The images show that the deep learning model predicted age mainly based on the whole layers of the retina, rather than the choroid.

Figure 6 Grad-CAM from the age prediction model.

Six examples for the Grad-CAM of OCT images for age prediction. Each image is labeled with the real age and the model’s predicted age.

Discussion

The present study indicates that macular OCT can be used to predict sex and age via deep learning with good accuracy. For sex prediction, the accuracy could be as high as 85.6%. High accuracy is obtained by using more images in model training as shown in Table 1. Images closer to the central fovea yielded more accurate results, which can be seen from the results of models (A) to (D). A single image that contained the central fovea () out of the 24 macular OCT images for model training and validation provided an accuracy of 73.1% (Table 1 (E)). These results indicate that the central fovea contains more sex-related features than the extrafoveal area. The results from Grad-CAM and guided Grad-CAM also showed that the sex-related difference is focused on the central fovea. Furthermore, differentiation of sex by retinal thickness at different areas of the macula provided an accuracy of only 61.9% at best. These results imply that not only the thickness but also the contour of fovea are different between male and female, which were compatible with the inference from Delori et al.²⁴ that the foveal pit is wider in female according to the analysis from fundus reflectometry. Supplementary 3 provides another evidence that the high accuracy rate of the deep learning model for sex prediction was not due to overfitting; the foveal contour is highly sex-related.

The macula is the most important area in the retina because it is in charge of central vision. The sexual difference in macular structure is also an important issue because some macular diseases, such as epiretinal membrane and macular hole, are not equally prevalent between male and female. Using the Grad-CAM and guided Grad-CAM as assistant tools, it was found that the central fovea, especially its contour, is a significant feature of sex. The foveal contour varies largely among normal population; hence little was known about its clinical significance. Poplin et al.⁸ showed that optic nerve head and arcade vessels were highlighted by the Grad-CAM of models for sex prediction using color fundus photography. However, the size of the samples used for model training in their study was much larger (284,335 persons vs 2,844 persons in the present study). The results showed that by using the same number of datasets for training, the accuracy for sex prediction using infrared fundus photography (61.4%) is lower than that using B-scan OCT images (73.1%). This is compatible with the deduction that the foveal contour, which is more clearly shown in B-scan OCT, is much different between male and female. Such results can offer the ophthalmologists an idea to define the foveal contour with a variety of parameters and work on the difference of foveal contour between male and female in order to clarify the sexual correlation with foveal structure-related diseases, such as epiretinal membrane and macular hole. The results of the present study would be helpful for further epidemiological, anatomical, and pathogenic studies of macular structure and its associated diseases.

As for the prediction of age using macular OCT via deep learning models, the MAE for age prediction in this study was 5.78 years for a study population with a mean age of 55.2 ± 15.0 years. Such a result was compatible with the results reported by Poplin et al⁸, in which the MAE for age prediction is 3.26 years for their study population with a mean age of 56.8 ± 8.2 years. This indicates that macular OCT may contain more age-related information since only 6,147 sets of OCT images from 3,134 patients were used for model training and validation in this study, compared to more than 1 million color fundus images from 284,335 patients used in the study by Poplin et al.⁸ Furthermore, the attention maps in their study showed that the model for age prediction focused mostly on retinal vessels. However, the images used in this study for model training and validation are B-scan OCT images, in which only the cross-section image of retinal vessels can be hardly seen. Interestingly, the Grad-CAM in the present study focused on the whole retinal layers of the macula for age prediction. Such a result is compatible with the previous study showing that retinal thickness decreases with age²⁵. Although the choroidal thickness also decreases with age²⁶, the attention maps in the present study did not focus on choroid. It is possible that the OCT used in this study is spectral-domain OCT without enhanced depth imaging, so the whole choroid could be clearly shown. However, since the variations in retinal thickness and choroidal thickness among the normal population of the same age are high, the thickness itself may not be enough for age differentiation. Further studies are needed for evaluating age-related changes in the retina and choroid.

In conclusion, it was found that deep learning can be used to unveil the sex and age-related difference in macular OCT. Some difference exists in the anatomical structure of the foveal pit between male and female, and this may be related to the sexual difference in the prevalence of some macular structural diseases including epiretinal membrane and macular hole. The present study offers important findings for further studies in the pathogenesis of sex-related or age-related macular structural diseases.

Methods

Data acquisition

The data used in this study were obtained from patients who were older than 20 years old and have undergone macular OCT examination for preoperative screening for refractive surgery or cataract surgery. Eyes with vitreous macular traction, epiretinal membrane, macular hole, macular edema, choroidal neovascularization, posterior scleral staphyloma, and any other obvious macular structural abnormalities were excluded from this study.

The size of the B-scan image from the OCT volume scan is 496 pixels ×512 pixels. The axial pixel resolution of the image is 3.9 μm, the lateral pixel resolution is 11 μm, and the distance between each image is 240 μm. This study uses convolutional neural networks to predict age and sex, so the image is labeled with age and sex information obtained from the electronic records.

Data pre-processing

To make the classification easier, the region of interest was automatically cropped from the original OCT images as shown in Figure 7. First, a certain cropping rectangle was designed in the size of 512 pixels × 256 pixels. Then, from the top of the images, the pixel values within the cropping rectangle are added up and the cropping rectangle moves down one pixel to calculate again until the rectangle reaches the bottom of the image. Finally, the cropped region with the highest total pixel values was chosen for analysis.

Figure. 7 Image cropping method.

a. Original OCT image. b. Cropped image after the data pre-processing method.

Model development

In this study, the pretrained Resnet18 on ImageNet²⁷ was used for training. The model’s input was adjusted to be one channel (input size: 512×256), and the output was adjusted to be two ways according to the type of prediction. Supplementary 4 describes the models with more details.

Code availability

The deep learning models developed in this work employed standard libraries and scripts that are publicly available from https://pytorch.org/.

Data availability

The authors declare that the main data supporting the results in this study are available, with restrictions, from Yi-Ting Hsieh under request.

Funding

This study was sponsored by a research grant from the Ministry of Science and Technology of Taiwan (109-2221-E-002-151-).

References

↵
Curcio, C. A. & Allen, K. A. Topography of ganglion cells in human retina. Journal of comparative Neurology 300, 5–25 (1990).
OpenUrl CrossRef PubMed Web of Science
↵
Wagner-Schuman, M. et al. Race-and sex-related differences in retinal thickness and foveal pit morphology. Investigative ophthalmology & visual science 52, 625–634 (2011).
OpenUrl Abstract/FREE Full Text
Wong, A., Chan, C. & Hui, S. Relationship of gender, body mass index, and axial length with central retinal thickness using optical coherence tomography. Eye 19, 292–297 (2005).
OpenUrl CrossRef PubMed Web of Science
Kelty, P. J. et al. Macular thickness assessment in healthy eyes based on ethnicity using Stratus OCT optical coherence tomography. Investigative ophthalmology & visual science 49, 2668–2672 (2008).
OpenUrl Abstract/FREE Full Text
Kashani, A. H. et al. Retinal thickness analysis by race, gender, and age using Stratus OCT. American journal of ophthalmology 149, 496-502. e491 (2010).
OpenUrl CrossRef PubMed Web of Science
Pokharel, A., Shrestha, G. S. & Shrestha, J. B. Macular thickness and macular volume measurements using spectral domain optical coherence tomography in normal Nepalese eyes. Clinical ophthalmology (Auckland, NZ) 10, 511 (2016).
OpenUrl
↵
Song, W. K., Lee, S. C., Lee, E. S., Kim, C. Y. & Kim, S. S. Macular thickness variations with sex, age, and axial length in healthy subjects: a spectral domain–optical coherence tomography study. Investigative ophthalmology & visual science 51, 3913–3918 (2010).
OpenUrl Abstract/FREE Full Text
↵
Poplin, R. et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nature Biomedical Engineering 2, 158 (2018).
OpenUrl
↵
Bringmann, A. et al. The primate fovea: Structure, function and development. Progress in retinal and eye research 66, 49–84 (2018).
OpenUrl CrossRef PubMed
↵
Hsieh, Y., Ma IH. in APVRS Congress 2019 (Shanghai, China, 2019).
↵
McCannel, C. A., Ensminger, J. L., Diehl, N. N. & Hodge, D. N. Population-based incidence of macular holes. Ophthalmology 116, 1366–1369 (2009).
OpenUrl CrossRef PubMed Web of Science
↵
Xiao, W., Chen, X., Yan, W., Zhu, Z. & He, M. Prevalence and risk factors of epiretinal membranes: a systematic review and meta-analysis of population-based studies. BMJ open 7, e014644 (2017).
OpenUrl Abstract/FREE Full Text
↵
Sidd, R. J., Fine, S. L., Owens, S. L. & Patz, A. Idiopathic preretinal gliosis. American journal of ophthalmology 94, 44–48 (1982).
OpenUrl CrossRef PubMed Web of Science
Pascolini, D. et al. 2002 global update of available data on visual impairment: a compilation of population-based prevalence studies. Ophthalmic epidemiology 11, 67–115 (2004).
OpenUrl CrossRef PubMed Web of Science
↵
Congdon, N. et al. Causes and prevalence of visual impairment among adults in the United States. Archives of Ophthalmology (Chicago, Ill.: 1960) 122, 477–485 (2004).
OpenUrl CrossRef
↵
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. nature 521, 436–444 (2015).
OpenUrl CrossRef PubMed
↵
Saha, S. et al. Automated detection and classification of early AMD biomarkers using deep learning. Scientific reports 9, 1–9 (2019).
OpenUrl
Kuwayama, S. et al. Automated detection of macular diseases by optical coherence tomography and artificial intelligence machine learning of optical coherence tomography images. Journal of ophthalmology 2019 (2019).
Kugelman, J. et al. Automatic choroidal segmentation in OCT images using supervised deep learning methods. Scientific reports 9, 1–13 (2019).
OpenUrl
Perdomo, O. et al. Classification of diabetes-related retinal diseases using a deep learning approach in optical coherence tomography. Computer methods and programs in biomedicine 178, 181–189 (2019).
OpenUrl
Wang, J. et al. Deep learning for quality assessment of retinal OCT images. Biomedical Optics Express 10, 6057–6072 (2019).
OpenUrl
↵
Lu, W. et al. Deep learning-based automated classification of multicategorical abnormalities from optical coherence tomography images. Translational Vision Science & Technology 7, 41–41 (2018).
OpenUrl
↵
Selvaraju, R. R. et al. in Proceedings of the IEEE international conference on computer vision. 618–626.
↵
Delori, F. C., Goger, D. G., Keilhauer, C., Salvetti, P. & Staurenghi, G. Bimodal spatial distribution of macular pigment: evidence of a gender relationship. JOSA A 23, 521–538 (2006).
OpenUrl
↵
Alamouti, B. & Funk, J. Retinal thickness decreases with age: an OCT study. British journal of ophthalmology 87, 899–901 (2003).
OpenUrl Abstract/FREE Full Text
↵
Zhou, H. et al. Age-Related changes in choroidal thickness and the volume of vessels and stroma using Swept-Source OCT and fully automated algorithms. Ophthalmology Retina 4, 204–215 (2020).
OpenUrl
↵
Russakovsky, O. et al. Imagenet large scale visual recognition challenge. International journal of computer vision 115, 211–252 (2015).
OpenUrl