Alzheimer's disease diagnosis based on multiple cluster dense convolutional networks

https://doi.org/10.1016/j.compmedimag.2018.09.009

Highlights

  • We propose a classification method based on multiple cluster dense convolutional neural networks (DenseNets) using MR images for AD diagnosis.

  • The whole brain image is first partitioned into different local regions. A number of patches are extracted from each region and grouped into different clusters. Multiple cluster DenseNets are built to learn the patch-level features which are further aggregated for region-level representations. The final image classification is made by combining the representations of different regions. The features of MR brain images are gradually learned from the local patches to global image level for the classification task.

  • The proposed method is data-driven: it jointly learns the features and the classifier without domain expert knowledge. No rigid registration or segmentation is required to preprocess the MR images.

  • Experiments on the ADNI database demonstrate the promising classification performance of the proposed algorithm.

Abstract

Alzheimer's disease (AD) is an irreversible neurodegenerative disorder with progressive impairment of memory and cognitive functions. Structural magnetic resonance images (MRI) play an important role in evaluating the anatomical changes of the brain for AD diagnosis. Machine learning technologies have been widely studied for MRI computation and analysis, with the aim of quantitative evaluation and computer-aided diagnosis of AD. Most existing methods extract handcrafted features after image processing steps such as registration and segmentation, and then train a classifier to distinguish AD subjects from other groups. Motivated by the success of deep learning in image classification, this paper proposes a classification method based on multiple cluster dense convolutional neural networks (DenseNets) that learns various local features of MR brain images and combines them for AD classification. First, we partition the whole brain image into different local regions and extract a number of 3D patches from each region. Second, the patches from each region are grouped into different clusters with the K-Means clustering method. Third, we construct a DenseNet to learn the patch features for each cluster, and the features learned from the discriminative clusters of each region are ensembled for classification. Finally, the classification results from different local regions are combined to enhance the final image classification. The proposed method gradually learns the MRI features from the local patch level to the global image level for the classification task. No rigid registration or segmentation is required to preprocess the MR images. Our method is evaluated using T1-weighted MRIs of 831 subjects, including 199 AD patients, 403 mild cognitive impairment (MCI) subjects and 229 normal control (NC) subjects, from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Experimental results show that the proposed method achieves an accuracy of 89.5% and an AUC (area under the ROC curve) of 92.4% for AD vs. NC classification, and an accuracy of 73.8% and an AUC of 77.5% for MCI vs. NC classification, demonstrating promising classification performance.

Introduction

Alzheimer's disease (AD) is an irreversible brain disorder with progressive impairment of memory and cognitive functions. It is the most common cause of dementia in late life. Mild cognitive impairment (MCI) is a transitional state between healthy aging and dementia, and it is usually considered a clinical precursor of AD. Currently, there is no effective cure for AD, but some treatments can delay its progression, especially if AD is diagnosed at an early stage. Early diagnosis is therefore important for patient care and treatment, yet accurate and early diagnosis of AD/MCI remains a challenging problem in the clinic. Magnetic resonance images (MRI), including structural MRI (sMRI) and functional MRI (fMRI), are non-invasive and powerful imaging tools that help to understand and evaluate the anatomical and functional neural changes related to AD (Herrup, 2011; Jr et al., 2011; Liu et al., 2014). In recent years, extensive efforts have been made to develop computer-aided systems that use various machine learning methods to decode the disease state from MR images (Herrup, 2011; Jr et al., 2011; Zhang et al., 2011; Liu et al., 2013; Suk et al., 2015).

Since the raw MR brain image is too large to be used directly for classification, it is necessary to preprocess the MR images and to perform feature extraction and classification for disease diagnosis. One of the most widely used approaches is to partition the image into multiple anatomical regions, i.e., regions of interest (ROIs), by warping a labeled atlas, and to compute regional measurements such as volumes as the features for AD classification (Herrup, 2011; Zhang et al., 2011; Suk et al., 2015). For feature selection, a discriminative multi-task method was proposed to select the most discriminative features from 93 ROIs for multi-modality classification of AD/MCI (Ye et al., 2016). Furthermore, a hierarchical feature and sample selection framework was proposed to gradually select informative features from 93 predefined ROIs and discard ambiguous samples to improve classifier learning (Le et al., 2017). A multi-kernel learning method was combined with marginal Fisher analysis to simultaneously select a subset of the relevant brain ROIs and learn a dimensionality transformation (Cao et al., 2017). In addition to the ROI features, deep learning networks were recently used to extract latent high-level features from ROI measurements for AD classification (Zhang et al., 2011; Suk et al., 2015). A stacked autoencoder was investigated to learn latent high-level features from ROIs and improve classification performance (Suk et al., 2015). A novel diagnostic framework was proposed that uses a stacked autoencoder to learn high-level features of ROIs and a zero-masking strategy to fuse data from multiple image modalities (Liu et al., 2015a, b). Although promising results of brain image analysis have been reported, the ROI-based methods still have some limitations. First, the definition of ROIs requires long-term accumulated experience of researchers. Second, the segmentation of ROIs is affected by individual differences and by subjective factors of the researchers. Third, the morphological abnormalities caused by brain disorders do not always occur in the predefined ROIs; they may involve multiple ROIs or only part of an extracted ROI, so the performance may not be stable.

Instead of partitioning the brain image into ROIs, a landmark-based feature extraction method was proposed for fast AD diagnosis without nonlinear registration and tissue segmentation (Zhang et al., 2016). A number of landmark points were detected based on shape constraint and the morphological features were extracted from the landmarks to train a linear SVM classifier for AD diagnosis. Furthermore, the landmark-based method was extended for analysis of longitudinal MR images (Zhang et al., 2017). The high-level statistical spatial and contextual longitudinal features were extracted from the landmarks to capture the spatial structural abnormalities and longitudinal variations, which were input to train a linear SVM classifier for AD diagnosis. The circular harmonic functions (CHFs) were investigated to extract the local features from the most involved areas of the disease: Hippocampus and Posterior Cingulate Cortex (PCC) in three brain projections and classify the brain images (Ben et al., 2015).

In recent years, deep learning methods have been widely investigated to jointly learn image features and class discrimination for image classification and computer vision (Simonyan and Zisserman, 2014; Zhu et al., 2017). They have also achieved great success in learning features and identifying patterns for medical image analysis and computer-aided disease diagnosis (Shen et al., 2017). Unlike traditional methods that extract handcrafted features with domain-specific knowledge, deep learning constructs a deep neural network architecture to learn hierarchical representations from the raw image data, so complex patterns can be identified. Convolutional neural networks (CNNs) were investigated to learn the features of MR brain images for AD diagnosis (Adrien, 2015; Hosseini-Asl et al., 2016). A deep 3D convolutional neural network (3D-CNN) was built upon a 3D convolutional autoencoder to capture the anatomical shape variations of structural MRI scans and predict AD (Hosseini-Asl et al., 2016). This method can learn features from the raw image data to capture AD biomarkers and adapt to datasets from different domains. A deep learning classification algorithm was proposed for AD diagnosis using both structural and functional MRI (Adrien, 2015). In this method, the CNN model was built with one convolutional layer trained with a sparse autoencoder, which was used to extract the imaging features for AD classification. The above methods can learn features that capture AD biomarkers via a convolutional network, but they require convolutional filters pretrained with an autoencoder on carefully preprocessed data to extract the features, which are then classified for the task-specific target. A landmark-based deep feature learning (LDFL) framework was proposed for automatic diagnosis of AD using MRI (Liu et al., 2018). A number of discriminative anatomical landmarks were identified in a data-driven manner, and a set of patches was extracted from the landmarks to build a deep CNN for automatic extraction of patch-based representations from MRI. A novel deep ensemble sparse regression network was proposed that combines sparse regression and deep learning for diagnosis and prognosis of AD and MCI (Suk et al., 2017). By regarding the response values of the sparse regression models as target-level representations, a deep CNN was built for clinical decision making. A classification method was proposed that ensembles multiple deep 3D convolutional neural networks (3D-CNNs) to learn various features from local brain regions for AD classification, which can alleviate the problem of a small number of training samples (Cheng and Wang, 2017).

Recently, DenseNet (Huang et al., 2016) was proposed as a new structure of deep convolutional neural network, which connects each layer to every other layer in a feed-forward fashion to capture and reuse the rich features of different layers and thus achieves better performance than other CNNs. Motivated by the success of DenseNet in computer vision, this paper proposes a novel classification method based on a combination of multiple cluster DenseNets for MR brain image classification and disease diagnosis. Instead of extracting regions of interest (ROIs) predefined by human experts, we uniformly partition the whole brain image into 3 × 3 × 3 regions, and a number of 3D patches are sampled from each region. Then, K-means clustering is applied to group the patches from one region into different clusters, and a deep DenseNet is trained for each cluster. The features learned by the multiple cluster DenseNets are aggregated into a representation of the local region. Finally, the classification results of the multiple local regions are combined to make the final classification. Compared to the existing methods, our proposed method has the following advantages: 1) It alleviates the problem of training a DenseNet on a small image set. Training a DenseNet usually requires a large image set, which is not available for AD diagnosis. Instead of training a deep DenseNet on the whole brain image, we build a DenseNet for each cluster and train it on a number of local image patches sampled from the image region. 2) No tissue or ROI segmentation is required in image processing, which simplifies the diagnosis procedure and saves computation cost. 3) No rigid registration is required before feature extraction, which further reduces the computation cost. Clustering is used to group similar image patches into clusters, which makes the method robust to image variations.
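The partition, patch sampling and clustering steps described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the helper names (`partition_bounds`, `sample_corners`, `extract_patch`, `kmeans`), the toy volume and patch sizes, the use of flattened patch intensities as clustering features, and the plain Lloyd iterations (standing in for whichever K-means variant the authors used) are all hypothetical choices.

```python
import random
from itertools import product


def partition_bounds(shape, grid=(3, 3, 3)):
    """Return half-open (start, stop) bounds per axis for every grid cell."""
    per_axis = []
    for size, parts in zip(shape, grid):
        edges = [i * size // parts for i in range(parts + 1)]
        per_axis.append(list(zip(edges[:-1], edges[1:])))
    return list(product(*per_axis))


def sample_corners(bounds, patch, n, rng):
    """Draw n random patch corners so that each patch lies inside the region."""
    return [
        tuple(rng.randint(lo, hi - p) for (lo, hi), p in zip(bounds, patch))
        for _ in range(n)
    ]


def extract_patch(volume, corner, patch):
    """Flatten the patch at `corner` of a nested-list volume into a tuple."""
    (z, y, x), (dz, dy, dx) = corner, patch
    return tuple(
        volume[i][j][k]
        for i in range(z, z + dz)
        for j in range(y, y + dy)
        for k in range(x, x + dx)
    )


def kmeans(points, k, iters=20, rng=None):
    """Plain Lloyd iterations (assumed variant); returns (labels, centers)."""
    rng = rng or random.Random(0)
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: mean of the members; keep the old center if empty.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers


# Toy run: sizes are illustrative only, not the values used in the paper.
rng = random.Random(0)
shape, patch = (12, 12, 12), (2, 2, 2)
volume = [[[rng.random() for _ in range(shape[2])]
           for _ in range(shape[1])]
          for _ in range(shape[0])]
regions = partition_bounds(shape)  # 27 regions for a 3 x 3 x 3 grid
corners = sample_corners(regions[0], patch, n=20, rng=rng)
patches = [extract_patch(volume, c, patch) for c in corners]
labels, _ = kmeans(patches, k=2, rng=rng)
print(len(regions), len(patches), sorted(set(labels)))
```

In the full method each cluster would then be used to train its own DenseNet; that training step is omitted from this sketch.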

The rest of this paper is organized as follows. Section 2 presents the materials and the proposed method in detail. Section 3 reports the experiments and results. Section 4 concludes the paper.

Section snippets

Proposed method

In this section, we will present the proposed classification framework in detail. Our proposed method makes no assumption on a specific neuroimaging modality. The T1-weighted MR brain images are widely available, non-invasive and often used as the first biomarker in AD diagnosis. Thus, they are used to test the proposed method in this work. For brain image analysis, one direct way is to build a deep DenseNet with the whole 3D image for feature learning and classification jointly. However,

Experimental results

In this section, we will first introduce the image datasets and the implementation of our proposed method. Then, we will present extensive experiments that test the proposed method on the classification of AD vs. NC and MCI vs. NC. We will further compare our proposed method with other methods reported in the literature and discuss the results.
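The two metrics reported in the abstract for each binary task, accuracy and AUC, can be computed from classifier outputs as in the following minimal sketch. The function names and the toy labels and scores are hypothetical, and the AUC is computed with the pairwise-ranking formulation (the probability that a randomly chosen positive subject scores higher than a randomly chosen negative one, with ties counted as one half), which may differ from the tooling the authors used.

```python
def accuracy(y_true, y_pred):
    """Fraction of subjects whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def roc_auc(y_true, scores):
    """Area under the ROC curve via pairwise ranking of positives vs. negatives."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example: 1 = patient (e.g. AD), 0 = normal control; scores are made up.
y_true = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.6, 0.1]
y_pred = [int(s >= 0.5) for s in scores]  # 0.5 threshold is an assumption
print(accuracy(y_true, y_pred), roc_auc(y_true, scores))
```

Accuracy depends on the chosen decision threshold, whereas AUC summarizes the ranking quality over all thresholds, which is why the two numbers are reported side by side.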

Conclusion

This paper has proposed a classification method based on combination of multiple cluster DenseNets for AD and MCI diagnosis using MR brain images. The whole brain image is partitioned into a number of local regions and a number of 3D patches are extracted from each image region. K-means clustering is used to group patches with similar spatial structure into several clusters and a DenseNet is built and trained for each cluster to extract the patch-level features. The features learned by multiple

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (NSFC) under grants No. 6181101049, 61773263 and 61375112, the National Key Research and Development Program of China (No. 2016YFC0100903), and the SMC Excellent Young Faculty program of SJTU.

References (32)

  • J.A. Hartigan et al. Algorithm AS 136: a K-Means clustering algorithm. Appl. Stat. (1979)

  • K. He et al. Deep residual learning for image recognition. IEEE Conference on Computer Vision and Pattern Recognition (2016)

  • E. Hosseini-Asl et al. Alzheimer’s disease diagnostics by adaptation of 3D convolutional network. 2016 IEEE International Conference on Image Processing (ICIP) (2016)

  • G. Huang et al. Densely Connected Convolutional Networks (2016)

  • J.C. Jr et al. Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dementia J. Alzheimers Assoc. (2011)

  • A. Le et al. A hierarchical feature and sample selection framework and its application for Alzheimer’s disease diagnosis. Sci. Rep. (2017)