Alzheimer's disease diagnosis based on multiple cluster dense convolutional networks

doi:10.1016/j.compmedimag.2018.09.009

Computerized Medical Imaging and Graphics

Volume 70, December 2018, Pages 101-110

https://doi.org/10.1016/j.compmedimag.2018.09.009 Get rights and content

Highlights

•
We propose a classification method based on multiple cluster dense convolutional neural networks (DenseNets) using MR images for AD diagnosis.
•
The whole brain image is first partitioned into different local regions. A number of patches are extracted from each region and grouped into different clusters. Multiple cluster DenseNets are built to learn the patch-level features which are further aggregated for region-level representations. The final image classification is made by combining the representations of different regions. The features of MR brain images are gradually learned from the local patches to global image level for the classification task.
•
The proposed method is a data driven method by jointly learning the features and classification without the domain expert knowledge. There are no rigid registration and segmentation required for preprocessing MR images.
•
Our experiments on ADNI database have demonstrated the promising classification performances of the proposed algorithm.

Abstract

Alzheimer's disease (AD) is an irreversible neurodegenerative disorder with progressive impairment of memory and cognitive functions. Structural magnetic resonance images (MRI) play important role to evaluate the brain anatomical changes for AD Diagnosis. Machine learning technologies have been widely studied on MRI computation and analysis for quantitative evaluation and computer-aided-diagnosis of AD. Most existing methods extract the hand-craft features after image processing such as registration and segmentation, and then train a classifier to distinguish AD subjects from other groups. Motivated by the success of deep learning in image classification, this paper proposes a classification method based on multiple cluster dense convolutional neural networks (DenseNets) to learn the various local features of MR brain images, which are combined for AD classification. First, we partition the whole brain image into different local regions and extract a number of 3D patches from each region. Second, the patches from each region are grouped into different clusters with the K-Means clustering method. Third, we construct a DenseNet to learn the patch features for each cluster and the features learned from the discriminative clusters of each region are ensembled for classification. Finally, the classification results from different local regions are combined to enhance final image classification. The proposed method can gradually learn the MRI features from the local patches to global image level for the classification task. There are no rigid registration and segmentation required for preprocessing MRI images. Our method is evaluated using T1-weighted MRIs of 831 subjects including 199 AD patients, 403 mild cognitive impairment (MCI) and 229 normal control (NC) subjects from Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Experimental results show that the proposed method achieves an accuracy of 89.5% and an AUC (area under the ROC curve) of 92.4% for AD vs. NC classification, and an accuracy of 73.8% and an AUC of 77.5% for MCI vs. NC classification, demonstrating the promising classification performances.

Introduction

Alzheimer's disease (AD) is an irreversible brain disorder with progressive impairment of the memory and cognitive functions. It is the most common case of dementia in the late life of humans. Mild cognitive impairment (MCI) is a transitional state from healthy to dementia and it is usually considered as a clinical precursor of AD. Currently, there are no effective cure for AD. But some treatments can be developed to delay its progression, especially if AD can be diagnosed at an early stage. Thus, its early diagnosis is important for patient care and treatment. But it is still a challenging problem for accurate and early diagnosis of AD/MCI in clinic. Magnetic resonance images (MRI) including structural magnetic resonance images (sMRI) and functional MRI (fMRI) are non-invasive and powerful imaging tools to help understand and evaluate the anatomical and functional neural changes related to AD (Herrup, 2011; Jr et al., 2011; Liu et al., 2014). In recent years, extensive efforts have been done to develop computer-aided system using the various machine learning methods to decode the disease states with MR images (Herrup, 2011; Jr et al., 2011; Zhang et al., 2011; Liu et al., 2013; Suk et al., 2015).

Since the raw MR brain image is too huge to be directly used for classification, it is necessary to preprocess the MR images and perform the feature extraction and classification for disease diagnosis. One of the most widely used methods is to partition the image into multiple anatomical regions, i.e., regions of interest (ROIs), through the warping of a labeled atlas, and the regional measurements such as volumes are computed as the features for AD classification (Herrup, 2011; Zhang et al., 2011; Suk et al., 2015). For feature selection, a discriminative multi-task method was proposed to select the most discriminative features from 93 ROIs for multi-modality classification of AD/MCI (Ye et al., 2016). Furthermore, a hierarchical feature and sample selection framework was proposed to gradually select informative features from 93 predefined ROIs and discard ambiguous samples for improving classifier learning (Le et al., 2017). A multi-kernel learned method was combined with marginal fisher analysis to simultaneously select a subset of the relevant brain ROIs and learn a dimensionality transformation (Cao et al., 2017). In addition to the ROI features, deep learning networks were recently used to extract the latent high-level features from measurements of ROIs for AD classification (Zhang et al., 2011; Suk et al., 2015). A stacked autoencoder was investigated to learn the latent high-level features from ROIs for improvement of classification performance (Suk et al., 2015). A novel diagnostic framework with stacked autoencoder was proposed to learn high-level features of ROIs and with a zero-masking strategy for data fusion of multiple image modalities (Liu et al., 2015aa and b). Although promising results of brain image analysis have been reported, there are still some limitations in the ROI based methods. First, the definition of ROIs requires the accumulation of long-term experience of researchers. Second, the segmentation of ROIs is also affected by the individual differences and subjective factors of scientific researches. Third, the morphological abnormalities caused by the brain disorders do not always occur in the pre-defined ROIs, and they may involve multiple ROIs or part of the extracted ROI, so the performance may not be stable.

Instead of partitioning the brain image into ROIs, a landmark-based feature extraction method was proposed for fast AD diagnosis without nonlinear registration and tissue segmentation (Zhang et al., 2016). A number of landmark points were detected based on shape constraint and the morphological features were extracted from the landmarks to train a linear SVM classifier for AD diagnosis. Furthermore, the landmark-based method was extended for analysis of longitudinal MR images (Zhang et al., 2017). The high-level statistical spatial and contextual longitudinal features were extracted from the landmarks to capture the spatial structural abnormalities and longitudinal variations, which were input to train a linear SVM classifier for AD diagnosis. The circular harmonic functions (CHFs) were investigated to extract the local features from the most involved areas of the disease: Hippocampus and Posterior Cingulate Cortex (PCC) in three brain projections and classify the brain images (Ben et al., 2015).

In recent years, the deep learning methods have been widely investigated to jointly learn the features from the images and class discrimination for image classification and computer vision (Simonyan and Zisserman, 2014; Zhu et al., 2017). They also achieved great success to learn the feature and identify the patterns for medical image analysis and computer aided disease diagnosis (Shen et al., 2017). Different from the traditional methods that extracts the handcrafted features with domain specific knowledge, deep learning can construct a deep neural network architecture to learn the hierarchical representations from the raw image data. Thus, the complex patterns can be identified with deep learning. Convolutional neural networks (CNNs) were investigated to learn the features of MR brain images for AD diagnosis (Adrien, 2015, Hosseini-Asl et al., 2016). A deep 3D convolutional neural network (3D-CNN) was built upon a 3D convolutional Autoencoders to capture the anatomical shape variations of the structural MRI scans to predict AD (Hosseini-Asl et al., 2016). This method can learn the features from the raw image data to capture AD biomarkers and adapt to different domain datasets. A deep learning classification algorithm was proposed for AD diagnosis using both structural and functional MRI (Adrien, 2015). In this method, the CNN model was built with one convolutional layer trained with sparse Autoencoder, which was explored to extract the imaging features for AD classification. The above methods can learn the features capturing AD biomarkers via convolutional network. But they require the convolutional filters pretrained on Autoencoder with carefully preprocessed data to extract features and then classify them for task-specific target. A landmark based deep feature learning (LDFL) framework was proposed for automatic diagnosis of AD using MRI (Liu et al., 2018). A number of discriminative anatomical landmarks were identified in a data-driven manner and a set of patches were extracted from the landmarks to build a deep CNN for automatical extraction of patch-based representation from MRI. A novel deep ensemble sparse regression network was proposed that combines the sparse regression and deep learning for diagnosis and prognosis of AD and MCI (Suk et al., 2017). By regarding the response values of the sparse regression models as target-level representations, a deep CNN was built for clinical decision making. A classification method was proposed by ensemble of multiple deep 3D convolutional neural networks (3D-CNNs) to learn the various features from local brain regions for AD classification, which can alleviate the problem of small number of training samples (Cheng and Wang, 2017).

Recently, DenseNet (Huang et al., 2016) was proposed as a new structure of deep convolutional neural network, which connects each layer to every other layers in a feed-forward fashion to capture and reuse the rich features of different layers and thus achieves better performance than other CNN. Motivated by the success of DenseNet in computer vison, this paper proposes a novel classification method based on combination of multiple cluster DenseNets for MR brain image classification and disease diagnosis. Instead of extracting the region of interests (ROIs) predefined by human experts, we uniformly partition the whole brain image into 3 × 3 × 3 different regions and a number of 3D patches are sampled and extracted from each region. Then, K-means clustering is applied to group the patches from one region into different clusters and a deep DenseNet is trained for each cluster. The features learned by multiple cluster DenseNets are aggregated for the representation of local region. Finally, classification results of multiple local regions are combined to make the final classification. Compared to the existing methods, our proposed method has the following advantages: 1) it can alleviate the problem of small image set on training DenseNet. Usually training a DenseNet requires a large image set, which is not applicable for AD diagnosis. Instead of training a deep DenseNet with the whole brain image, we can build a DenseNet on each cluster with a number of local image patches sampled from image region for network training. 2) No tissue and ROI segmentations are required in image processing, which can simplify the diagnosis procedure and save the computation costs. 3) No rigid registration is required before feature extraction, which can reduce the computation costs. Clustering is used to group similar image patches into clusters, which can achieve the robustness of image variances.

The rest of this paper is organized as follows. In Section 2, we present the materials and the proposed method in details. In Section 3, we provide the experiments and results. A conclusion will be given in Section 4.

Section snippets

Proposed method

In this section, we will present the proposed classification framework in detail. Our proposed method makes no assumption on a specific neuroimaging modality. The T1-weighted MR brain images are widely available, non-invasive and often used as the first biomarker in AD diagnosis. Thus, they are used to test the proposed method in this work. For brain image analysis, one direct way is to build a deep DenseNet with the whole 3D image for feature learning and classification jointly. However,

Experimental results

In this section, we will first introduce the image datasets and implementation of our proposed method. Then, we will present the extensive experiments to test the proposed method on classifications of AD vs. NC and MCI vs.NC. We will further compare our proposed method with other methods reported in the literature and give the discussion.

Conclusion

This paper has proposed a classification method based on combination of multiple cluster DenseNets for AD and MCI diagnosis using MR brain images. The whole brain image is partitioned into a number of local regions and a number of 3D patches are extracted from each image region. K-means clustering is used to group patches with similar spatial structure into several clusters and a DenseNet is built and trained for each cluster to extract the patch-level features. The features learned by multiple

Acknowledgments

This work was supported in part by National Natural Science Foundation of China (NSFC) under grants (No. 6181101049, 61773263, 61375112), The National Key Research and Development Program of China (No.2016YFC0100903) and SMC Excellent Young Faculty program of SJTU.

References (32)

K. Herrup
Commentary on "Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease." Addressing the challenge of Alzheimer’s disease in the 21st century
Alzheimers Dementia J. Alzheimers Assoc.
(2011)
N. Kabani et al.
A 3D atlas of the human brain
Neuroimage
(1998)
M. Liu et al.
Ensemble sparse classification of Alzheimer’s disease
Neuroimage
(2012)
F. Liu et al.
Inter-modality relationship constrained multi-modality multi-task feature selection for Alzheimer’s disease and mild cognitive impairment identification
Neuroimage
(2014)
H.I. Suk et al.
Deep ensemble learning of sparse regression models for brain disease diagnosis
Med. Image Anal.
(2017)
D. Zhang et al.
Multimodal classification of Alzheimer’s disease and mild cognitive impairment
Neuroimage
(2011)
Pa.G.M. Adrien
Predicting Alzheimer’s Disease: A Neuroimaging Study with 3d Convolutional Neural Networks
(2015)
A.O. Ben et al.
Alzheimer’s disease diagnosis on structural MR images using circular harmonic functions descriptors on hippocampus and posterior cingulate cortex
Comput. Med. Imaging Graph.
(2015)
P. Cao et al.
Nonlinearity-aware based dimensionality reduction and over-sampling for AD/MCI classification from MRI measures
Comput. Biol. Med.
(2017)
D. Cheng et al.
Classification of MR brain images by combination of multi-CNNs for AD diagnosis
International Conference on Digital Image Processing
(2017)

J.A. Hartigan et al.

Algorithm AS 136: a K-Means clustering algorithm

Appl. Stat.

(1979)

K. He et al.

Deep residual learning for image recognition

IEEE Conference on Computer Vision and Pattern Recognition

(2016)

E. Hosseini-Asl et al.

Alzheimer’s disease diagnostics by adaptation of 3d convolutional network

2016 IEEE International Conference on Image Processing (Icip)

(2016)

G. Huang et al.

Densely Connected Convolutional Networks

(2016)

J.C. Jr et al.

Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease

Alzheimers Dementia J. Alzheimers Assoc.

(2011)

A. Le et al.

A hierarchical feature and sample selection framework and its application for Alzheimer’s disease diagnosis

Sci. Rep.

(2017)

Cited by (134)

Deep neural network CSES-NET and multi-channel feature fusion for Alzheimer's disease diagnosis
2024, Biomedical Signal Processing and Control
Alzheimer's disease (AD) is an irreversible brain disease. The structural Magnetic Resonance Imaging (sMRI) has been widely used in the diagnosis of AD. However, the characteristic information from a single-mode is not comprehensive. In this paper, we proposed a Convolutional- Squeeze-Excitation-Softmax-NET (CSES-NET) deep neural network combined with multi-channel feature fusion for the diagnosis of AD. First, three kinds of features were extracted including patches based on voxel morphology, cortical features based on surface morphology, and radiomics features. Next, the residual network CSES-NET was proposed to extract the deep features from the patch images in which the features were re-scaled in the residual structure in order to fit the correlation between channels. Then, the fused features of the three channels were applied to classify AD/EMCI/LMCI/NC with the fully connected neural network. Finally, radiomics and cortical features were combined with genetic data for genome-wide association study to assess genetic variants. We performed experiments with 1539 subjects from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. The experimental results verified that the proposed method improved the effectiveness of the model by extracting nonlinear deep features and fusing the multi-channel features. In addition, the genome-wide association study identified multiple risk SNPs loci which were associated with the pathological of AD and contributed to the early prevention and control of AD.
Diagnosis of Alzheimer's disease by joining dual attention CNN and MLP based on structural MRIs, clinical and genetic data
2023, Artificial Intelligence in Medicine
Alzheimer’s disease (AD) is an irreversible central nervous degenerative disease, while mild cognitive impairment (MCI) is a precursor state of AD. Accurate early diagnosis of AD is conducive to the prevention and early intervention treatment of AD. Although some computational methods have been developed for AD diagnosis, most employ only neuroimaging, ignoring other data (e.g., genetic, clinical) that may have potential disease information. In addition, the results of some methods lack interpretability. In this work, we proposed a novel method (called DANMLP) of joining dual attention convolutional neural network (CNN) and multilayer perceptron (MLP) for computer-aided AD diagnosis by integrating multi-modality data of the structural magnetic resonance imaging (sMRI), clinical data (i.e., demographics, neuropsychology), and APOE genetic data. Our DANMLP consists of four primary components: (1) the Patch-CNN for extracting the image characteristics from each local patch, (2) the position self-attention block for capturing the dependencies between features within a patch, (3) the channel self-attention block for capturing dependencies of inter-patch features, (4) two MLP networks for extracting the clinical features and outputting the AD classification results, respectively. Compared with other state-of-the-art methods in the 5CV test, DANMLP achieves 93% and 82.4% classification accuracy for the AD vs. MCI and MCI vs. NC tasks on the ADNI database, which is 0.2% $\sim$ 15.2% and 3.4% $\sim$ 26.8% higher than that of other five methods, respectively. The individualized visualization of focal areas can also help clinicians in the early diagnosis of AD. These results indicate that DANMLP can be effectively used for diagnosing AD and MCI patients.
DE-JANet: A unified network based on dual encoder and joint attention for Alzheimer's disease classification using multi-modal data
2023, Computers in Biology and Medicine
Structural magnetic resonance imaging (sMRI), which can reflect cerebral atrophy, plays an important role in the early detection of Alzheimer’s disease (AD). However, the information provided by analyzing only the morphological changes in sMRI is relatively limited, and the assessment of the atrophy degree is subjective. Therefore, it is meaningful to combine sMRI with other clinical information to acquire complementary diagnosis information and achieve a more accurate classification of AD. Nevertheless, how to fuse these multi-modal data effectively is still challenging. In this paper, we propose DE-JANet, a unified AD classification network that integrates image data sMRI with non-image clinical data, such as age and Mini-Mental State Examination (MMSE) score, for more effective multi-modal analysis. DE-JANet consists of three key components: (1) a dual encoder module for extracting low-level features from the image and non-image data according to specific encoding regularity, (2) a joint attention module for fusing multi-modal features, and (3) a token classification module for performing AD-related classification according to the fused multi-modal features. Our DE-JANet is evaluated on the ADNI dataset, with a mean accuracy of 0.9722 and 0.9538 for AD classification and mild cognition impairment (MCI) classification, respectively, which is superior to existing methods and indicates advanced performance on AD-related diagnosis tasks.
Evaluation of MRI-based machine learning approaches for computer-aided diagnosis of dementia in a clinical data warehouse
2023, Medical Image Analysis
A variety of algorithms have been proposed for computer-aided diagnosis of dementia from anatomical brain MRI. These approaches achieve high accuracy when applied to research data sets but their performance on real-life clinical routine data has not been evaluated yet. The aim of this work was to study the performance of such approaches on clinical routine data, based on a hospital data warehouse, and to compare the results to those obtained on a research data set. The clinical data set was extracted from the hospital data warehouse of the Greater Paris area, which includes 39 different hospitals. The research set was composed of data from the Alzheimer’s Disease Neuroimaging Initiative data set. In the clinical set, the population of interest was identified by exploiting the diagnostic codes from the 10th revision of the International Classification of Diseases that are assigned to each patient. We studied how the imbalance of the training sets, in terms of contrast agent injection and image quality, may bias the results. We demonstrated that computer-aided diagnosis performance was strongly biased upwards (over 17 percent points of balanced accuracy) by the confounders of image quality and contrast agent injection, a phenomenon known as the Clever Hans effect or shortcut learning. When these biases were removed, the performance was very poor. In any case, the performance was considerably lower than on the research data set. Our study highlights that there are still considerable challenges for translating dementia computer-aided diagnosis systems to clinical routine.
ResNet and its application to medical image processing: Research progress and challenges
2023, Computer Methods and Programs in Biomedicine
Deep learning, a novel approach and subset of machine learning, has drawn a growing amount of attention from computer vision researchers in recent years. This method has drawn a lot of interest because of its extraordinary ability to interpret medical pictures, especially when combined with residual neural networks, which have helped to progress the field.
In this paper, the following research is carried out on the residual network. First, the research status of ResNet in the medical field is introduced. The fundamental idea behind the residual neural network is then explained, along with the residual unit, its many structures, and the network architecture. Second, four aspects of the widespread use of residual neural networks in medical image processing are discussed: lung tumor, diagnosis of skin diseases, diagnosis of breast diseases, and diagnosis of diseases of the brain. Finally, the main issues and ResNet's future development in the area of processing medical images are discussed.
In the area of medical graph processing, residual neural networks have made strides and have had success in the clinical auxiliary diagnosis of serious illnesses such as lung tumors, breast cancer, skin conditions, and cardiovascular and cerebrovascular diseases.
We thoroughly sorted out the most recent developments in residual neural network research and their use in medical image processing, which serves as a crucial point of reference for this field of study. It offers a helpful reference for further promoting the application and research of the ResNet model in the field of medical image processing by summarising the application status and issues of the ResNet model in the field of medical image processing and putting forwards some future development directions.
Conv-Swinformer: Integration of CNN and shift window attention for Alzheimer's disease classification
2023, Computers in Biology and Medicine
Deep learning (DL) algorithms based on brain MRI images have achieved great success in the prediction of Alzheimer’s disease (AD), with classification accuracy exceeding even that of the most experienced clinical experts. As a novel feature fusion method, Transformer has achieved excellent performance in many computer vision tasks, which also greatly promotes the application of Transformer in medical images. However, when Transformer is used for 3D MRI image feature fusion, existing DL models treat the input local features equally, which is inconsistent with the fact that adjacent voxels have stronger semantic connections than spatially distant voxels. In addition, due to the relatively small size of the dataset for medical images, it is difficult to capture local lesion features in limited iterative training by treating all input features equally. This paper proposes a deep learning model Conv-Swinformer that focuses on extracting and integrating local fine-grained features. Conv-Swinformer consists of a CNN module and a Transformer encoder module. The CNN module summarizes the planar features of the MRI slices, and the Transformer module establishes semantic connections in 3D space for these planar features. By introducing the shift window attention mechanism in the Transformer encoder, the attention is focused on a small spatial area of the MRI image, which effectively reduces unnecessary background semantic information and enables the model to capture local features more accurately. In addition, the layer-by-layer enlarged attention window can further integrate local fine-grained features, thus enhancing the model’s attention ability. Compared with DL algorithms that indiscriminately fuse local features of MRI images, Conv-Swinformer can fine-grained extract local lesion features, thus achieving better classification results.

View all citing articles on Scopus

View full text

Alzheimer's disease diagnosis based on multiple cluster dense convolutional networks

Highlights

Abstract

Introduction

Section snippets

Proposed method

Experimental results

Conclusion

Acknowledgments

Alzheimers Dementia J. Alzheimers Assoc.

Neuroimage

Neuroimage

Neuroimage

Med. Image Anal.

Neuroimage

Predicting Alzheimer’s Disease: A Neuroimaging Study with 3d Convolutional Neural Networks

Alzheimer’s disease diagnosis on structural MR images using circular harmonic functions descriptors on hippocampus and posterior cingulate cortex

Comput. Med. Imaging Graph.

Nonlinearity-aware based dimensionality reduction and over-sampling for AD/MCI classification from MRI measures

Comput. Biol. Med.

Classification of MR brain images by combination of multi-CNNs for AD diagnosis

International Conference on Digital Image Processing

Algorithm AS 136: a K-Means clustering algorithm

Appl. Stat.

Deep residual learning for image recognition

IEEE Conference on Computer Vision and Pattern Recognition

Alzheimer’s disease diagnostics by adaptation of 3d convolutional network

2016 IEEE International Conference on Image Processing (Icip)

Densely Connected Convolutional Networks

Introduction to the recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease

Alzheimers Dementia J. Alzheimers Assoc.

A hierarchical feature and sample selection framework and its application for Alzheimer’s disease diagnosis

Sci. Rep.