Enhancing multi-center generalization of machine learning-based depression diagnosis from resting-state fMRI

Takashi Nakano; Masahiro Takamura; Naho Ichikawa; Go Okada; Yasumasa Okamoto; Makiko Yamada; Tetsuya Suhara; Shigeto Yamawaki; Junichiro Yoshimoto

doi:10.1101/19004051

Abstract

Resting-state fMRI has the potential to find abnormal behavior in brain activity and to diagnose patients with depression. However, resting-state fMRI has a bias depending on the scanner site, which makes it difficult to diagnose depression at a new site. In this paper, we propose methods to improve the performance of the diagnosis of major depressive disorder (MDD) at an independent site by reducing the site bias effects using regression. For this, we used a subgroup of healthy subjects of the independent site to regress out site bias. We further improved the classification performance of patients with depression by focusing on melancholic depressive disorder. Our proposed methods would be useful to apply depression classifiers to subjects at completely new sites.

Introduction

Depressive disorder is a mental disorder characterized by long-lasting low mood. The diagnosis of depressive disorder has traditionally been made through the interaction between patients and doctors. It is important to develop more objective ways to diagnose depressive disorder in order to increase the reliability and accuracy of the diagnosis.

The combination of machine learning and functional magnetic resonance imaging (fMRI) has been used to diagnose or to find the physiological characteristics of psychiatric disorders [1-11].

Recently, functional connectivity (the correlation coefficients of the brain activity between brain regions) easily calculated from resting-state fMRI data are being used for the diagnosis of psychiatric disorders, such as autism, obsessive-compulsive disorder, and depression [1-7]. For resting-state fMRI, spontaneous brain activity was measured from subjects lying in an fMRI scanner without any stimulations. Because of these advantages of resting-state fMRI, functional connectivity has the potential to become a clinical tool in wide-spread hospitals.

However, functional connectivity is affected by site bias [2, 12]. To overcome site bias, using data from as many multiple sites as possible for the training of classification algorithms can be effective. Even though data from many sites were used for machine learning, it is difficult to apply this to a completely new site data.

In addition to site bias, the heterogeneity of major depressive disorder would be a problem when the classifier is applied to new data. Abnormality of functional connectivity of depression might be different depending on the subtype of the depression. It would be possible to improve classification performance by focusing on a typical severe depression called melancholic depressive disorder. In this paper, we propose the methods to improve the performance of diagnosis of major depressive disorder (MDD) at an independent site by reducing the site bias effects. In addition, we investigated the performance depending on the classification algorithms and the heterogeneity of the major depressive disorder.

Materials and Methods

Subjects

One hundred sixty-three patients with MDD (age 20-75, average 44.1 ± 12.2) were recruited by 5 sites (the Psychiatry Department of Hiroshima University and collaborating medical institutions, Table 1). They were screened using the Mini International Neuropsychiatric Interview (M.I.N.I,) [13, 14], which enables medical doctors to identify psychiatric disorders according to DSM-IV criteria [15]. Patients had an initial MRI scan before or after starting medication within 0-2 weeks.

View this table:

Table 1.

Demographic data of study participants.

As a control group, 195 healthy subjects (age 20-75, average 39.1 ± 15.5) with no history of mental or neurological disease were recruited from the local community. All control subjects underwent the same self-assessment and examination administered to the MDD group. Thereafter, for the melancholic MDD classifier, the dataset was limited to have the subtype of melancholia (based on M.I.N.I.). The numbers of patients and healthy controls were set to be equal for each site except for the subjects from the National Institutes for Quantum and Radiological Science and Technology, in order to develop a classifier unbiased toward either group (see Table 1). The subjects from the National Institutes for Quantum and Radiological Science and Technology were used only for the test data (replication cohort).

The study protocol in this study was approved by the Ethics Committee of Hiroshima University and the Radiation Drug Safety Committee and by the institutional review board of the National Institutes for Quantum and Radiological Science and Technology, in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. The detailed properties of the subjects are shown in Table 1.

Acquisition and preprocessing of functional MRI data

fMRI scanners were used to generate magnetic resonance images. Functional data were collected using gradient echo planar imaging (EPI) sequences. High-resolution T1-weighted magnetization-prepared rapid gradient echo images were also acquired before scanning the functional data. In the scan room with dimmed lights, participants were instructed not to think of anything in particular, not to sleep, and keep looking at a cross mark at the center of the monitor screen. The first 4 to 7 images were discarded to allow magnetization to reach equilibrium. All participants underwent an approximately 5 to 10 min resting-state scan. The scanners and imaging parameters are different depending on the site (see Table 1 and 2).

View this table:

Table 2.

Imaging protocols for resting-state fMRI.

View this table:

Table 3.

Leave-One-Site-Out Cross-Validation

All the resting-state fMRI data were preprocessed using the identical procedures described in [7]. T1-weighted structural image and resting-state functional images were preprocessed using SPM8 (Wellcome Trust Centre for Neuroimaging, University College London, UK) on Matlab R2014a (Mathworks inc., USA). The functional images were preprocessed with slice-timing correction and realignment to the mean image. Thereafter, using the normalization parameters obtained through the segmentation of the structural image aligned with the mean functional image, the fMRI data was normalized and resampled in 2 ⨯ 2 ⨯ 2 mm³ voxels. Finally, the functional images were smoothed with an isotropic 6mm full-width half-maximum Gaussian kernel. After these preprocessing steps, the scrubbing procedure [16] was performed to exclude any volume (i.e., functional image) with excessive head motions, based on the frame-to-frame relative changes in time series data.

Calculation of functional connectivity

For each individual, the time course of fMRI data was extracted for each of 137 regions of interest (ROIs), anatomically defined in the Brainvisa Sulci Atlas (BSA; http://brainvisa.Info) [6, 16] covering the entire cerebral cortex without a cerebellum. After applying a band-pass filter (0.008– 0.1 Hz), the following nine parameters were linearly regressed out: the six head motion parameters from realignment; the temporal fluctuation of the white matter; that of the cerebrospinal fluid; and that of the entire brain. Pair-wise Pearson correlations between 137 ROIs were calculated to obtain a matrix of 9,316 functional connectivities for each participant.

Data standardization, removal of nuisance variables, and feature selections

For functional connectivity data, we applied the Fisher z-transformation to each correlation coefficient. We then regressed them on training subjects’ ages, sex, and dummy variables for each site. The resulting residuals of the regression are used for the subsequent procedures as the features controlling for age, sex, and site effects. We also used the regression coefficients to remove nuisance variables such as age, sex, and site from test subjects. Features like functional connectivities were high-dimensional. We then used a rank-sum test for dimensional reduction; we used the threshold of 0.05 unless noted. The procedures are summarized in Figure 1.

Figure 1. The procedure of the removal of site bias and feature selection.

LOOCV

Use one subject as test data and the remaining subjects as training data, and repeat for all subjects LOSOCV

Use subjects from one site as test data and subjects from the remaining 3 sites as training data, and repeat for all subjects

Independent site test

Use subjects from four sites as training data, and use subjects from the independent site as test data. In some cases, a part of data was used for the removal of nuisance variables

Classification algorithms

We then applied machine learning classification using the above features. We used ensemble learning (bagging and AdaBoost), support vector machine (SVM), or sparse logistic regression (SLR) [17] to classify depressed patients and healthy controls, or melancholic patients and healthy controls. Ensemble learning is a method to make a classifier by combining weak classifiers. While bagging makes the decision based on voting by parallel weak classifiers, AdaBoost combines weak classifiers sequentially. The hyperparameters adjusting tree depth (minimum leaf size and the maximum number of splits), and the other hyperparameters such as the number of ensemble learning cycles, and learning rates were optimized using Bayesian optimization described in the following section. Ensemble learning has advantages in dealing with small sample size, and high-dimensionality [18]. SVM is a widely used classifier and performs classification by finding the hyperplane that differentiates the two classes. Types of the kernel and penalty parameter were determined by the optimization in the following section. SLR is a Bayesian extension of logistic regression in which a sparseness prior is imposed on the logistic regression. SLR has the ability to select features objectively that are related to classifying depressive disorders [2, 7, 17].

Evaluation and estimation of the hyperparameters of Classifiers

The performance of the classification was evaluated using Leave-One-Subject-Out cross-validation, Leave-One-Site-Out cross-validation within discovery cohort or independent dataset from a replication cohort.

For Leave-One-Subject-Out cross-validation, one subject was used for the test and the rest subjects were used for the training and this process was repeated until all subjects were used as a test data. The hyperparameters of the AdaBoost, Bagging, and SVM were optimized with Bayesian optimization using Gaussian process [19].

For the optimization, the whole data was divided into 9-folds keeping an equal amount of (diagnosis, gender, and site) combinations per fold. The optimization was performed using the 8-folds with 8-folds cross-validation (inner 7-folds are used for the training and the remaining inner one-fold was used for the validation). The test data was selected in the outer 1-fold. The rest from outer 1-fold and the outer 8-folds were used for the training of the classifier [7].

For Leave-One-Site-Out cross-validation, subjects from three out of the four sites were used as the training data and subjects from the remaining sites were used as test data. In the case that independent dataset was used as a test data, all subjects from the four sites were used for training. For both cases, the optimization of the hyperparameters was done using 8-folds cross-validation within the training data.

Results

We trained classifier of the major depressive disorder using functional connectivity as the features. Because of the high dimensionality of functional connectivity, we selected the features using a rank-sum test with only the training data. Figure 2a shows the classification performance of AdaBoost, Bagging, SLR, and SVM with 0.05 threshold of the rank-sum test. while SVM shows the best classification accuracy, SLR did not show good accuracy. Then we investigated the classification performance such as accuracy, sensitivity (the number of subjects classified as patients divided by the number of patients) and specificity (the number of subjects classified as healthy controls divided by the number of healthy controls) and effect of threshold for the feature selection focusing on the SVM (Figure 2b). The classification performance of SVM was 73.3 % accuracy, 74.3 % sensitivity, and 72.3 % specificity with 0.05 threshold of the rank-sum test. The performance was not different even as we changed the threshold (Figure 2b).

Figure 2. The performance of the classification of depressed patients.

(a) The SVM shows an accuracy of 73.3%, sensitivity of 74.3%, and specificity of 72.3%.

(b) The classification performance with different threshold of feature selection.

We then did a permutation test in order to confirm that the performance was significant. The diagnostic labels were randomly permuted and the performance of the classification was evaluated. The results showed that the classification accuracy was significantly higher than the change level for each algorithm (Figure 3).

Figure 3. The permutation test.

The diagnostic labels were randomly permuted for each subject and classification algorithms were applied, such as (a) random forest, (b) AdaBoost, (c) SLR, (d) SVM. The procedure was repeated 500 times.

In order to investigate the effects of site bias on the classification, the evaluation of classification was performed by leave-one-site-out cross-validation. The classification accuracies varied widely depending on the site (Table 2). Especially, the sensitivity and specificity were highly dependent on the site, which indicated that the classification was based on site bias rather than the feature associated with depression.

Depression is heterogeneous. This can be the reason why the classification performance was affected by the site rather than the diagnostic label. We then focused on melancholic depression that is a subtype of major depressive disorder with biological homogeneity [20-22]. The classification performance between melancholic patients and healthy controls was slightly higher than the one between major depressed patients, including non-melancholic patients and healthy controls (Figure 4). The classifier classified the patients with non-melancholic depression with an accuracy of about 75.3 %. The tendency of the accuracy depending on the algorithms was the same as in the case of the patients with major depressive disorder vs healthy controls. That is, SVM and ensemble learning showed good performance, whereas SLR did not. Moreover, the performance was evaluated by leave-one-site-out cross-validation. The classification accuracies varied widely depending on the site even without non-melancholic patients.

Figure 4. The performance of classification of melancholic patients and healthy controls

(a) Accuracy, sensitivity, and specificity of the random forest by LOOCV. The rightmost bar is the accuracy rate that the melancholic classifier correctly classifies non-melancholic patients as depressed patients.

(b) LOOCV performance of melancholic patients’ classification using other classification algorithms.

(c) Leave-one-site-out CV performance of melancholic patients’ classification. Area under the ROC Curve for Test Sites 1, 2, 3, and 4 are 0.66, 0.5, 0.82, and 0.79, respectively.

We then used subjects from the independent site (Figure 5). We trained the classifier using subjects from four sites and tested the classifier using subjects from the independent site. The performance shows the classification was based on site bias rather than the diagnostic label (Figure 5a). The decision boundary between patients and healthy controls obtained using training data is different from the boundary between patients and healthy controls of the test data. In order to overcome this problem, we used a part of the healthy controls from the independent site for regressing out the site, ages, and sex information, and the remaining healthy controls and depressed patients were used as test data. Note that the number of healthy controls and depressed patients used for the test data were set to be equal. Although the accuracy was 54.7%, this ingenuity reduced site bias (Figure 5b). Furthermore, when we focused on melancholic patients, the performance had drastically much improved (Figures 5d-f) and the accuracy was 71.9 %.

Figure 5. The improvement of the classification performance using the independent site.

(a) All subjects from independent site data were used as test data.

(b) The performance of the classification after part of healthy subjects (not used as either the training or test data) are used for the regressing out. Because there are many combinations from which to choose healthy controls used for the regressing out, we randomly chose healthy controls for the regressing out, which procedure was repeating 100 times. The error bar indicated the standard deviation.

(d-f) The same as above, but used only melancholic patients and healthy controls.

Discussion

In this study, we proposed a method to reduce the site bias effect and improve the classification accuracy of patients with the major depressive disorder for an independent test dataset. The most effective improvement was achieved by the calibration using a part of the independent dataset. Specifically, a part of healthy controls was used for regressing out site bias. Furthermore, when we focused on the classification between melancholic depressed patients and healthy controls, the effect of the calibration was further improved. In this case, SVM showed the best performance.

Generalization for different sites

There are some studies trying to overcome site bias. Some studies used independent component analysis or sparse canonical component analysis to obtain site-independent brain activity [23]. These methods are time-consuming and it is possible to lose important features that characterize patients with depression. On the other hand, regressing out is a simple way to reduce site bias and retains all features.

In our study, SVM showed good classification performance for independent site data. Depressive disorder is known to be heterogeneous. The melancholic depressive disorder is a typical and severe type of disorder. This would be a reason why the classification performance is better when we focus on melancholic patients. Targeting other types of depressive disorders would be future work. There are some studies about the subtyping the depressive disorder by using biological markers including functional connectivity [1, 6]. If a new subtype of depressive disorder that showed characteristic neural behavior is found, the diagnosis for this subtype using machine learning would be improved.

For clinical diagnosis

When we consider an application for clinical diagnosis, site bias is a large problem. If traveling subjects, the same persons visit multiple sites, are available, the calibration would be more effective and better performance of the diagnosis of patients with major depression would be expected [2]. Even if traveling subjects are not available, our proposed procedure would be helpful. Site bias should be calibrated using data from healthy subjects before clinical diagnosis, using the fMRI scanner used for the clinical diagnosis, which would be a more practical way for an application at a novel site.

Author Contributions Statement

T.N. and J.Y. carried out statistical analysis and gave interpretations. M.T., N.I., G.O., Y.O., M.Y., S.T., and S.Y. designed a study, collected functional MRI data. T.N and J.Y. wrote the main manuscript text and prepared the figures with input from all authors.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Contribution to the Field Statement

An objective and accurate diagnosis method of depression is highly demanded to avoid misdiagnosis and suboptimal treatment outcomes. For the realization, resting-state functional magnetic resonance imaging (fMRI) combined with machine learning has recently attracted great attention. However, the potential measurement bias of fMRI derived from heterogeneous scanners and protocols makes it difficult to apply the classification method of depressive patients with depression at wide-spread hospitals. Therefore, even if a good classifier of the patients is created within a specific facility, the performance is deteriorated at other facilities. Here, we propose a method to overcome the problem by correcting the bias in advance using healthy persons in the new facility. In addition, by focusing only on the melancholy type, the classification performance at new facilities was improved. Our method is easy to implement at any facility, contributing to the diagnosis of depression at wide-spread hospitals.

Acknowledgments

This research was supported by AMED under Grant Number JP19dm0107096, and Research on Medical ICT and Artificial Intelligence (H29-ICT-General-010), Health, Labour and Welfare Sciences Research Grants, Ministry of Health, Labour and Welfare Japan.

References

1.↵
Drysdale AT, Grosenick L, Downar J, Dunlop K, Mansouri F, Meng Y, et al. Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nat Med. 2017 Jan;23(1):28–38.
OpenUrl CrossRef PubMed Google Scholar
2.↵
Yamashita A, Yahata N, Itahashi T, Lisi G, Yamada T, Ichikawa N, Takamura M, Yoshihara Y, Kunimatsu A, Okada N, et al. Harmonization of resting-state functional MRI data across multiple imaging sites via the separation of site differences into sampling bias and measurement bias. Plos Biol. 2019;17:e3000042.
OpenUrl CrossRef Google Scholar
3.
Ichikawa N, Lisi G, Yahata N, Okada G, Takamura M, Yamada M, Suhara T, Hashimoto R-I, Yamada T, Yoshihara Y, et al. Identifying melancholic depression biomarker using whole-brain functional connectivity. arXiv 2017
Google Scholar
4.
Yoshida K, Shimizu Y, Yoshimoto J, Takamura M, Okada G, Okamoto Y, et al. Prediction of clinical depression scores and detection of changes in whole-brain using resting-state functional MRI data with partial least squares regression. PLoS ONE. 2017;12(7):e0179638.
OpenUrl Google Scholar
5.
Takagi Y, Sakai Y, Lisi G, Yahata N, Abe Y, Nishida S, et al. A Neural Marker of Obsessive-Compulsive Disorder from Whole-Brain Functional Connectivity. Sci Rep. Nature Publishing Group; 2017 Aug 8;7(1):7538.
OpenUrl Google Scholar
6.↵
Rivière D, Mangin J-F, Papadopoulos-Orfanos D, Martinez J-M, Frouin V, Régis J. Automatic recognition of cortical sulci of the human brain using a congregation of neural networks. Med Image Anal. 2002 Jun;6(2):77–92.
OpenUrl CrossRef PubMed Web of Science Google Scholar
7.↵
Yahata N, Morimoto J, Hashimoto R, Lisi G, Shibata K, Kawakubo Y, et al. A small number of abnormal brain connections predicts adult autism spectrum disorder. Nat Commun. 2016 Apr 14;7:11254.
OpenUrl CrossRef PubMed Google Scholar
8.
1. Breakspear M
Shimizu Y, Yoshimoto J, Toki S, Takamura M, Yoshimura S, Okamoto Y, et al. Toward Probabilistic Diagnosis and Understanding of Depression Based on Functional MRI Data Analysis with Logistic Group LASSO. Breakspear M, editor. PLoS ONE. Public Library of Science; 2015;10(5):e0123524.
OpenUrl Google Scholar
9.
Yoshida K, Yoshimoto J, Doya K. Sparse kernel canonical correlation analysis for discovery of nonlinear interactions in high-dimensional data. BMC Bioinformatics. 2017 Feb 14;18(1):108.
OpenUrl Google Scholar
10.
Chen C-H, Ridler K, Suckling J, Williams S, Fu CHY, Merlo-Pich E, et al. Brain imaging correlates of depressive symptom severity and predictors of symptom improvement after antidepressant treatment. Biol Psychiatry. 2007 Sep 1;62(5):407–14.
OpenUrl CrossRef PubMed Web of Science Google Scholar
11.↵
Greicius MD, Flores BH, Menon V, Glover GH, Solvason HB, Kenna H, et al. Resting-state functional connectivity in major depression: abnormally increased contributions from subgenual cingulate cortex and thalamus. Biol Psychiatry. 2007 Sep 1;62(5):429–37.
OpenUrl CrossRef PubMed Web of Science Google Scholar
12.↵
Noble S, Scheinost D, Finn ES, Shen X, Papademetris X, McEwen SC, et al. Multisite reliability of MR-based functional connectivity. Neuroimage. 2017 Feb 1;146:959–70.
OpenUrl Google Scholar
13.↵
Otsubo T, Tanaka K, Koda R, Shinoda J, Sano N, Tanaka S, et al. Reliability and validity of Japanese version of the Mini-International Neuropsychiatric Interview. Psychiatry Clin Neurosci. 3rd ed. Wiley/Blackwell (10.1111); 2005 Oct;59(5):517–26.
OpenUrl Google Scholar
14.↵
Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. 1998;59 Suppl 20:22–33–quiz34–57.
OpenUrl CrossRef PubMed Google Scholar
15.↵
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. American Psychiatric Publishing, 2013.
Google Scholar
16.↵
Perrot M, Rivière D, Mangin J-F. Cortical sulci recognition and spatial normalization. Med Image Anal. 2011 Aug;15(4):529–50.
OpenUrl CrossRef PubMed Web of Science Google Scholar
17.↵
Yamashita O, Sato M-A, Yoshioka T, Tong F, Kamitani Y. Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns. Neuroimage. 2008 Oct 1;42(4):1414–29.
OpenUrl CrossRef PubMed Web of Science Google Scholar
18.↵
Yang P, Hwa Yang y, B Zhou B, Y Zomaya A. A Review of Ensemble Methods in Bioinformatics. CBIO. 2010 Dec 1;5(4):296–308.
OpenUrl Google Scholar
19.↵
Bishop, Christopher M. Pattern recognition and machine learning. springer, 2006.
Google Scholar
20.↵
Kendler KS. The diagnostic validity of melancholic major depression in a population-based sample of female twins. Arch Gen Psychiatry. 1997 Apr;54(4):299–304.
OpenUrl CrossRef PubMed Web of Science Google Scholar
21.
Parker G, McClure G, Paterson A. Melancholia and catatonia: disorders or specifiers? Curr Psychiatry Rep. Springer US; 2015 Jan;17(1):536.
OpenUrl Google Scholar
22.↵
Sun N, Li Y, Cai Y, Chen J, Shen Y, Sun J, et al. A comparison of melancholic and nonmelancholic recurrent major depression in Han Chinese women. Depress Anxiety. Wiley-Blackwell; 2012 Jan;29(1):4–9.
OpenUrl Google Scholar
23.↵
Feis RA, Smith SM, Filippini N, Douaud G, Dopper EGP, Heise V, et al. ICA-based artifact removal diminishes scan site differences in multi-center resting-state fMRI. Frontiers in neuroscience. 2015;9:395.
OpenUrl Google Scholar

Posted August 25, 2019.

Download PDF

Author Declarations

Data/Code

Citation Tools

Get QR code

Tweet Widget

Subject Area

Psychiatry and Clinical Psychology

Reviews and Context

Comment

TRIP Peer Reviews

Community Reviews

Automated Services

Blogs/Media

Author Videos

Subject Areas

All Articles

Addiction Medicine (419)
Allergy and Immunology (741)
Anesthesia (217)
Cardiovascular Medicine (3192)
Dentistry and Oral Medicine (355)
Dermatology (270)
Emergency Medicine (472)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1136)
Epidemiology (13179)
Forensic Medicine (19)
Gastroenterology (883)
Genetic and Genomic Medicine (5008)
Geriatric Medicine (467)
Health Economics (767)
Health Informatics (3153)
Health Policy (1118)
Health Systems and Quality Improvement (1160)
Hematology (418)
HIV/AIDS (992)
Infectious Diseases (except HIV/AIDS) (14480)
Intensive Care and Critical Care Medicine (900)
Medical Education (466)
Medical Ethics (122)
Nephrology (512)
Neurology (4761)
Nursing (253)
Nutrition (706)
Obstetrics and Gynecology (864)
Occupational and Environmental Health (775)
Oncology (2449)
Ophthalmology (696)
Orthopedics (273)
Otolaryngology (335)
Pain Medicine (317)
Palliative Medicine (89)
Pathology (525)
Pediatrics (1270)
Pharmacology and Therapeutics (538)
Primary Care Research (541)
Psychiatry and Clinical Psychology (4088)
Public and Global Health (7323)
Radiology and Imaging (1643)
Rehabilitation Medicine and Physical Therapy (978)
Respiratory Medicine (959)
Rheumatology (469)
Sexual and Reproductive Health (486)
Sports Medicine (413)
Surgery (530)
Toxicology (68)
Transplantation (227)
Urology (197)

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Drysdale AT, Grosenick L, Downar J, Dunlop K, Mansouri F, Meng Y, et al. Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nat Med. 2017 Jan;23(1):28–38.
OpenUrl CrossRef PubMed Google Scholar

[2] 2.↵
Yamashita A, Yahata N, Itahashi T, Lisi G, Yamada T, Ichikawa N, Takamura M, Yoshihara Y, Kunimatsu A, Okada N, et al. Harmonization of resting-state functional MRI data across multiple imaging sites via the separation of site differences into sampling bias and measurement bias. Plos Biol. 2019;17:e3000042.
OpenUrl CrossRef Google Scholar

[3] 3.
Ichikawa N, Lisi G, Yahata N, Okada G, Takamura M, Yamada M, Suhara T, Hashimoto R-I, Yamada T, Yoshihara Y, et al. Identifying melancholic depression biomarker using whole-brain functional connectivity. arXiv 2017
Google Scholar

[4] 4.
Yoshida K, Shimizu Y, Yoshimoto J, Takamura M, Okada G, Okamoto Y, et al. Prediction of clinical depression scores and detection of changes in whole-brain using resting-state functional MRI data with partial least squares regression. PLoS ONE. 2017;12(7):e0179638.
OpenUrl Google Scholar

[5] 5.
Takagi Y, Sakai Y, Lisi G, Yahata N, Abe Y, Nishida S, et al. A Neural Marker of Obsessive-Compulsive Disorder from Whole-Brain Functional Connectivity. Sci Rep. Nature Publishing Group; 2017 Aug 8;7(1):7538.
OpenUrl Google Scholar

[6] 6.↵
Rivière D, Mangin J-F, Papadopoulos-Orfanos D, Martinez J-M, Frouin V, Régis J. Automatic recognition of cortical sulci of the human brain using a congregation of neural networks. Med Image Anal. 2002 Jun;6(2):77–92.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[7] 7.↵
Yahata N, Morimoto J, Hashimoto R, Lisi G, Shibata K, Kawakubo Y, et al. A small number of abnormal brain connections predicts adult autism spectrum disorder. Nat Commun. 2016 Apr 14;7:11254.
OpenUrl CrossRef PubMed Google Scholar

[8] 8.
Breakspear M
Shimizu Y, Yoshimoto J, Toki S, Takamura M, Yoshimura S, Okamoto Y, et al. Toward Probabilistic Diagnosis and Understanding of Depression Based on Functional MRI Data Analysis with Logistic Group LASSO. Breakspear M, editor. PLoS ONE. Public Library of Science; 2015;10(5):e0123524.
OpenUrl Google Scholar

[9] Breakspear M

[10] 9.
Yoshida K, Yoshimoto J, Doya K. Sparse kernel canonical correlation analysis for discovery of nonlinear interactions in high-dimensional data. BMC Bioinformatics. 2017 Feb 14;18(1):108.
OpenUrl Google Scholar

[11] 10.
Chen C-H, Ridler K, Suckling J, Williams S, Fu CHY, Merlo-Pich E, et al. Brain imaging correlates of depressive symptom severity and predictors of symptom improvement after antidepressant treatment. Biol Psychiatry. 2007 Sep 1;62(5):407–14.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[12] 11.↵
Greicius MD, Flores BH, Menon V, Glover GH, Solvason HB, Kenna H, et al. Resting-state functional connectivity in major depression: abnormally increased contributions from subgenual cingulate cortex and thalamus. Biol Psychiatry. 2007 Sep 1;62(5):429–37.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[13] 12.↵
Noble S, Scheinost D, Finn ES, Shen X, Papademetris X, McEwen SC, et al. Multisite reliability of MR-based functional connectivity. Neuroimage. 2017 Feb 1;146:959–70.
OpenUrl Google Scholar

[14] 13.↵
Otsubo T, Tanaka K, Koda R, Shinoda J, Sano N, Tanaka S, et al. Reliability and validity of Japanese version of the Mini-International Neuropsychiatric Interview. Psychiatry Clin Neurosci. 3rd ed. Wiley/Blackwell (10.1111); 2005 Oct;59(5):517–26.
OpenUrl Google Scholar

[15] 14.↵
Sheehan DV, Lecrubier Y, Sheehan KH, Amorim P, Janavs J, Weiller E, et al. The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J Clin Psychiatry. 1998;59 Suppl 20:22–33–quiz34–57.
OpenUrl CrossRef PubMed Google Scholar

[16] 15.↵
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders. American Psychiatric Publishing, 2013.
Google Scholar

[17] 16.↵
Perrot M, Rivière D, Mangin J-F. Cortical sulci recognition and spatial normalization. Med Image Anal. 2011 Aug;15(4):529–50.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[18] 17.↵
Yamashita O, Sato M-A, Yoshioka T, Tong F, Kamitani Y. Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns. Neuroimage. 2008 Oct 1;42(4):1414–29.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[19] 18.↵
Yang P, Hwa Yang y, B Zhou B, Y Zomaya A. A Review of Ensemble Methods in Bioinformatics. CBIO. 2010 Dec 1;5(4):296–308.
OpenUrl Google Scholar

[20] 19.↵
Bishop, Christopher M. Pattern recognition and machine learning. springer, 2006.
Google Scholar

[21] 20.↵
Kendler KS. The diagnostic validity of melancholic major depression in a population-based sample of female twins. Arch Gen Psychiatry. 1997 Apr;54(4):299–304.
OpenUrl CrossRef PubMed Web of Science Google Scholar

[22] 21.
Parker G, McClure G, Paterson A. Melancholia and catatonia: disorders or specifiers? Curr Psychiatry Rep. Springer US; 2015 Jan;17(1):536.
OpenUrl Google Scholar

[23] 22.↵
Sun N, Li Y, Cai Y, Chen J, Shen Y, Sun J, et al. A comparison of melancholic and nonmelancholic recurrent major depression in Han Chinese women. Depress Anxiety. Wiley-Blackwell; 2012 Jan;29(1):4–9.
OpenUrl Google Scholar

[24] 23.↵
Feis RA, Smith SM, Filippini N, Douaud G, Dopper EGP, Heise V, et al. ICA-based artifact removal diminishes scan site differences in multi-center resting-state fMRI. Frontiers in neuroscience. 2015;9:395.
OpenUrl Google Scholar

Enhancing multi-center generalization of machine learning-based depression diagnosis from resting-state fMRI

Abstract

Introduction