Mild cognitive impairment is not a valid concept and should be replaced by a more restrictive, biologically validated class, named Mild Cognitive Dysfunctions (MCD): a nomothetic network approach.

Background: No studies have examined whether interactions between the apolipoprotein E4 (ApoE4) allele and peripheral biomarkers, hypertension, and type 2 diabetes mellitus (T2DM) may impact the neurocognitive, behavioral and social dysfunctions in amnestic mild cognitive impairment (aMCI) and Alzheimer disease (AD). Aims: To clinically define and biologically validate a subgroup of aMCI subjects that take up an intermediate position between controls and AD patients. Methods: In 61 healthy controls, 60 subjects with aMCI, and 60 AD patients we measured the features of aMCI/AD using the Consortium to Establish a Registry for Alzheimer s Disease (CERAD). A composite BIORISK score was computed using the ApoE4 allele, serum folate, albumin, white blood cells, fasting blood glucose (FBG), atherogenic index of plasma (AIP), T2DM and hypertension. Results: Clustering and nearest neighbour analyses were unable to validate the aMCI subgroup. We constructed two z unit-based composite scores, the first indicating overall burden of cognitive, social, and behavioural deterioration (OBD), and a second reflecting the interactions between ApoE4, all other biomarkers, hypertension and T2DM (BIORISK). We found that 40.2 % of the variance in the OBD score was explained by BIORISK, ApoE4, age and education. The OBD index was used to construct three subgroups (normal, medium, and high OBD) with the medium group (n=45) showing mild cognitive dysfunctions (MCD) in memory, language, orientation, and ADL. People with MCD show OBD and BIORISK scores that are significantly different from controls and AD. Conclusions: Petersen s aMCI criteria cannot be validated and should be replaced by the more restrictive, biologically validated MCD class.

Introduction genotype with peripheral biomarkers predict greater impairments in semantic and episodic memory and recall, suggesting that such interactions may play a role in the pathophysiology of AD [13]. For example, interactions between the presence of the E4 allele and fasting blood glucose (FBG) and albumin and cumulative effects of the E4 allele with folic acid, glucose, albumin, and the atherogenic index of plasma (AIP) may increase cognitive deficits in memory and naming and the symptomatic burden in AD [6,13]. Moreover, both hypertension and T2DM may increase risk of AD through immune and oxidative stress associated mechanisms [14][15][16][17]. This is important as these peripheral biomarkers, hypertension, and T2DM may increase inflammatory and oxidative pathways thereby aggravating the detrimental effects of the ApoE4 genotypes [6,13]. Nevertheless, no studies have examined whether interactions between those biomarkers, hypertension and T2DM may impact the neurocognitive as well as behavioral and social dysfunctions in AD.
Another unresolved issue is whether similar interaction patterns between the E4 allele and peripheral biomarkers, hypertension, and T2DM may be observed in amnestic mild cognitive impairment (aMCI). Such associations would support that aMCI is a transition stage between normal aging and AD. aMCI is defined as a decline in memory beyond and above that expected by age and the absence of dementia symptoms and dysfunctions in ADL [18]. In the classic classification of aMCI, two subtypes were described, namely singledomain aMCI, with isolated impairments in episodic memory, and multiple-domain aMCI with impairments in episodic memory and one or more other cognitive domains [19].
Individuals with aMCI show an elevated risk to develop AD with a yearly conversion rate from aMCI to AD between 14% and16%, although some aMCI individuals (8%) may return to the normal state [20]. Interestingly, the ApoE4 allele coupled with an interaction between ApoE4 x FBG is associated with memory deficits in people with aMCI [13].
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06. 20.21259193 doi: medRxiv preprint Participants. This is a cross-sectional study which included people with AD and aMCI and normal controls, aged 55-90 years and of both sexes. Patients were recruited at the Dementia Clinic, Outpatient Department, King Chulalongkorn Memorial Hospital, Bangkok, Thailand. Healthy volunteers were recruited from the same catchment area as the patients, namely Patumwan district, Bangkok province. The normal controls were community senior club members, senior Red Cross volunteers, healthy individuals who visited the Health Check Up Clinic, and normal elderly caregivers of the AD patients who visited the Dementia Clinic. We excluded patient and controls with a) abnormal VDRL, HIV, vitamin B12, and thyroid function blood tests; b) other dementia syndromes, including frontotemporal lobe dementia and vascular dementia; c) neurologic disorders, including Parkinson's disease, stroke, multiple sclerosis, encephalitis, meningitis, and traumatic brain injury; d) major psychiatric disorders including major depressive disorder, bipolar disorder, schizophrenia, substance use disorders, and anxiety disorders; e) (auto)immune disorders, including rheumatoid arthritis, chronic obstructive pulmonary disease, systemic lupus erythematosus, inflammatory bowel disease, chronic kidney disease, cancer, diabetes type 1, and severe heart disease (functional class II or more). Furthermore, controls and patients were excluded when the score on the Thai Geriatric Depression Scale was > 13 in order to exclude people with a recent depression [7].
We conducted magnetic resonance imaging of the brain in the AD patients to rule out vascular dementia and brain tumors.
Finally, all participants were allocated into three study samples, i.e. 61 healthy controls, 60 subjects with aMCI, and 60 AD patients. All participants and guardians of aMCI and AD individuals gave written informed consent prior to participation in this study. The study was conducted according to Thai and international ethics and privacy laws. Approval for the study was obtained from the Institutional Review Board of the Faculty of Medicine, . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The neuropsychologist was blinded from the screening data of the physicians and the latter were blinded from the assessment results of the neuropsychologist.
We made the diagnosis of AD using the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer's Disease and Related Disorders Association (NINCDS-ADRDA) diagnostic criteria [27]. Moreover, other inclusion criteria were a TMSE score between 10 and 23, and a CDR score of 1 and 2. aMCI patients were included if they showed subjective memory complaints and when they complied with Peterson's criteria [18]. Subjective memory complaints were assessed using the question do you feel that your memory had become worse? Objective memory complaints were established with a CDR score of 0.5 and a CDR memory component score of 0.5. aMCI patients were included when the TMSE score was > 23 and when they were not diagnosed with dementia according to the NINCDS-ADRDA) [27]. The healthy controls did not complain of subjective memory and they showed a TMSE score > 23 and CDR = 0. The . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint diagnoses were discussed among two physicians for agreement and in case of disagreement, a third opinion from another psychiatrist or neurologist was requestd.
The same day, a senior neuropsychologist completed the CERAD Neuropsychological Assessment Battery (CERAD-NP) [7,29] in a Thai, validated translation. In this study we used: the Verbal Fluency Test (VFT) to assess semantic memory and cognitive flexibility; the Modified Boston Naming Test (BNT) to assess confrontational word retrieval; the Word List Memory (WLM) to assess episodic memory and learning ability for new verbal information and immediate working memory; WL recall, Delayed, true recall (WLRecall) to probe verbal episodic memory and the ability to recall; the WL Recognition test to assess verbal episodic memory-discriminability or verbal learning recall recognition; and the Constructional Praxis and recall tests to probe visuoconstructive abilities and later task recall. Moreover, we assessed a) C1 or the clinical history items, including memory, language, personality and behavior, orientation for time and place, ADL, social activities, judgment and problem solving, and other cognitive problems; b) C2 or the ADL Blessed Dementia Scale, part a (BDS); C3 or the Behavior Rating Scale for Dementia (BRSD) including depressive features, defective self-regulation, irritability/agitation, vegetative features, inertia/apathy, and psychotic features; C4 or the Short Blessed test (orientation-memory-concentration); and C5 or calculation, clock and expressive language (CCL).

APOE Genotyping
As described previously [7], we extracted genomic DNA from peripheral blood leukocytes by standard procedures with a DNA Mini Kit (QIAGEN GmbH, Hilden, Germany). Consequently, "DNA was amplified by using two primers, 5'-ACAGAATTCGCCCCGGCCTGGTACACAC-3' and 5'-. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 21, 2021.

Other biomarkers
Fasting blood was sampled between 8.00 and 8.30 a.m. We used 3 mL clotted blood (serum), which was centrifuged at 1,000 g for 5 minutes, to assay biomarkers at the Central Laboratory, Department of Laboratory Medicine, King Chulalongkorn Memorial Hospital, Bangkok, Thailand. As explained previously [6,13], we used the Architect C8000 (Abbott Laboratories, Abbott Park, Illinois, USA) to measure the biomarkers. Plasma glucose was measured using A Hexakinase/ G-6-PDH technique (inter-assay coefficients of variability (CV) of 2.0%). Based on the lipid profile, we computed the AIP index as a z-unit weighted composite score, i.e. z-transformed triglyceride values (zTG)z high density lipoprotein chlolesterol (labeled as zAIP) [6]. In addition to albumin, the present study used total number of while blood cells (WBC) as another indicator of immune activation. Folate levels were measured using electrochemiluminescence immunoassay (ECLIA) using the Cobas 6.000 Analyzer (Roche, Germany).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted June 21, 2021.

Statistics
We used analyses of variance (ANOVA) to check differences in scale variables between study groups and analyses of contingence tables (Χ 2 -test) to check associations among categorical variables. We employed multivariate general linear model (GLM) analysis to check the associations between diagnostic classes and clinical and biomarker data while adjusting for age, sex and education. Tests for between-subject effects were used to check the univariate associations between the classes and clinical and biomarker data. Consequently, we computed GLM model-derived estimated marginal means (SE) after adjusting for age, sex and education. We used the protected least significant difference (LSD) to assess pair-wise differences among group means. False discovery rate p-correction was used to correct for multiple comparisons [30]. We used multiple regression analysis to assess the biomarkers that predict latent vector scores while allowing for the effects of age, sex and education. An automated stepwise method was employed with p-to-enter of 0.05 and a p-to-remove of 0.06.
Multivariate normality (Cook's distance and leverage), the R 2 changes, multicollinearity (using the variance inflation factor and tolerance), and homoscedasticity (tested with the White and modified Breusch-Pagan test) were always checked. Moreover, the regression analysis was performed on 5.000 bootstrap samples and the latter results are shown if the results are not concordant. Tests were 2-tailed and a p-value of 0.05 was used for statistical significance. Two-step cluster analysis was employed to define clusters of patients based on the cognitome and phenome features. Nearest neighbor analysis was employed to classify subjects based on their feature similarities. All statistical analyses were performed using IBM SPSS windows version 25.
Smart Partial Least Squares (SmartPLS)-SEM analysis [31] was used to assess the causal associations between ApoE4, age, sex, education, the cognitome and symptomatome . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint of aMCI and AD. We used a multi-step, multiple mediated PLS path model with ApoE4, age, sex and education as input variables and symptomatome data as output variables, while cognitome data mediated the effects of the input on the output variables.  Figure 1 shows the final PLS model. The symptomatome was structured in 4 different vectors, namely ADL+OR (BDSa, SBT score and C1 orientation), BEHAVIOR (the C3 BRSD items comprising depressive features, irritability/agitation, vegetative features, inertia/apathy, and defective self-regulation), MEM+LANG (comprising the C1 clinical history items memory and language), and SOCIAL (including the C1 clinical history items social activities, judgment and problem solving, and other cognitive problems). We also . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. We found that 87.7% of the variance in the diagnostic variable was explained by (in ascending order of importance) CERAD, ADL+OR, MEM+LANG, and BEHAVIOR, while CCL and SOCIAL did not have a significant impact. PLS showed that 44.9% of the variance in CERAD could be explained by ApoE4, age, and education, and that the same indicators explained 22.8% of the variance in CCL. Moreover, the effects of CERAD and CCL on the diagnostic spectrum was partially mediated by ADL+OR, BEHAVIOR, and SOCIAL. The specific indirect effects showed that most pathways from the input to the output variables were significant. For example, the effects of ApoE4, age and education on the diagnosis were mediated by CERAD, or CERAD → ADL+OR, or CERAD → BEHAVIOR, CERAD → MEM+LANG, but not CCL → ADL_OR. The significant specific indirect effects of sex on the diagnosis were mediated by ADL+OR. In addition, all possible total effects were . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Results of nearest neighbor analyses.
Two different nearest neighbor analyses were performed to classify subjects as controls, aMCI or AD patients. The first was conducted using memory scores only, namely WLM, WLRecall and C1 memory scores (3k, Euclidian distance, training sample of 70% and a holdout sample of 30%). The classification table showed that many aMCI cases were misclassified as controls in the training (35.7%) and holdout sample (45.4%) yielding a total accuracy of only 68.9% in the holdout sample. The second analysis was conducted using all cognitome and phenome latent vectors extracted by PLS scores (3k, Euclidian distance, training sample of 70% and a holdout sample of 30%) and this analysis showed 45.0% misclassifications in both the training and holdout samples with many aMCI subjects being allocated to the normal control class.

Construction of an overall burden of disease (OBD) score and associated subgroups.
We computed the latent variable scores of all cognitome, and phenome data obtained by PLS and computed an overall composite score indicating overall burden of disease (OBD) computed as: z score of (z CCL + z ADL+OR + z BEHAVIOR + z MEM+LANG + z SOCIALz CERAD). Consequently, using a visual binning method (based on inspection of the apparent modes and local minima of the frequency histogram and the results of the twostep cluster analysis described below), we divided the study group into three non-overlapping samples, namely normal, medium and high OBD (cutoff points were -0.53 and 0.4, respectively). Two-step cluster analysis performed on CCL, CERAD, ADL+OR, MEM_LANG, BEHAVIOR, SOCIAL, and OBD scores in the normal + medium OBD . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint groups retrieved 2 clusters (based on Akaike's Information criterion) with an adequate silhouette measure of cohesion and separation of 0.57. This cluster solution separated the medium OBD from the normal OBD class. Table 1 shows the association between this new OBD and cluster analysis classification and the classification into HC, aMCI and AD. There was a highly significant association between both classification systems (χ 2 =206.97, df=4, p<0.001), whereby group 1 comprised most controls (except 2) + 21 aMCI subjects (this group was labeled "normal OBD group"), group 2 consisted of 36 aMCI + 2 healthy controls + 7 AD subjects (labeled:" medium OBD group"), and group 3 comprised 53 AD + 3 aMCI subjects (labeled "high OBD group"). Table 2 shows the features of the 3 OBD groups formed. Patients with high OBD are somewhat older that the other groups and years of education was lower. There were no differences in the sex ratio or cardio-vascular disease frequency among the three study groups. The frequency of hypertension increased from controls → medium OBD → high OBD and that of type 2 diabetes mellitus (T2DM) was higher in both OBD groups than in controls. The frequency of ApoE4 was significantly higher in the high OBD group than in the normal OBD group.

Clinical and biological features of the OBD classes.
In order to assess the associations between these groups and the clinical and neuropsychological scores we used multivariate GLM analysis and adjusted for age, sex and education. There was a highly significant effect of the diagnostic classes (partial eta squared=0.629; F=41.01, df=14/340, p<0.001) and a very modest effect of education (partial eta squared=0.082; F=2.15, df=7/169, p=0.041), while sex and years of education did not have significant effects. Tests of parameter estimates shows that education was only associated with the CERAD score (inversely, p=0.005). The CCL, CERAD, ADL+OR and . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint MEM+LANG latent variable scores increased from normal OBD → medium OBD → high OBD. The BEHAVIOR and SOCIAL scores were significantly increased in the high OBD group as compared with the other two groups. These differences remained significant after FDR p-correction. Table 2 also shows the measurement of blood biomarkers in the three study groups. FBG was significantly higher in the high OBD group than in the normal and medium OBD group. Albumin and folate were significantly lower in the high OBD group than in the normal and medium OBD group, while WBC number and zAIP were significantly increased in the high OBD group as compared with the other two groups. FDR p-correction did not change these results.
We have also computed a z unit weighted composite score that comprises all biomarkers (zBiomarkers) as a z ApoE4 + z WBC + z FBG + z AIPz Alb -zFolate (labeled as: zBiomarkers). In addition, we computed another z unit-based composite score as: zBiomarkers + z hypertension + z T2DM (labeled as zBIORISK). Table 2 shows that both indices were significantly different between the three groups and increased from the normal OBD → medium OBD → high OBD group. The distance from the medium OBD to the normal OBD group was 0.480 SD and between the medium and high OBD the distance was 0.743 SDs. Finally, we calculated a composite score as zBIORISK + zOBD score. The distance from the medium OBD to the normal OBD group was 0.641 SDs and between the medium and high OBD the distance was 1.314 SDs indicating that the medium OBD group takes up an intermediate position between the normal and high OBD groups, but is much closer to the normal OBD group than to the high OBD group. Table 3 shows the intercorrelation matrix between the zBiomarker and zBIORISK factors and the clinical and neuropsychological scores. We found that these risk indices were . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint significantly associated with CCL, CERAD, ADL+OR, BEHAVIOR, MEM+LANG, and SOCIAL scores. Table 4 shows the results of multiple regression analysis with the OBD score as dependent variable and the biomarker indices as explanatory variables while allowing for the effects of age, sex and education. Regression #1 shows that 40.2% of the variance in OBD was explained by the regression on zBIORISK, age, ApoE4 (all positively) and education (inversely). Figure 2 shows the partial regression of the OBD score on the zBIORISK score. We have also examined the effects of the biomarker score on the OBD score in restricted samples, namely normal + medium OBD and medium + high OBD groups.
Regression #2 shows that in the restricted study sample of normal + medium OBD subjects, 19.5% of the variance in the OBD score could be explained by zBIORISK and age (both positively) and education (inversely). Regression #3 shows that in the medium + high OBD group, 17.0% of the variance in the OBD score could be explained by zBiomarkers (positively) and education (inversely).
Discussion aMCI cannot be validated as a class.
The first major finding of this study is that nearest neighbor (supervised learning) and two-step clustering (unsupervised learning) analysis were not able to confirm the existence of aMCI even when using memory scores, which defined aMCI. These results corroborate those of our previous report showing that using SIMCA, another supervised learning technique [7], the aMCI subgroup cannot even be modeled using neuropsychological memory tests, which are intended to describe this subgroup [18]. Moreover, using SIMCA, we found that many participants with aMCI were authenticated as controls or as AD patients. Thus, both unsupervised and supervised techniques show that aMCI according to Peterson's criteria is a heterogenous group and, consequently, does not exist as a distinct class. In fact, this is further . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Mild Cognitive Dysfunctions (MCD) as an intermediate class
The second major finding of this study is that we were able to compute a new overall burden of cognitive, social and behavioural deterioration (OBD) score, which is useful as a severity score and to delineate a reliable subgroup located between controls and AD patients. This OBD index combines the different cognitome (CCL and CERAD) and phenome (ADL+OR; BEHAVIOR, MEM+LANG, SOCIAL) domains of the dementia spectrum into one integrated index of overall burden of disease and, therefore, ranks subjects along a continuum from a normal condition to severe dementia. Using a visual binning method performed on the OBD scores, we divided the study sample into three classes, namely subjects with normal, high and medium OBD scores and validated the medium OBD class using two-step cluster analysis. This medium OBD group is less inclusive than aMCI and is characterized by impairments in CERAD, CCL, ADL+OR and MEM+LANG and, therefore, we propose to name this group Mild Cognitive Dysfunctions (MCD) including in memory, ADL, language and orientation. Peterson's criteria [18] on the other hand, stress the absence of ADL dysfunctions in aMCI. As such, the features of MCD differ from those of singledomain aMCI and of multiple-domain aMCI, which was thought to be characterized by deficits in episodic memory and one or more other cognitive domains [19]. Moreover, the MCD criteria do not correspond with those of mild behavioral impairment (MBI) which is . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint characterized by persistent behavioral symptoms in late life and is thought to constitute a risk for neurodegenerative disease [34,35]. Indeed, in our study the MCD group did not display any behavioral or social dysfunctions.

Biomarkers in MCD.
The third major finding of this study is that ApoE4 significantly predicted all cognitome (CCL and CERAD) and phenome (ADL+OR; BEHAVIOR, MEM+LANG, SOCIAL) domains of the dementia spectrum, and that the impact of the ApoE4 allele could be improved by constructing a new composite score reflecting the interactions between the ApoE4 allele, folate, FBG, albumin, WBC, and AIP, and comorbid illness including hypertension and T2DM. This biomarker score externally validated the continuous OBD score and the MCD class: a) people with MCD show higher biomarker scores than controls, and b) the transitions of controls to MCD and from the latter to AD are both associated with increasing biomarker scores. Phrased differently, an increased impact of interactions between factors which confer risk towards increased glucotoxicity (ApoE4 x FBG x T2DM), atherogenicity (ApoE4 x AIP x hypertension), inflammatory responses (ApoE4 x albumin x WBC), and oxidative stress (ApoE4 x folate x albumin) underpin both MCD and AD. Since the same biomarker score is associated with AD as well as with MCD, we may conclude that people with MCD show probably an increased risk to develop AD. It is known that individuals with aMCI display an increased risk to develop AD with a conversion rate from aMCI to AD of 14%-16.5% [20]. Future research should examine how many MCD subjects show the expected conversion rate to develop AD.

Limitations.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; The results of this study should be interpreted with regard to the limitations. It would have been more interesting if we had assayed brain imaging biomarkers including the connectome and neuro-immune biomarkers which are known to play a role in AD and in

Conclusions
Both supervised and unsupervised learning techniques show that aMCI is a heterogenous class and not a viable entity. In this study, we constructed two z unit-based composite scores: a first reflecting overall burden of cognitive, social and behavioural deterioration (OBD) and a second score reflecting the interactions between ApoE4 and other biomarker risk factors, hypertension and T2DM. The OBS score may be used to assess severity of OBD and AD and to classify MCD subjects. The latter show increased biomarker scores which significantly differ from controls and AD patients and, therefore, the MCD class may be at increased risk to develop dementia.

Declaration of interest
The authors have no financial conflict of interests.

Funding
There was no specific funding for this specific study.

Authorships.
All authors contributed significantly to the paper and approved the final version.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Data availability
The dataset generated during and/or analyzed during the current study will be available from the corresponding author upon reasonable request and once the dataset has been fully exploited by the authors. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted June 21, 2021. ; https://doi.org/10.1101/2021.06.20.21259193 doi: medRxiv preprint Table 1. Associations between the classes derived from cluster analysis and the diagnosis of amnestic mild cognitive impairment (aMCI) and Alzheimer's disease.

Classes
Normal OBD A Medium OBD B High OBD C Total   HC  59  2  0  61  aMCI  21  36  3  60  AD  0  7  53  60  Total  80  45  56  181 OBD: overall burden of disease classes (computed based on the results of cognitive, behavioral and social rating scores).
HC: heathy controls; aMCI: amnestic mild cognitive impairment, AD: Alzheimer's disease. Table 2. Socio-demographic, clinical and biomarker data of the participants binned into three groups based on the overall burden of disease (OBD) score and cluster analysis. All variables are shown as mean (SD) except *: marginal estimated mean (SE) after covarying for age, sex and education.

Variables
A,B,C pairwise comparisons among treatment means.
CVD: cardio-vascular disease; T2DM: type 2 diabetes mellitus; CCL: calculation, clock, expressive language; CERAD: a latent vector extracted from various CERAD scores including memory, naming and praxis; ADL+OR: activities of daily living and orientation; OBD: overall burden of cognitive, behavioral and social deterioration; FBG: fasting blood glucose; zAIP: atherogenic index of plasma.
zBiomarkers: z unit weighted composite score computed as z ApoE4 + z white blood cells + z FBG + z AIPz Albuminz Folate.
Social: C1 clinical items, including social activities, judgment and problem solving, and other cognitive problems. The OBD index was computed using cognitive, behavioural, and social symptoms of the CERAD. The BIORISK score was computed using the ApoE genotypes, serum folate, albumin, white blood cells, fasting blood glucose, atherogenic index of plasma, type 2 diabetes mellitus, and hypertension.