Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Short-Term Delayed Recall of Auditory Verbal Learning Test Is Equivalent to Long-Term Delayed Recall for Identifying Amnestic Mild Cognitive Impairment

  • Qianhua Zhao ,

    Contributed equally to this work with: Qianhua Zhao, Yingru Lv

    Affiliation Department of Neurology and Institute of Neurology, Huashan Hospital, Fudan University, Shanghai, China

  • Yingru Lv ,

    Contributed equally to this work with: Qianhua Zhao, Yingru Lv

    Affiliation Department of Radiology, Huashan Hospital, Fudan University, Shanghai, China

  • Yan Zhou,

    Affiliation Department of Neurology and Institute of Neurology, Huashan Hospital, Fudan University, Shanghai, China

  • Zhen Hong,

    Affiliation Department of Neurology and Institute of Neurology, Huashan Hospital, Fudan University, Shanghai, China

  • Qihao Guo

    dr.guoqihao@126.com

    Affiliation Department of Neurology and Institute of Neurology, Huashan Hospital, Fudan University, Shanghai, China

Abstract

Delayed recall of words in a verbal learning test is a sensitive measure for the diagnosis of amnestic mild cognitive impairment (aMCI) and early Alzheimer’s disease (AD). The relative validity of different retention intervals of delayed recall has not been well characterized. Using the Auditory Verbal Learning Test–Huashan version, we compared the differentiating value of short-term delayed recall (AVL-SR, that is, a 3- to 5-minute delay time) and long-term delayed recall (AVL-LR, that is, a 20-minute delay time) in distinguishing patients with aMCI (n = 897) and mild AD (n = 530) from the healthy elderly (n = 1215). In patients with aMCI, the correlation between AVL-SR and AVL-LR was very high (r = 0.94), and the difference between the two indicators was less than 0.5 points. There was no difference between AVL-SR and AVL-LR in the frequency of zero scores. In the receiver operating characteristic curves analysis, although the area under the curve (AUC) of AVL-SR and AVL-LR for diagnosing aMCI was significantly different, the cut-off scores of the two indicators were identical. In the subgroup of ages 80 to 89, the AUC of the two indicators showed no significant difference. Therefore, we concluded that AVL-SR could substitute for AVL-LR in identifying aMCI, especially for the oldest patients.

Introduction

Episodic memory impairment is the core feature of Alzheimer’s disease (AD) and amnestic mild cognitive impairment (aMCI). Accurate testing for episodic memory deficits is an essential part of detecting early cognitive impairment. Basing on a word list, the verbal learning test (VLT) is a popular method of episodic memory detection; the VLT’s index of “delayed recall” is considered the most sensitive measure for the early diagnosis of AD. It is also regarded as the best predictor of conversion from MCI to AD [1].

A variety of standardized VLTs, such as the Rey Auditory Verbal Learning Test [2] (RAVLT, 1958), California Verbal Learning Test [3][4] (CVLT, Delis 1989; CVLT-II, 2000), and Hopkins Verbal Learning Test [5] (HVLT-R, Brandt and Benedict, 2001), are commonly used for clinical diagnosis and disease monitoring [6]. Other VLTs [7], used as a subtest for a battery of tests (for example, ADAS-cog [8] and CERAD [9], [10]), are easier and more convenient. These tasks present with relatively fewer words (a 10-word list) and fewer learning trials (2 to 3) (Table 1).

thumbnail
Table 1. Summary of the profile of commonly used verbal memory tests or tests that involve verbal memory.

https://doi.org/10.1371/journal.pone.0051157.t001

Besides the difference in lengths of word lists and numbers of trials, these verbal memory tasks also differ in testing procedure. The most obvious difference is the retention interval of delayed recall: standardized VLTs usually require 20 to 30 minutes (i.e., long-term delayed recall), whereas other VLTs require less than 10 minutes, mostly 2 to 5 minutes (that is, short-term delayed recall). As such, although word lists with delayed recall are well-established paradigms, the retention interval for delayed recall remains controversial.

The Auditory Verbal Learning Test–Huashan version (AVLT-H) [11] adopts the rationale and methods of the CVLT and Hong Kong Verbal Learning Tests. It includes short-term delayed recall (AVL-SR, that is, a 5-minute delay time) and long-term delayed recall (AVL-LR, that is, a 20-minute delay time). It is proved to be acceptable to Mandarin speakers and is sensitive to detecting aMCI. Many screening tests, such as the Mini-Mental State Examination (MMSE), the Montreal Cognitive Assessment (MoCA), and Dem Tect, use short-term delayed recall to detect memory impairment, and these tests have achieved acceptable validity. Therefore, we hypothesized that the diagnostic value of short-term (SR) and long-term delayed recall (LR) for aMCI is similar. Further, because age is a significant factor that correlates with memory decline, we also examine whether age affects the discriminating ability of the AVL-SR and AVL-LR for diagnosing aMCI.

Methods

Participants

The AD and aMCI patients were recruited consecutively at the Memory Clinic of Huashan Hospital from 2005 to 2010. Cognitively normal controls (NC), were enrolled using cluster sampling from Jingansi Community, Shanghai, China in a normal-aging study. The common inclusion criteria for all participants were as follows: (1) aged 50 to 89 years old; (2) formal education not less than 6 years; (3) adequate visual and auditory acuity to allow cognitive testing; (4) absence of significant medical or neurological diseases and psychiatric disorders or psychotic features that could compromise cognition. In total, the sample consisted of 1215 NC subjects, 897 aMCI patients, and 530 mild AD patients.

Criteria for aMCI [12], in addition to those outlined above, included the following: (1) memory complaints and memory decline, which were verified by an informant; (2) symptoms lasting more than 3 months; (3) total score on MMSE-Chinese version (C-MMSE) [13] (Katzman, 1988) not less than the education-adjusted cut-off scores; (4) abnormal objective memory impairment documented by the scores falling 1.5 SD below the age- and education-specific norms on one of the following two memory tests: the delayed recall of the Logical Memory Test (LMT, the paragraph recall test from the Wechsler Memory Scale-Chinese revised) and the delayed recall of the Rey-Osterreich Complex Figure Test (RCFT) [14]; (5) preserved basic activities of daily living and minimal impairment in complex instrumental functions, assessed on the basis of patient and informant interviews and ratings on a Functional Activities Questionnaire (FAQ) [15]; (6) etiology unknown; (7) non-demented according to the criteria of the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) [16].

The additional inclusion criteria for NC included the following: (1) cognitively normal, with no memory complaints or memory difficulties, verified by an informant; (2) global Clinical Dementia Rating (CDR) = 0 [17].

Mild AD was diagnosed according to the NINCDS-ADRDA criteria [16] and the global CDR = 1 [17].

To determine the age effect of delayed recall, participants were further divided into four subgroups: 50–59 years, 60–69 years, 70–79 years, and 80–89 years.

Ethnic Issues

The study was approved by the Institutional Review Board of Huashan Hospital. Written informed consent was obtained from all the participants.

Procedures

Each subject had a uniform structured evaluation performed by a neurologist, which included a medical history inquiry and neurological examination. Blood tests included complete blood count, thyroid function tests, serum vitamin B12, and Venereal Disease Research Laboratories test. CT or MRI scans were performed for all the participants. A comprehensive neuropsychological battery including memory, language, attention, executive functioning, and visuospatial ability was administered. The tests were as follows: the C-MMSE [13], the LMT [18], the RCFT [19], the Boston Naming Test (the 30-item version) [20], [21], the Animal Verbal Fluency Test [22], the Symbol Digit Modalities Test [23], the Trail Making Test–A and B [24], the Stroop Color-Word Test [25], the Similarity Test [26], the Clock-drawing Test [27], the CDR [17], and the FAQ [15]. All these tests have been proved to have good reliability and validity in Chinese. The neuropsychological tests were performed by three highly trained raters (Y Zhou, YM Sun, and MR Chen). The diagnoses were kept blind to the raters.

thumbnail
Table 2. Demographic characteristics and AVLT-H indicators in age-specific diagnostic groups.

https://doi.org/10.1371/journal.pone.0051157.t002

AVLT-H

The word list is composed of 12 two-character words from three semantic categories (flowers, occupations, apparels) with four words for each category. The AVLT-H measures both recall and recognition of the word lists over a number of trials. Administration of the test begins by evaluating an individual’s ability to recall the 12-word list over three learning trials. A non-verbal test (copy part of RCFT) is then presented for interference with a 3–5-minute interval. The Short-term Delayed Free Recall follows. After a 20-minute delay during which a nonverbal task (including the Symbol Digit Modalities Test and the Trail Making Test) occurs, free recall is again tested (that is, Long-term Delayed Free Recall), as well as category-cued recall. Finally, a recognition test that includes the 12 target words and 12 distracter words (semantically related) is performed.

The test scores of the AVLT-H are as follows: (1) AVL-T, sum of all correct responses given in the first three consecutive trials; (2) AVL-SR, number of words responded correctly in the short-term delayed free recall; (3) AVL-LR, number of words responded correctly in the long-term delayed free recall; (4) AVL-CR, number of words answered correctly with category-cued recall; (5) AVL-REC, number of words answered correctly in the recognition test. Because we sought to explore the recession of episodic memory function, our target variables excluded the AVL-T, AVL-CR, and AVL-REC.

thumbnail
Figure 1. AVL scores in three diagnostic groups in each trial.

https://doi.org/10.1371/journal.pone.0051157.g001

thumbnail
Figure 2. AVL scores of different age groups in each trial in patients with aMCI.

https://doi.org/10.1371/journal.pone.0051157.g002

Statistical Analysis

Chi-square analysis was adopted for ordinal data. Overall continuous variables among the three groups (aMCI, mild AD, and NC groups) were assessed with one-way analysis of variance. Post hoc pairwise comparisons between groups were assessed using the LSD test. Receiver operating characteristic (ROC) curves were used to determine the ability of AVL-SR and AVL-LR to discriminate aMCI from NC. The level of significance was set at α = 0.05. Statistical analyses were carried out using SPSS 16.0.

thumbnail
Table 3. Percentage of zero scores of the AVL-SR and AVL-LR in different age groups in patients with aMCI.

https://doi.org/10.1371/journal.pone.0051157.t003

Results

Demographic Characteristics of Three Diagnostic Groups

Table 2 summarizes demographic information, MMSE, and AVLT-H scores for the three diagnostic groups (NC, aMCI, mild AD). Patients in the three groups were comparable in age, education level, and gender distribution.

thumbnail
Table 4. ROC analysis of AVL-SR and AVL-SR for identifying aMCI in different age groups.

https://doi.org/10.1371/journal.pone.0051157.t004

Correlations between Demographic Variables and AVLT-H Indicators

Age was negatively correlated with AVL-SR and AVL-LR (r = −0.24 and −0.26, p<0.01) in NC group. Both indicators had a significant gender difference (p<0.01). The higher difference was found in AVL-LR with an average score of 5.3 for males and 6.3 for women. Scores on the AVLT indicators were not significantly related to education (r = 0.01–0.03, p> 0.05).

Correlations between AVL-SR and AVL-LR

For all participants, the correlation between AVL-SR and AVL-LR was very high (r = 0.94, p<0.01). Regardless of cognitive status and age, the difference in mean value between AVL-SR and AVL-LR was very small, less than 0.5 points, and not significant (p>0.5). For delayed recall, from 5 minutes to 20 minutes, the score curve was shaped like a platform (see Figures 1 and 2).

Percentage of Zero Scores of AVLT-H Indicators

A score of zero for AVLT-H also reflects memory impairment in patients. Thus we compared the percentage of zero scores on AVL-SR and AVL-LR. No significant difference was found between the two indicators in aMCI (Table 3).

ROC Analysis of AVLT Indicators for Identifying aMCI in Different Age Groups

In the 80- to 89-year-old group, ROC areas under the curve (AUCs) were similar between AVL-SR and AVL-LR (Z = 1.76, p = 0.08). In other age groups (that is, the 50–59, 60–69, and 70–79 age groups), the AUC of AVL-LR was greater than that for AVL-SR (Z = 4.28, 2.93, 2.97, respectively, p<0.01) (Table 4).

Discussion

To our knowledge, there is little information detailing the diagnostic utility of different retention intervals of delayed recall for aMCI, the prodromal stage of AD. The present study used a case-control design to determine the diagnostic value of using short-term and long-term delayed recall performance of the AVLT-H for aMCI identification. We also measured age-specific classification accuracy for the two delayed recall indicators.

Long-term delayed recall of word list learning appeared to have the highest diagnostic accuracy for differentiating MCI patients from controls and might provide the most specific measure for early AD diagnosis. Long-term delayed recall reflects entorhinal and hippocampal cortical function, where the earliest neuropathological changes in AD occur [28], [29]. Retention intervals of delayed recall have to be long enough to be sensitive to impairment but short enough to keep the examinee’s compliance. So far, few have compared the use of AVL-SR and AVL-LR in identifying aMCI. In the current study, both indicators had ideal diagnostic value. Moreover, in the 80- to 89-year-old group, the AUC of AVL-SR and AVL-LR showed no significant difference.

The AVLT-H has been shown to accurately distinguish between aMCI and controls. Our study shows that the correlation between AVL-SR and AVL-LR was very high (r = 0.94), and the percentage of zero scores for the two indicators was similar. The numerical difference of the two indicators was less than 0.5 points in each age group of aMCI. There were few studies focusing on the time interval of delayed recall in VTLs. The Alzheimer’s Disease Neuroimaging Initiative (ADNI) group once examined the Rey AVLT measures and brain volume. They reported that, for both 5-minute and 30-minute delayed recall trials, the hippocampus was the only region correlated with the performance [28]. The ADNI results suggested a common biological basis of short-term and long-term delayed recall. They did not, however, provide direct evidence of the equivalence between the two trials. It is worth noting that, in the current study, although the AUCs for the two indicators were different in some age groups, the cut-off scores were identical. Practically, the AVL-SR appeared to be sensitive enough to detect a memory deficit. It was the first time that the close correlation of AVL-SR and LR had been reported in a Chinese version of verbal learning test.

To identify MCI and determine its subtypes, it is necessary to assess multiple cognitive domains such as memory, language, attention, and visual-spatial and executive function. A comprehensive neuropsychological test battery is always time-consuming, especially the memory test. There is an urgent need for a test that requires minimal time investment but maintains the clinical diagnosing value for subtle memory decline like in MCI. But if the content and procedure of a classical test is arbitrarily reduced without re-validation, the accuracy and reliability of the test will undoubtedly be affected. The current study suggests that, using a relatively shorter time interval (like AVL-SR), it is possible to identify aMCI in a feasible time duration, especially for the oldest (80–89 years old) with poor physical and psychological tolerance. Yet, if time permits, the full version of the VLT including AVL-LR is still recommended for a more accurate and sensitive measurement.

The present study has some strength. Three memory tests including LMT, RCFT, and AVLT were administered to all the participants. The aMCI was diagnosed according to the scores of long-term delayed recall of both LMT and RCFT, whereas the AVLT-H scores were not taken into consideration for diagnosis, thus avoiding a circular argument. Additionally, the relatively large sample size allowed the analysis to be performed in age-stratified groups, which minimized the confounding effect of age. There was, however, an important limitation as well. Without AD-specific biomarker analysis such as beta-amyloid and Tau protein measurement, we could only base our diagnosis of AD and MCI on clinical evidence in a probable level of likelihood [30], [31].

In general, short-term delayed recall (that is, 3- to 5-minute delay time) and long-term delayed recall (that is, 20-minute delay time) of AVLT-H has a similar ability to discriminate aMCI from cognitively normal subjects. Future research will assess the value of each AVLT-H score in the differential diagnosis, prognosis, and conversion prediction of MCI to dementia.

Acknowledgments

The authors thank Prof. Agnes Chan of the Psychology Department of the Chinese University of Hong Kong for providing the Hong Kong Verbal Learning Test.

Author Contributions

Conceived and designed the experiments: QHG. Performed the experiments: YZ. Analyzed the data: QHG. Contributed reagents/materials/analysis tools: QHZ YRL. Wrote the paper: QHZ QHG YRL. Supervised the study and critically reviewed the manuscript: ZH.

References

  1. 1. Elias MF, Beiser A, Wolf PA, Au R, White RF, et al. (2000) The preclinical phase of alzheimer disease: A 22-year prospective study of the Framingham Cohort. Arch Neurol 57: 808–813.
  2. 2. Rey A (1958) L’examenclinique en psychologie [The psychologicalexamination]. Paris: Presses Universitaires de France.
  3. 3. Delis DC, Kramer JH, Kaplan E, Ober BA (1987) California Verbal Learning Test: Adult Version Manual. San Antonio, TX: The psychological Corporation.
  4. 4. Strauss E, Spreen O (2006) A compendium of neuropsychological tests. New York: Oxford university press.
  5. 5. Brandt J, Benedict RHB (2001) Hopkins Verbal Learning Test-Revised. Lutz, FL: Psychological Assessment Resources.
  6. 6. Lacritz LH, Cullum CM, Weiner MF, Rosenberg RN (2001) Comparison of the hopkins verbal learning test-revised to the California verbal learning test in Alzheimer’s disease. Appl Neuropsychol 8: 180–184.
  7. 7. Kalbe E, Kessler J, Calabrese P, Smith R, Passmore AP, et al. (2004) DemTect: a new, sensitive cognitive screening test to support the diagnosis of mild cognitive impairment and early dementia. Int J Geriatr Psychiatry 19: 136–143.
  8. 8. Mohs RC, Rosen WG, Davis KL (1983) The Alzheimer’s disease assessment scale: an instrument for assessing treatment efficacy. Psychopharmacol Bull 19: 448–450.
  9. 9. Morris JC, Mohs RC, Rogers H, Fillenbaum G, Heyman A (1988) Consortium to establish a registry for Alzheimer’s disease (CERAD) clinical and neuropsychological assessment of Alzheimer’s disease. Psychopharmacol Bull 24: 641–652.
  10. 10. Nasreddine ZS, Phillips NA, Bedirian V, Charbonneau S, Whitehead V, et al. (2005) The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc 53: 695–699.
  11. 11. Guo QH, Lv CZ, Hong Z (2001) Reliability and validity of auditory verbal learning test on Chinese elderly patients. Chinese Mental Health J 15: 3.
  12. 12. Petersen RC (2004) Mild cognitive impairment as a diagnostic entity. J Intern Med 256: 12.
  13. 13. Katzman R, Zhang MY, Ouang Ya Q, Wang ZY, Liu WT, et al. (1988) A Chinese version of the Mini-Mental State Examination; impact of illiteracy in a Shanghai dementia survey. J Clin Epidemiol 41: 971–978.
  14. 14. Guo QH, Zhao QH, Chen MR, Ding D, Hong Z (2009) A comparison study of mild cognitive impairment with 3 memory tests among Chinese individuals. Alzheimer Dis Assoc Disord 23: 253–259.
  15. 15. Pfeffer RI, Kurosaki TT, Harrah CH, Chance JM, Filos S (1982) Measurement of functional activities in older adults in the community. J Gerontol 37: 323–329.
  16. 16. McKhann G, Drachman D, Folstein M, Katzman R, Price D, et al. (1984) Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology 34: 939–944.
  17. 17. Morris JC (1993) The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology 43: 2412–2414.
  18. 18. Yu H, Guo QH, Hong Z, Lv CZ (2005) Logic Memory Test in Early Detection of Alzheimer’s Disease. Nerve Disease and Mental Hygeine 5: 89–91.
  19. 19. Guo QH, Lv CZ, Hong Z (2000) Application of Rey-Osterrieth complex figure test in Chinese normal old people. Chinese J Clinical Psychology 8: 205–207.
  20. 20. Guo QH, Hong Z, Shi WX, Sun YM, Lv CZ (2006) Boston Naming Test in Chinese Elderly, Patient with Mild Cognitive Impairment and Alzheimer s Dementia. Chinese Mental Health J 20: 81–84.
  21. 21. Kaplan E GH, Weintraub S (1983) The Boston naming test. Philadelphia: Lea & Febiger.
  22. 22. Zhao QH, Guo QH, Shi WX, Zhou Y, Hong Z (2007) Category verbal fluency test in identification and differential diagnosis of dementia. Chinese J Clinical Psychology 15: 9.
  23. 23. Sheridan LK, Fitzgerald HE, Adams KM, Nigg JT, Martel MM, et al. (2006) Normative Symbol Digit Modalities Test performance in a community-based sample. Arch Clin Neuropsychol 21: 23–28.
  24. 24. Lu JC, Guo QH, Hong Z (2006) Trail making test used by Chinese elderly patients with mild cognitive impairment and mild Alzheimer dementia. Chinese J Clinical Psychology 14: 118–121.
  25. 25. Guo QH, Hong Z, Lv CZ (2005) Application of Stroop color-word test on Chinese elderly patients with mild cognitive impairment and mild Alzheimer’s dementia. Chinese Journal of Neuromedicine 4: 701–704.
  26. 26. Guo QH, Sun YM, Yuan J, Hong Z, Lv CZ (2007) Application of eight executive tests in participants of Shanghai Community. Chin J of Behavioral Med Sci 16: 4.
  27. 27. Guo QH, Fu JH, Yuan J, Zhao QH, Cao XY, et al. (2008) A study of validity of a new scoring system of clock drawing test. Chinese J Neurology 41: 4.
  28. 28. Wolk DA, Dickerson BC (2011) Fractionating verbal episodic memory in Alzheimer’s disease. Neuroimage 54: 1530–1539.
  29. 29. Shankle WR, Romney AK, Hara J, Fortier D, Dick MB, et al. (2005) Methods to improve the detection of mild cognitive impairment. Proc Natl Acad Sci U S A 102: 4919–4924.
  30. 30. Gomar JJ, Bobes-Bascaran MT, Conejero-Goldberg C, Davies P, Goldberg TE (2011) Utility of combinations of biomarkers, cognitive markers, and risk factors to predict conversion from mild cognitive impairment to Alzheimer disease in patients in the Alzheimer’s disease neuroimaging initiative. Arch Gen Psychiatry 68: 961–969.
  31. 31. Wagner M, Wolf S, Reischies FM, Daerr M, Wolfsgruber S, et al. (2012) Biomarker validation of a cued recall memory deficit in prodromal Alzheimer disease. Neurology 78: 379–386.