Ann Geriatr Med Res Search

CLOSE


Ann Geriatr Med Res > Epub ahead of print
Jung and Togo: Validity and Reliability of Smartphone-Based Cognitive Assessment in Older Adults

Abstract

Background

This study examined the validity and reliability of two smartphone-based tasks, the Symbol Search Task (SST-SP) and the Dot Memory Task (DMT-SP), in cognitively healthy older adults.

Methods

Sixty participants (32 older and 28 younger adults) completed two sessions approximately one week apart. In the first session, participants performed standardized cognitive tests (Symbol Digit Substitution Test and Visual Digit Span Test) on a personal computer (PC), PC-based Symbol Search Task (SST-PC), and Dot Memory Task (DMT-PC), followed by the SST-SP and DMT-SP. The second session included only SST-SP and DMT-SP.

Results

In older adults, the correlation between the SST-SP and SST-PC was very strong (r=0.83). In addition, the SST-SP showed a moderate correlation with the Symbol Digit Substitution Test (r=0.50). A very strong correlation was observed between DMT-SP and DMT-PC (r=0.86). The DMT-SP also showed a moderate correlation with the Visual Digit Span Test (r=-0.47). The SST-SP demonstrated good test–retest reliability and high split-half reliability, whereas the DMT-SP showed moderate test–retest reliability and acceptable-to-good split-half reliability. Older adults scored lower than younger adults on both the SST-SP and DMT-SP.

Conclusion

The SST-SP and DMT-SP are valid and reliable tools for assessing processing speed and working memory, respectively, in cognitively healthy older adults. These tools may provide new strategies for assessing the cognitive performance and/or ability in older adults in both research and clinical settings.

INTRODUCTION

Mild cognitive impairment (MCI) is a transitional stage between normal aging and dementia, with 10%–15% of individuals developing dementia annually.1) Given the growing prevalence of dementia, global trends toward population aging, and the significant disruptions in daily functioning caused by dementia,2) the prevention and mitigation of abnormal cognitive decline have become a global public health challenge.3) Although no effective treatment currently exists to reverse dementia, nonpharmacological interventions, such as cognitive training and physical activity, have shown positive effects on cognitive function in individuals with MCI.4) Therefore, early detection of MCI is critical for maintaining independence and reducing the risk of progression to dementia.5)
Several traditional assessment tools for cognitive function, including the Mini-Mental State Examination and the Montreal Cognitive Assessment (MoCA), are widely used for MCI screening. However, they require in-person administration, even when administered through video conference, which has recently been shown to have acceptable reliability and agreement in older adults.6) Other paper-based or computerized alternatives, such as the Addenbrooke’s Cognitive Examination and Alzheimer’s Disease Assessment Scale–Cognitive Subscale, have been developed to improve diagnostic precision; however, they still require trained examiners. Moreover, these assessments are susceptible to practice effects, particularly those arising from repeated exposure to identical stimuli, and are typically administered infrequently (e.g., every 6–12 months). Consequently, they may be less suitable for detecting subtle early cognitive changes indicative of abnormal decline.
Recent studies suggest that smartphone-based cognitive assessments can be administered without an examiner and can incorporate varying stimuli, enabling frequent, repeated, and low-burden cognitive performance measurements in daily life.7) Previous studies have validated several smartphone-based cognitive tasks in diverse populations, including individuals with multiple sclerosis8) and fibromyalgia9) and healthy adults aged 25–65 years.10) Among these tasks, the Symbol Search Task (SST) and Dot Memory Task (DMT) were designed to assess processing speed and working memory, respectively. Validation studies have shown that the SST is moderately to strongly correlated with standard measures of processing speed, whereas the DMT shows moderate correlations with working memory measures that are comparable to those observed for traditional personal computer (PC)-based tasks.8-10)
Evidence suggests that a decline in working memory and processing speed beyond that expected with normal aging may already be present in the early MCI stages.11,12) As these changes may be subtle and difficult to detect in a single assessment, repeated assessments have been suggested as a potentially effective approach for detecting them.13) Despite these indications, the smartphone-based SST and DMT have not yet been validated for older adults. Therefore, the present study evaluated the validity and reliability of smartphone-based SST and DMT in cognitively healthy older adults and examined age-related differences in performance.
In a previous study, the smartphone-based SST and DMT included 12 and two trials per session, respectively, yielding intraclass correlation coefficients (ICCs; i.e., the reliability of a single assessment) of 0.54 and 0.39 across 70 repeated assessments (five sessions per day over 14 days).10) These results indicate that 54% and 39% of the variance in performance reflect between-person differences, respectively. When scores were averaged across five consecutive assessments, ICCs reached 0.90 (SST) and 0.75 (DMT), indicating high reliability (ICC ≥0.90). Based on these findings, the present study increased the number of trials per session to 60 for the SST and 15 for the DMT to improve single-session reliability.

MATERIALS AND METHODS

Participants

This study included 32 older and 28 younger adults. An a priori power analysis for correlation analysis was conducted using G*Power (version 3.1). To detect a medium-to-large effect size (ρ=0.50) with α=0.05 and power=0.80,14) the minimum required sample was 21 participants. Participants were recruited through websites (https://www.jikken-baito.com) and the Bunkyo Ward Silver Human Resources Center. All participants provided written informed consent before participation. Eligibility criteria included no history of major psychiatric or neurological conditions (e.g., Parkinson disease, cerebrovascular disease, or brain injury), age ≥60 years for older adults or 21–29 years for younger adults, independent daily living activities (Tokyo Metropolitan Institute of Gerontology Index of Competence [TMIG-IC] ≥10), normal level of general cognitive functioning (Japanese version of the MoCA [MoCA-J] ≥26), and proficiency in Japanese. The study protocol was reviewed and approved by the University of Tokyo Human Research Ethics Committee (Approval No. 24-390) and conducted in accordance with the Declaration of Helsinki.

Study Procedure

The study consisted of two sessions. In Session 1, participants completed a baseline self-administered questionnaire, followed by the MoCA-J. Subsequently, they completed cognitive tests in the following order: a PC version of the Symbol Digit Substitution Test, Visual Digit Span Test, Symbol Search Task (SST-PC), and Dot Memory Task (DMT-PC). Then, they completed the smartphone versions of the Symbol Search Task (SST-SP) and Dot Memory Task (DMT-SP). Practice trials were provided for all tasks and repeated as needed until participants were familiar with the procedures. Session 2, which included the SST-SP and DMT-SP, was conducted approximately 1–2 weeks later. All sessions were conducted individually in a quiet testing room by trained research staff. Session 1 lasted approximately 1.5 hours, including breaks, whereas Session 2 lasted approximately 10 min. Participants were compensated with 2,000 yen per session. Data were collected from November 2024 to April 2025.

Baseline Characteristics

The baseline questionnaire included questions regarding age, sex, education, depressive symptoms, functional ability, and global cognitive function. Depressive symptoms were assessed using the Center for Epidemiologic Studies Depression Scale (CES-D) (possible range, 0–60), with a score ≥16 indicating possible depression.15) Functional ability was assessed using the TMIG-IC (possible range, 0–13), with a score of ≤9 suggesting difficulty in independent living.16) Global cognitive function was assessed using the MoCA-J (possible range, 0–30), with cognitive impairment defined as a score ≤25. The MoCA-J assesses multiple cognitive domains including attention, executive function, memory, language, visuospatial ability, conceptual thinking, calculations, and orientation.17)

Cognitive Assessments

The Symbol Digit Substitution Test18,19) and Visual Digit Span Test20) were administered on a PC using Inquisit 6 software (Millisecond Software, Seattle, WA, USA), which included computerized adaptations of the tests. The SST-SP, SST-PC, DMT-SP, and DMT-PC10) were developed using R and JavaScript. We used a desktop computer with a 23.8-inch display (DELL XPS8920) for the PC-based tasks, and an Android smartphone (ASUS ZenFone) with a 5.5-inch display for the smartphone-based tasks.

Symbol Digit Substitution Test

The Symbol Digit Substitution Test is a widely used neuropsychological assessment tool designed to evaluate processing speed and attention. Participants were presented with a key at the top of the screen, pairing numbers 1–9 with unique symbols. Below the key, a randomized sequence of symbols appeared, and participants were given 120 seconds to match as many symbols as possible by clicking on their corresponding numbers displayed on the screen using a mouse. The task began when participants clicked "Start," and the times between successive clicks were recorded as response times. The test score was defined as the median response time for correct responses.

Visual Digit Span Test

The Visual Digit Span Test, which requires participants to recall and reproduce digit sequences, was used to assess working memory. In each trial, a sequence of digits ranging from 1 to 9 (excluding consecutive patterns such as "123") was displayed at a rate of one digit per second. After the final digit disappeared, a red dot was displayed for 1 second, which prompted participants to recall and input the recalled digit sequence into a text box. If the response was correct, the next trial increased by one digit in the sequence. If the response was incorrect, the same sequence length was repeated. When the participants made two consecutive errors in the same sequence length, the sequence length was reduced by one digit. The test score was defined as the average length of the sequences correctly recalled.20)

Symbol Search Task

The SST was designed to measure processing speed by assessing how quickly participants processed visual information and made comparisons. In each trial, participants were presented with three symbol pairs in the upper row and two in the lower row of the screen. They were instructed to identify which pair in the lower row matched one of the pairs in the upper row and select it using a mouse (SST-PC) or by touching the screen (SST-SP) as quickly as possible (Fig. 1). The task consisted of 60 trials, and participants responded at their own pace. When a pair in the lower row was selected, all symbols disappeared, and the next stimulus appeared after 200 ms. Before the main task, practice trials were repeated until adequate familiarization was confirmed. The SST score was defined as the median response time for correct answers, with shorter response times indicating a higher processing speed.

Dot Memory Task

The DMT was designed to assess working memory. It consisted of three phases: encoding, distraction, and recall. In the encoding phase, the participants were shown a 5×5 grid with three red dots placed at specific locations and asked to remember their positions. After a 3-second encoding phase, the grid and red dots disappeared, and the distraction phase began, during which participants were asked to identify and touch the "F" letters among an array of "E" letters for 8 seconds. Following the distraction phase, an empty 5×5 grid reappeared, and participants were prompted to recall the locations of the three red dots and indicate them on the grid using a mouse (DMT-PC) or by touching a screen (DMT-SP). Participants then pressed the "Done" button to complete the trial. Each participant completed 15 trials with a 1-second interval between trials (Fig. 2). Before the main task, practice trials were repeated until adequate familiarization was confirmed. The performance in each trial was defined as the error score, which was calculated as the minimum total Euclidean distance between the recalled and correct dot locations. The DMT score was defined as the total error score of 15 trials, with lower scores indicating higher recall accuracy.

Statistical Analysis

Four older adults were excluded because their MoCA-J scores were <26, leaving 28 older adults (62–83 years) and 28 younger adults (21–29 years) in the final analysis. To assess the validity of SST-SP and DMT-SP in older adults, Pearson correlation coefficients were computed for the SST-SP, DMT-SP, SST-PC, DMT-PC, and other PC-based cognitive tasks in older adults. Correlation coefficients were interpreted according to Evans’ criteria: less than 0.20 indicated very weak, between 0.20 and 0.39 indicated weak, between 0.40 and 0.59 indicated moderate, between 0.60 and 0.79 indicated strong, and greater than 0.80 indicated very strong.21) Test–retest reliability of the SST-SP and DMT-SP in older adults was assessed using ICC with a 95% confidence interval (CI) calculated from the scores obtained across the two sessions. ICC values were interpreted as poor (<0.5), moderate (0.5–0.75), good (0.75–0.9), and excellent (>0.90).22) We also assessed the split-half reliability to examine the internal consistency of the SST-SP and DMT-SP in older adults. The SST-SP and DMT-SP trials were divided into odd- and even-numbered trials, and the correlation between their scores within each task was calculated for older adults using the Spearman–Brown correction,23) with coefficients ≥0.70 acceptable, ≥0.80 good, and ≥0.90 excellent.24) Welch’s two-sample t-test was used to assess group differences in SST-SP and DMT-SP scores between older and younger adults. Statistical significance was set at p<0.05. All statistical analyses were conducted using R software (version 4.4.2).

RESULTS

Table 1 presents participant demographics, including age, sex, educational level, and MoCA-J, CES-D, and TMIG-IC scores, indicating high education levels and no signs of dementia, depression, or functional difficulties.
Table 2 shows cognitive assessment scores and Pearson correlations in Session 1 for older adults. The SST-SP was strongly correlated with SST-PC (r=0.83, p<0.001) and moderately correlated with the Symbol Digit Substitution Test (r=0.50, p=0.006). In contrast, its correlation with the Visual Digit Span Test was weak and not statistically significant (r=−0.36, p=0.060). The DMT-SP was strongly correlated with DMT-PC (r=0.86, p<0.001) and moderately correlated with the Visual Digit Span Test (r=−0.47, p=0.011). In contrast, its correlation with the Symbol Digit Substitution Test was weak and not statistically significant (r=0.27, p=0.165).
Regarding the SST-SP and DMT-SP scores in Sessions 1 and 2 among the older adults, the mean SST-SP score (i.e., reaction time) was 2,204.9±398.1 ms in Session 1 and 2,104.6±489.6 ms in Session 2. The mean DMT-SP score (i.e., total error score) was 17.4±12.5 in Session 1 and 17.0±11.6 in Session 2.
In older adults, ICC of the SST-SP between Sessions 1 and 2 was 0.760 (95% CI, 0.545–0.822; p<0.001), indicating good test–retest reliability (Table 3, Fig. 3); thus, 24% of the total variance (1 − ICC) reflected intra-individual variability across sessions. The ICC for the DMT-SP was 0.661 (95% CI, 0.385–0.828; p<0.001), indicating moderate test–retest reliability (Table 3, Fig. 3), with 34% of the total variance reflecting intra-individual variability. The mean interval between sessions was 10.1±3.0 days.
In older adults, the split-half reliability of the SST-SP was 0.934 (95% CI, 0.858–0.970; p<0.001) and 0.963 (95% CI, 0.920–0.983; p<0.001) for Sessions 1 and 2, respectively. The reliability of the DMT-SP was 0.732 (95% CI, 0.412–0.878; p=0.001) and 0.874 (95% CI, 0.723–0.942; p<0.001) for Sessions 1 and 2, respectively (Table 3).
Reaction time for the SST-SP was significantly longer in older adults (mean 2,204.9±398.1 ms) than in younger adults (mean 1,885.2±525.1 ms; p=0.008). The total error score on the DMT-SP was significantly higher in older adults (mean 17.4±12.5 a.u.) than in younger adults (mean 10.5±9.0 a.u.; p=0.017) (Table 4).

DISCUSSION

This study evaluated the validity and reliability of two smartphone-based cognitive tasks, the SST-SP and the DMT-SP, in older adults with no history of depression, dementia, or functional difficulties. Construct validity was examined through correlations with standardized cognitive tests, test–retest reliability, internal consistency (split-half analysis), and performance differences between older and younger adults. To the best of our knowledge, this is the first study to assess smartphone-based task validity and reliability in older adults.
Previous studies conducted on individuals with multiple sclerosis,8) fibromyalgia,9) or healthy adults aged 25–65 years10) validated smartphone-based cognitive tasks by examining their correlation with standardized neuropsychological measures. For instance, smartphone-based SSTs showed moderate-to-strong correlations with conventional processing speed measures (Symbol Digit Modalities Test, National Institutes of Health [NIH] Toolbox Pattern Comparison Test), whereas correlations with working memory tasks were weak, supporting their validity for evaluating processing speed. In contrast, smartphone-based DMTs showed moderate correlations with traditional working memory tasks (Operation Span, Paced Auditory Serial Addition Test, NIH Toolbox List Sorting Working Memory Test) and weaker correlations with processing speed, indicating that DMTs may be more specifically suited for assessing working memory performance.8-10)
Although smartphone-based assessments are considered promising tools for monitoring cognitive performance and detecting subtle or early cognitive decline,25) the validity of smartphone-based SST and DMT in older adults has not been specifically examined. To address this gap, we examined the validity of the SST-SP and the DMT-SP in cognitively healthy older adults. SST-SP scores showed a moderate correlation with Symbol Digit Substitution Test scores and a weak correlation with Visual Digit Span Test scores. Conversely, DMT-SP scores showed a moderate correlation with Visual Digit Span Test scores and a weak correlation with Symbol Digit Substitution Test scores. In addition, the SST-SP and DMT-SP scores were strongly correlated with their corresponding PC-based tasks (SST-PC and DMT-PC, respectively). These results suggest that the SST-SP and DMT-SP may be useful for evaluating processing speed and working memory, respectively, in older adults.
Regarding reliability, a previous study reported single-administration ICCs of 0.54 for SST and 0.39 for DMT.10) In that study, the SST consisted of 12 trials, and the DMT consisted of two trials. In contrast, in our study, the estimated single-administration ICCs were 0.77 for the SST-SP and 0.65 for the DMT-SP; the SST-SP consisted of 60 trials, and the DMT-SP consisted of 15 trials. These findings suggest that the smartphone-based tasks used in our study provide more stable and consistent single-administration estimates of cognitive performance than the previously reported smartphone-based SST and DMT tasks. Moreover, the SST-SP demonstrated good test–retest reliability, and the DMT-SP demonstrated moderate test–retest reliability. Both tasks also showed acceptable-to-good internal consistency within a single session, as indicated by the split-half reliability analyses. These results also support the reliability of the SST-SP and DMT-SP performances in a single administration.
The SST-SP demonstrated high split-half reliability (r = 0.93, 0.96), indicating that the number of trials in a single assessment may be sufficient to obtain scores with excellent reliability (≥0.90). In contrast, the split-half reliability of the DMT-SP was moderate to good (r=0.73, 0.87), suggesting that the number of trials in a single assessment of the DMT-SP may be sufficient for research purposes. In addition, the results indicated that more trials are required to achieve excellent reliability. However, increasing the number of trials may introduce participant burden, fatigue, and disengagement, particularly in older adults, which may affect the validity and feasibility. This issue should be investigated further in future studies.
For the SST-SP and DMT-SP, approximately 24% and 34% of the variance in cognitive performance scores, respectively, was attributable to intra-individual variability. Although intra-individual variability includes random variances in performance scores, previous studies have identified various factors that may contribute to systematic variability in cognitive performance, including fatigue, mood, sleep quality, attentional lapses, and contextual or environmental stressors.26,27) Additionally, greater intra-individual variability has been linked to early neural disruption and an increased risk of dementia, particularly Alzheimer’s disease.28,29) Thus, identifying the factors that drive intra-individual variation may provide deeper insights into real-world cognitive functioning and serve as potential causes of abnormal cognitive decline in older adults.
Consistent with the well-established findings on cognitive aging,30,31) significant age-related differences were observed in both smartphone-based tasks. Older adults exhibited slower processing speeds and more working memory errors than younger adults, suggesting that smartphone-based assessments can capture age-related cognitive changes. In addition, the SST-SP showed a moderate correlation with the Symbol Digit Substitution Test (r=0.50, p=0.006), and the DMT-SP showed a moderate correlation with the Visual Digit Span Test (r=−0.47, p=0.011), further supporting the convergent validity of smartphone-based tasks. Given that these conventional measures are sensitive to MCI-related changes in processing speed and working memory32,33) and that smartphone-based tasks can be administered repeatedly with varied stimuli, SST-SP and DMT-SP may be useful for detecting early signs of cognitive decline beyond normal aging. Future studies should examine this possibility in individuals with an elevated MCI or dementia risk.
This study has some limitations. First, all the participants were daily smartphone users with relatively high educational levels. Thus, the generalizability of these results to the broader older adult population, including individuals with limited digital literacy and relatively low educational levels, remains unclear. Future research should examine the influence of smartphone proficiency and educational levels on task performance. Second, the older adult participants in this study were cognitively healthy, with MoCA-J scores ≥26. Further validation is needed to determine whether our smartphone-based tasks can reliably distinguish between healthy aging and cognitive impairments such as MCI. Third, the performance in Session 2 was better than that in Session 1. This difference may reflect practice/familiarization effects, as well as order-related fatigue. Although the participants completed a practice/familiarization session before the main tests in Session 1, additional practice-related gains may have occurred. Such gains may partly reflect offline learning (i.e., consolidation), whereby performance can improve between sessions even without further practice, for example, through consolidation processes occurring during the intervening days (including sleep). Moreover, in Session 1, smartphone-based tasks were administered after PC-based assessments, whereas in Session 2, only smartphone-based tasks were administered. Thus, participants may have experienced greater fatigue in Session 1, which may have contributed to their poorer performance than in Session 2. These practice-related gains and order-related fatigue may have increased the intra-individual variability between Sessions 1 and 2, thereby contributing to an underestimation of test–retest reliability. Given the possibility of offline learning effects, performance at the initial assessment in repeated-measures designs should be cautiously interpreted.
In summary, we confirmed the validity and reliability of smartphone-based cognitive tasks to assess processing speed and working memory in cognitively healthy older adults. These tasks may allow self-administered, repeated, momentary, and ecological assessments, thereby offering new strategies for assessing cognitive performance and/or ability in older adults in both research and clinical settings. Future studies are required to explore whether these tests are useful for the early MCI detection.

ACKNOWLEDGMENTS

The authors would like to thank all participants for their time and effort in participating in this study. We extend our sincere gratitude to the members of our laboratory for their valuable comments and insightful discussion.

CONFLICT OF INTEREST

The authors declare no conflicts of interest.

FUNDING

None.

AUTHOR CONTRIBUTIONS

Conceptualization, MJ and FT; Data curation, MJ; Formal analysis, MJ; Investigation, MJ; Methodology, MJ and FT; Project administration, FT; Supervision, FT; Writing—original draft, MJ; Writing—review & editing, MJ and FT.

Fig. 1.
Symbol Search Task. Participants were asked to select a pair from two pairs in the lower row that matched one of the pairs in the upper row on the screen as quickly as possible.
agmr-25-0175f1.jpg
Fig. 2.
Dot Memory Task. (A) Encoding phase: participants were shown the 5×5 grid with three red dots for 3 seconds on the screen and were asked to memorize the location of the red dots. (B) Distraction phase: after the grid and dots disappeared, they completed a distraction task for 8 seconds, identifying the letter "F"s among a series of "E"s. (C) Recall phase: the empty 5×5 grid reappeared, and participants were asked to recall and indicate the locations of the three red dots.
agmr-25-0175f2.jpg
Fig. 3.
Associations between scores on smartphone-based tasks across two sessions in older adults (n=28): (A) Symbol Search Task (SST-SP), (B) Dot Memory Task (DMT-SP). ICC, intraclass correlation coefficient.
agmr-25-0175f3.jpg
Table 1.
Demographic characteristics of the participants (n=56)
Older adults (n=28) Younger adults (n=28)
Age (y) 70.0±4.8 24.6±2.4
Sex, female 14 (50.0) 12 (42.9)
Education (y) 15.7±1.5 15.3±2.1
MoCA-J (points) 27.8±1.1 28.3±1.2
CES-D (points) 6.5±4.0 13.7±6.9
TMIG-IC (points) 11.8±1.1 -

Values are presented as mean±standard deviation or number (%).

MoCA-J, Japanese version of the Montreal Cognitive Assessment; CES-D, Center for Epidemiologic Studies Depression Scale; TMIG-IC, Tokyo Metropolitan Institute of Gerontology Index of Competence.

Table 2.
Scores of each cognitive assessment and the Pearson correlations between them in Session 1 for older adults (n=28)
Scores SST-SP SST-PC SDS Test DMT-SP DMT-PC VDS Test
SST-SP 2,204.9±398.1a) 1
SST-PC 2,568.4±434.8a) 0.83*** 1
SDST 2,751.5±566.4a) 0.50** 0.63*** 1
DMT-SP 17.4±12.5b) −0.11 −0.08 0.27 1
DMT-PC 17.1±12.7b) 0.08 0.10 0.32 0.86*** 1
VDST 6.7±1.0c) −0.36 −0.40* −0.60*** −0.47* −0.54** 1

Values are presented as mean±standard deviation.

SST, Symbol Search Task; DMT, Dot Memory Task; SP, smartphone; PC, personal computer; SDST, Symbol Digit Substitution Test; VDST, Visual Digit Span Test.

a)Reaction time (ms), b)Euclidean distance (a.u.), c)Number of digits.

*p<0.05, **p<0.01, ***p<0.001.

Table 3.
Reliability of the SST-SP and DMT-SP among older adults (n = 28)
Test–retest reliability Split-half reliability
Session 1 Session 2
ICC (95% CI) p-value Spearman-Brown coefficient (95% CI) p-value Spearman-Brown coefficient (95% CI) p-value
SST-SP 0.760 (0.545–0.822) <0.001 0.934 (0.858–0.970) <0.001 0.963 (0.920–0.983) <0.001
DMT-SP 0.661 (0.385–0.828) <0.001 0.732 (0.412–0.878) 0.001 0.874 (0.723–0.942) <0.001

SST, Symbol Search Task; DMT, Dot Memory Task; SP, smartphone; ICC, intraclass correlation coefficient; CI, confidence interval.

Table 4.
Scores on the SST-SP and DMT-SP by age group
Older adults (n=28) Younger adults (n=28) t p-value
SST-SP score 2,204.9±398.1a) 1,885.2±525.1a) 2.72 0.008
DMT-SP score 17.4±12.5b) 10.5±9.0b) 2.47 0.017

Values are presented as mean±standard deviation.

SST, Symbol Search Task; DMT, Dot Memory Task; SP, smartphone.

a)Reaction time (ms), b)Euclidean distance (a.u.).

REFERENCES

1. Petersen RC, Lopez O, Armstrong MJ, Getchius TS, Ganguli M, Gloss D, et al. Practice guideline update summary: mild cognitive impairment [RETIRED]: report of the guideline development, dissemination, and implementation subcommittee of the American Academy of Neurology. Neurology 2018;90:126-35.
crossref pmid
2. Winblad B, Amouyel P, Andrieu S, Ballard C, Brayne C, Brodaty H, et al. Defeating Alzheimer's disease and other dementias: a priority for European science and society. Lancet Neurol 2016;15:455-532.
crossref pmid
3. World Health Organization. A blueprint for dementia research [Internet]. Geneva, Switzerland: World Health Organization; 2022 [cited 2025 Oct 16]. Available from: https://www.who.int/publications/i/item/9789240058248.

4. Horr T, Messinger-Rapport B, Pillai JA. Systematic review of strengths and limitations of randomized controlled trials for non-pharmacological interventions in mild cognitive impairment: focus on Alzheimer's disease. J Nutr Health Aging 2015;19:141-53.
crossref pmid pmc pdf
5. Rasmussen J, Langerman H. Alzheimer's disease: why we need early diagnosis. Degener Neurol Neuromuscul Dis 2019;9:123-30.
crossref pmid pmc
6. Hernandez HH, Ong PL, Anthony P, Ang SL, Salim NB, Yew PY, et al. Cognitive assessment by telemedicine: reliability and agreement between face-to-face and remote videoconference-based cognitive tests in older adults attending a memory clinic. Ann Geriatr Med Res 2022;26:42-8.
crossref pmid pmc pdf
7. Nicosia J, Aschenbrenner AJ, Balota DA, Sliwinski MJ, Tahan M, Adams S, et al. Unsupervised high-frequency smartphone-based cognitive assessments are reliable, valid, and feasible in older adults at risk for Alzheimer's disease. J Int Neuropsychol Soc 2023;29:459-71.
crossref pmid
8. Goga JJ, Ginell KM, Ng YT, Ehde DM, Alschuler KN, Sliwinski MJ, et al. Feasibility, reliability, and validity of ambulatory smartphone-administered cognitive tests in multiple sclerosis. Mult Scler 2025;31:363-75.
crossref pmid pmc pdf
9. Valentine TR, Kratz AL. Feasibility, reliability, and validity of ambulatory cognitive tests in fibromyalgia and matched controls. J Int Neuropsychol Soc 2023;29:893-901.
crossref pmid pmc
10. Sliwinski MJ, Mogle JA, Hyun J, Munoz E, Smyth JM, Lipton RB. Reliability and validity of ambulatory cognitive assessments. Assessment 2018;25:14-30.
crossref pmid pdf
11. Kirova AM, Bays RB, Lagalwar S. Working memory and executive function decline across normal aging, mild cognitive impairment, and Alzheimer's disease. Biomed Res Int 2015;2015:748212.
crossref pmid pmc pdf
12. Haworth J, Phillips M, Newson M, Rogers PJ, Torrens-Burton A, Tales A. Measuring information processing speed in mild cognitive impairment: clinical versus research dichotomy. J Alzheimers Dis 2016;51:263-75.
crossref pmid pmc
13. Ohman F, Hassenstab J, Berron D, Scholl M, Papp KV. Current advances in digital cognitive assessment for preclinical Alzheimer's disease. Alzheimers Dement (Amst) 2021;13:e12217.
crossref pmid pmc
14. Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum; 1988.

15. Lewinsohn PM, Seeley JR, Roberts RE, Allen NB. Center for Epidemiologic Studies Depression Scale (CES-D) as a screening instrument for depression among community-residing older adults. Psychol Aging 1997;12:277-87.
crossref pmid
16. Koyano W, Shibata H, Nakazato K, Haga H, Suyama Y. Measurement of competence: reliability and validity of the TMIG index of competence. Arch Gerontol Geriatr 1991;13:103-16.
crossref pmid
17. Fujiwara Y, Suzuki H, Yasunaga M, Sugiyama M, Ijuin M, Sakuma N, et al. Brief screening tool for mild cognitive impairment in older Japanese: validation of the Japanese version of the Montreal cognitive assessment. Geriatr Gerontol Int 2010;10:225-32.
crossref pmid
18. Jaeger J. Digit symbol substitution test: the case for sensitivity over specificity in neuropsychological testing. J Clin Psychopharmacol 2018;38:513-9.
crossref pmid pmc
19. Thorndike EL. A standardized group examination of intelligence independent of language. J Appl Psychol 1919;3:13.
crossref
20. Woods DL, Kishiyamaa MM, Lund EW, Herron TJ, Edwards B, Poliva O, et al. Improving digit span assessment of short-term verbal memory. J Clin Exp Neuropsychol 2011;33:101-11.
crossref pmid pmc
21. Papageorgiou SN. On correlation coefficients and their interpretation. J Orthod 2022;49:359-61.
crossref pmid pmc pdf
22. Koo TK, Li MY. A Guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med 2016;15:155-63.
crossref pmid pmc
23. Parsons S, Kruijt AW, Fox E. Psychological science needs a standard practice of reporting the reliability of cognitive-behavioral measurements. Adv Methods Pract Psychol Sci 2019;2:378-95.
crossref pdf
24. Nunnally JC, Bernstein IH. Psychometric theory. 3rd ed. New York, NY: McGraw-Hill; 1994.

25. Koo BM, Vizer LM. Mobile technology for cognitive assessment of older adults: a scoping review. Innov Aging 2019;3:igy038.
crossref pmid pmc
26. Brose A, Schmiedek F, Gerstorf D, Voelkle MC. The measurement of within-person affect variation. Emotion 2020;20:677-99.
crossref pmid
27. Sliwinski MJ, Smyth JM, Hofer SM, Stawski RS. Intraindividual coupling of daily stress and cognition. Psychol Aging 2006;21:545-57.
crossref pmid pmc
28. Anderson ED, Wahoske M, Huber M, Norton D, Li Z, Koscik RL, et al. Cognitive variability: a marker for incident MCI and AD: an analysis for the Alzheimer's disease neuroimaging initiative. Alzheimers Dement (Amst) 2016;4:47-55.
crossref pmid pmc pdf
29. Christ BU, Combrinck MI, Thomas KGF. Both reaction time and accuracy measures of intraindividual variability predict cognitive performance in Alzheimer's disease. Front Hum Neurosci 2018;12:124.
crossref pmid pmc
30. Salthouse TA. The aging of working memory. Neuropsychol 1994;8:535.
crossref
31. Salthouse TA. The processing-speed theory of adult age differences in cognition. Psychol Rev 1996;103:403-28.
crossref pmid
32. Tsatali M, Poptsi E, Moraitou D, Agogiatou C, Bakoglidou E, Gialaouzidis M, et al. Discriminant validity of the WAIS-R digit symbol substitution test in subjective cognitive decline, mild cognitive impairment (amnestic subtype) and Alzheimer's Disease Dementia (ADD) in Greece. Brain Sci 2021;11:881.
crossref pmid pmc
33. De Tollis M, De Simone MS, Perri R, Fadda L, Caltagirone C, Carlesimo GA. Verbal and spatial memory spans in mild cognitive impairment. Acta Neurol Scand 2021;144:383-93.
crossref pmid pdf


ABOUT
ARTICLE & TOPICS
Article Category

Browse all articles >

TOPICS

Browse all articles >

BROWSE ARTICLES
EDITORIAL POLICY
FOR CONTRIBUTORS
Editorial Office
#401 Yuksam Hyundai Venturetel, 20, Teheran-ro 25-gil, Gangnam-gu, Seoul 06132, Korea
Tel: +82-2-2269-1039    Fax: +82-2-2269-1040    E-mail: agmr.editorial@gmail.com                

Copyright © 2026 by Korean Geriatrics Society.

Developed in M2PI

Close layer
prev next