Standardized Measurement of Muscle Strength and Physical Performance for Sarcopenia: An Expert-Based Delphi Consensus

Article information

Ann Geriatr Med Res. 2025;29(2):185-198
Publication date (electronic) : 2025 June 11
doi : https://doi.org/10.4235/agmr.25.0070
1Department of Rehabilitation Medicine, Soonchunhyang University Cheonan Hospital, Soonchunhyang University College of Medicine, Cheonan, Republic of Korea
2Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea
3Department of Rehabilitation Medicine, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
4Department of Rehabilitation Medicine, CHA Bundang Medical Center, CHA University School of Medicine, Seongnam, Republic of Korea
5Department of Orthopaedic Surgery, CHA Bundang Medical Center, CHA University School of Medicine, Seongnam, Republic of Korea
6Department of Orthopaedic Surgery, Daejeon Eulji Medical Center, Eulji University School of Medicine, Daejeon, Republic of Korea
7Department of Rehabilitation Medicine, Regional Rheumatoid and Degenerative Arthritis Center, Jeju National University Hospital, Jeju National University College of Medicine, Jeju, Republic of Korea
8Department of Internal Medicine, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam, Republic of Korea
9Department of Physical and Rehabilitation Medicine, Chonnam National University Medical School & Hospital, Gwangju, Republic of Korea
10Division of Geriatrics, Department of Internal Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
11Department of Rehabilitation Medicine, Ewha Woman's University Seoul Hospital, Ewha Woman's University School of Medicine, Seoul, Republic of Korea
12Department of Family Medicine, CHA Bundang Medical Center, CHA University School of Medicine, Seongnam, Republic of Korea
13Department of Physical Medicine and Rehabilitation, Dong-A University Hospital, Dong-A University College of Medicine, Busan, Republic of Korea
14Department of Rehabilitation Medicine, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
15Department of Physical Medicine and Rehabilitation, Soonchunhyang University Hospital, Soonchunhyang University College of Medicine, Bucheon, Republic of Korea
16Department of Rehabilitation Medicine, Ilsan Paik Hospital, Inje University College of Medicine, Goyang, Republic of Korea
17Department of Family Medicine, SMG-SNU Boramae Medical Center, Seoul, Republic of Korea
18Department of Sport Science, Korea Institute of Sport Science, Seoul, Republic of Korea
19Department of Physical and Rehabilitation Medicine, Kyung Hee University Hospital, Seoul, Republic of Korea
20Department of Physical Education, Seoul National University, Seoul, Republic of Korea
21Institute of Sport Science, Seoul National University, Seoul, Republic of Korea
22Institute on Aging, Seoul National University, Seoul, Republic of Korea
23Department of Family Medicine, Kyung Hee University Hospital, Kyung Hee University College of Medicine, Seoul, Republic of Korea
24Department of Orthopedic Surgery, Inha University Hospital, Inha University School of Medicine, Incheon, Republic of Korea
25Department of Physical Medicine and Rehabilitation, Kyung Hee University Hospital at Gangdong, Seoul, Republic of Korea
Corresponding Author: Jae-Young Lim, MD, PhD Department of Rehabilitation Medicine, Seoul National University Bundang Hospital, Seoul National University College of Medicine, 82 Gumi-ro 173beon-gil, Bundang-gu, Seongnam 13620, Republic of Korea E-mail: drlim1@snu.ac.kr
Received 2025 May 8; Revised 2025 June 8; Accepted 2025 June 11.

Abstract

Background

Despite updated sarcopenia guidelines, inconsistent protocols still cause clinical confusion and may compromise diagnostic and outcome accuracy. This Delphi study aimed to establish expert consensus to support the standardization of muscle strength and physical performance assessments for sarcopenia.

Methods

A two-round modified Delphi study was conducted with 26 experts in geriatrics and sarcopenia. Participants completed two rounds of anonymous questionnaires evaluating 39 items across seven domains using a nine-point Likert scale or choice-based questions. Consensus was defined as ≥75% agreement.

Results

In total, 27 of 38 statements (71.1%) reached consensus across two rounds Experts supported further standardization of assessments in alignment with the Asian and Korean Working Group on Sarcopenia (AWGS and KWGS) guidelines. For handgrip strength, consensus was achieved on using both mechanical and hydraulic dynamometers, hydraulic protocols, value selection, measurement time, and positioning, but not on mechanical protocols, repetitions, recovery intervals, repetitions, or unified cutoff values. For calf circumference, consensus was reached on measurement position, method, and value selection, but not on guideline application. In gait speed assessment, agreement was reached on speed, repetitions, assistive device use, and equipment type, but not on value selection, distance, acceleration/deceleration phases, or device interchangeability. For the 400-m walk test, the KWGS guideline and speed were endorsed. Chair stand test (CST) and Timed up-and-go (TUG) test reached consensus on armrest use, value selection, and repetitions, but not on seat height, (CST), or speed (TUG).

Conclusion

This study highlights areas of agreement and ongoing uncertainty, supporting future standardization efforts sarcopenia assessment methods.

INTRODUCTION

Sarcopenia is a progressive skeletal muscle disorder characterized by the loss of muscle mass and strength, leading to an impaired physical performance and increased risks of falls, functional decline, frailty, and mortality.1,2) A definition for sarcopenia has evolved through the achievement of a global expert consensus, with the current guidelines from the European Working Group on Sarcopenia in Older People (EWGSOP2) and the Asian Working Group on Sarcopenia 2019 (AWGS 2019) requiring assessments of muscle mass, strength, and physical performance to diagnose and grade the severity of sarcopenia.3,4) Despite standardized protocols, definitions for sarcopenia still vary due to differences in assessment methods and population characteristics. For example, EWGSOP2 and AWGS 2019 guidelines recommend different thresholds. To reflect population-specific needs, the Korean Working Group on Sarcopenia (KWGS) recently introduced guidelines tailored to the Korean population.5)

However, a universally accepted definition of sarcopenia has yet to be established. This lack of standardization has contributed to a substantial variation in the reported prevalence, incidence, and treatment outcomes for sarcopenia across studies. The absence of a unified definition has likely impeded effective identification and management of sarcopenia in both the clinical and research settings.1) To address this issue, the Global Leadership Initiative in Sarcopenia (GLIS) was recently established to develop a globally applicable definition.1) Through a Delphi-format consensus process, GLIS has proposed a definition that includes reduced muscle mass and strength, including muscle-specific strength. Importantly, physical performance is not considered part of the diagnostic criteria but rather as an outcome measure.

Despite advances in sarcopenia diagnostic guidelines, substantial variability remains in measurement procedures and cutoff values. Dual-energy X-ray absorptiometry and bioelectrical impedance analysis are widely used to assess muscle mass, but inconsistencies persist due to differences in calibration, software algorithms, and scanning protocols.6,7) The AWGS 2019 guidelines provide method-specific cutoff points,3) underscoring the need for cautious interpretation and cross-method compatibility. Unlike instrument-based assessments of muscle mass, muscle strength—particularly handgrip strength (HGS)—is assessed manually and is more vulnerable to variability due to differences in protocols, device types, and operator technique. Discrepancies between hydraulic and mechanical dynamometers have also been reported,8-10) but no consensus exists on whether device-specific cutoffs are needed or if universal thresholds with calibration adjustments suffice. HGS protocols also lack standardization,11) with variations in the number of trials,3,5,12,13) grip duration,11-14) and recovery intervals.3,5,11,13,15)

Similar challenges also affect physical performance assessments. Standardized, accurate evaluation of functional outcomes is essential for monitoring intervention efficacy. However, heterogeneous protocols that were reported across studies and became part of guidelines may create confusion for clinicians and hinder an appropriate selection of assessment tools. Procedural variability can affect diagnostic accuracy and prevalence estimates, complicating risk identification and longitudinal monitoring. Although KWGS has recently introduced sarcopenia guidelines adapted to the Korean population, implementation gaps remain due to limited clinician familiarity and uncertainty. This Delphi study aimed to establish an expert consensus on standardized protocols for assessing muscle strength and physical performance to enhance diagnostic accuracy, consistency, and the clinical relevance of sarcopenia outcome evaluations.

MATERIALS AND METHODS

Study Design

We conducted a two-round modified Delphi study, following the Conducting and Reporting Delphi studies guidelines.16) Our process for achieving a consensus employed structured questionnaires, with a pre-defined first-round questionnaire rather than open-ended items.17) An Institutional Review Board approval was not required, as the present study collected expert opinions to inform clinical practice rather than new data from human participants.

Delphi Panel

Healthcare experts were recruited from the Korean Society of Sarcopenia, the Korean Geriatrics Society, and the Korean Society for Bone and Mineral Research. All panelists had recognized expertise in the care and research of older adults and sarcopenia. The panel included both clinical and non-clinical professionals, such as orthopedic surgeons, geriatricians, rehabilitation physicians, exercise physiologists, and endocrinologists. An invitation package—including an overview of the study objectives, the Delphi process adopted, and the study timeline—was sent to 58 experts. Of these, 26 experts (44.8%) agreed to participate in the full Delphi process.

Delphi Questionnaire Domains/Statements

A targeted literature review was conducted on HGS and physical performance assessments, along with the existing sarcopenia-related guidelines and protocols. Relevant studies, reviews, and guidelines were examined to identify inconsistencies, omissions, and areas needing further discussion. Based on this literature review, measurement items were developed across seven domains: (1) general aspects of standardizing physical performance assessments, (2) HGS, (3) calf circumference (CC), (4) gait speed, (5) 400-m walk test, (6) chair stand test (CST), and (7) timed up-and-go (TUG) test. Two structured rounds of online questionnaires (in Korean language) were used to achieve an expert consensus. A steering committee comprising five rehabilitation clinicians with extensive experience in sarcopenia research and clinical care was established to guide the development of the questionnaire. The committee oversaw the entire process, including domain selection, item formulation, and interpretation of results. An initial draft of 31 questions was developed by two principal investigators (S.K.L. and J.Y.L.) and subsequently refined through iterative discussions within the committee. During the initial group meeting, all items were reviewed for clarity and conciseness. Based on feedback, the initial questionnaire was revised and expanded to include 39 items.

Delphi Survey Method Process and Administration

A two-round modified Delphi process was conducted, as most studies have reported the achievement of consensus within two rounds.17) The first round occurred from November 1 to 21, 2023, and the second from January 15 to February 5, 2025. The experts received study materials via email and accessed questionnaires through a secure, controlled-access link. All responses were anonymous. In the first round, the participants provided background information and answered 39 questions: eight 9-point Likert-scale items (0=total disagreement, 9=total agreement), one multiple-choice question, and 30 single-choice items. Based on feedback and results, 28 questions were selected for the second round. To support decision-making, each question included relevant protocols, guidelines, and research summaries. The second round consisted of two Likert-scale statements and 26 single-choice questions. The revised versions of questionnaire items that had not reach consensus in the first round were reassessed by all experts participating in the study. For those still lacking consensus, the study team reviewed and summarized the reasons for disagreement.

Statistical Analysis

Descriptive statistics were used to analyze data from both rounds of the Delphi survey. For Likert-scale items, we calculated the mean value, standard deviation, interquartile range, consensus, and content validity ratio (CVR). A consensus was defined as strong agreement when the consensus value exceeded 0.75, using the formula: Consensus = 1 – [(Q3 – Q1) / median]. CVR was calculated as CVR = [Ne – (N/2)] / (N/2), where Ne is the number of experts providing a positive response (score ≥7), and N is the total number of experts. A CVR above 0.37 in Round 1 (N=26) and 0.42 in Round 2 (N=21) indicated sufficient convergence.18) Likert-scale items meeting both consensus and CVR thresholds were considered to have reached agreement. For single-choice items, consensus was defined as an agreement rate ≥75%. Items not meeting these criteria were classified as non-consensus. All analyses were performed using R software version 4.3.1 (R Foundation for Statistical Computing, Vienna, Austria).

RESULTS

Study Participants

Of the 58 experts invited to participate in this Delphi study, the majority were in their 40s or 50s. A total of 26 physicians agreed to participate and completed Round 1 (44.8%), while 21 completed Round 2 (80.8%). Most participants specialized in rehabilitation medicine (53.8%, n=14), and the majority had more than 10 years of professional experience (76.9%, n=20) (Table 1).

Demographic information of the Delphi experts

Agreement Ratings in the Round 1 Delphi Survey

The results of Round 1 and 2 Delphi surveys are presented in Tables 2 and 3. In Round 1, a consensus was reached for 10 out of 38 statements (36.3%), while 28 statements (73.7%) did not reach consensus. The Round 1 questionnaire is provided in Supplement A. An agreement was reached on the need for further standardization of physical performance assessments, as outlined in the AWGS and KWGS guidelines (75%, CVR=0.69). The experts identified multiple assessments requiring standardization, with HGS being cited most frequently, followed by gait speed (62.5%), and CST (50%).

Results of Rounds 1 and 2 Delphi survey: Likert-scale and multiple-choice items

Results of Rounds 1 and 2 Delphi survey on muscle strength and physical performance

For HGS, a consensus was reached on using both mechanical and hydraulic dynamometers (80.8%) and adhering to the existing protocols for hydraulic types (87.5%, CVR=0.69). However, no consensus was reached regarding protocol adherence for mechanical devices, cutoff values across device types, positioning, repetitions, measurement timing, or recovery intervals.

For CC, there was no agreement on applying AWGS and KWGS guidelines, measurement position, laterality, or value criteria. For gait speed, a consensus was reached on conducting two repetitions (76.9%), allowing assistive devices (88.5%), and using both manual and automated devices (80.7%). However, there was no agreement on measurement distance, acceleration/deceleration zones, speed, value selection, or the interchangeability of devices. In the 400-m walk test, the consensus supported an adherence to KWGS guidelines (96.4%, CVR=0.62), but not regarding an appropriate walking speed.

For the CST, an agreement was reached on following the KWGS guidelines (87.5%, CVR=0.77) and the cutoff value (85.7%, CVR=0.62), but not on chair type, end position, measurement value, or seat height. For the TUG test, a consensus was reached on adherence to the KWGS guidelines (85.7%, CVR=0.69), but not on chair type, pace, seat/armrest height, repetitions, or values.

Agreement Ratings in the Round 2 Delphi Survey

Following Round 1 analysis, the Delphi questionnaire was revised for clarity, with modified statements addressing non-consensus items. The second-round survey included 28 statements, of which 17 (60.7%) reached consensus and 11 did not. The full questionnaire is provided in Supplement B.

For HGS, a consensus was reached on protocol adherence for mechanical dynamometers (75.0%, CVR=0.87), use of the sitting position with hydraulic types (76.2%), recording the maximum value (90.5%), and a minimum measurement time of 3 seconds (76.2%). However, no agreement was reached on cutoff values by device type, positioning with mechanical types, number of repetitions, or recovery intervals.

For CC, a consensus was achieved on applying the AWGS and KWGS guidelines (75.0%, CVR=0.73), measuring in a standing position (90.5%), bilateral measurement (95.2%), and recording the maximum value from both sides (50.5%). For gait speed, an agreement was reached on using a 4-m walk test (76.2%) and measuring the usual walking speed (90.5%), but not on acceleration/deceleration phases, value selection, or device interchangeability. For the 400-m walk test, a consensus supported the use of the fastest possible walking speed.

For the CST, a consensus was reached on using a straight-back chair without armrests (95.2%) and selecting the fastest value from the 5-repetitions test or the maximum value from the 30-second test (81.0%). No agreement was reached on end position (sitting or standing) or whether seat height should be based on the mean Korean popliteal height value in the sitting position or individual popliteal height plus the mean heel height. For the TUG test, a consensus was reached on using a straight-back chair without armrests (90.5%), no need for armrests (85.7%), performing two repetitions (100%), and recording the maximum value. No consensus was reached on walking speed or seat height.

In total, 27 of 38 statements (71.1%) reached consensus across both rounds, while 11 (28.9%) did not. The final results are presented in Tables 4 and 5. Based on the results of the present Delphi study, the proposed standardized measurement protocols for physical performance and muscle strength are summarized in Table 6.

List of accepted statements from the Delphi study

List of rejected statements from the Delphi study and the most selected responses

Summary of the proposed standardized measurement protocols for muscle strength and physical performance

DISCUSSION

The diagnosis of sarcopenia and evaluation of related outcomes depend on standardized assessments of muscle strength and physical performance. However, a significant variation in assessment protocols and measurement tools poses ongoing challenges. These inconsistencies complicate clinical implementation, introduce measurement variability, and may affect diagnostic accuracy and reported prevalence. To address these issues, this study used a modified Delphi method to build an expert consensus on muscle strength and physical performance assessments employed for sarcopenia. Although an agreement was reached on adherence to AWGS and KWGS guidelines, further standardization is needed. A persistent lack of consensus on several items underscores the need for ongoing research and discussion.

Handgrip Strength

Both mechanical (e.g., Smedley) and hydraulic (e.g., Jamar) dynamometers are used to assess HGS, each with specific standardized protocols.11-14) Hydraulic devices are typically used in a seated position with the elbow flexed at 90°,11-13) while mechanical devices are generally used standing with the elbow extended,14) although a seated position with an extended elbow is recommended when standing is not feasible.5) Despite these guidelines, no consensus has been reached on the optimal posture for mechanical devices. This likely reflects differing expert perspectives—some prioritize the accommodation of older or frail adults with a seated posture, while others support permitting both positions. Given that HGS values vary between sitting and standing with mechanical devices,19) a consistent positioning based on patient condition is essential.

In conformity with KWGS guidelines,5) most experts did not agree on the interchangeability of measurements between hydraulic and mechanical dynamometers. Although calibration between device types was recognized as an important issue, no consensus was achieved. Studies have shown that hydraulic devices generally yield higher values than mechanical ones,8-10) yet current guidelines lack device-specific cutoffs or calibration-adjusted values—highlighting a need for further research in this area.

The appropriate method to be employed for estimating HGS remains inconsistent. Although some advocate using the mean of multiple trials for greater accuracy,13,20) others argue that frail individuals may fatigue quickly, leading to underestimated mean values compared to their true maximal grip strength.21) As most standard protocols11,12) and studies use the highest value,22) this approach is generally considered more practical and appropriate. Adherence to laterality is also inconsistent: although standard protocols recommend assessing both hands,11-14) and AWGS and KWGS suggest using either both arms or the dominant arm,3,5) many studies have measured only the dominant hand. Since the dominant hand is typically stronger due to muscle hypertrophy,23) while right-hand dominance in tools and activities can affect strength regardless of handedness,23,24) measuring both hands and using the maximum value is likely to yield the most accurate assessment.

Standard protocols recommend three HGS measurements,11-14) while the AWGS and KWGS guidelines recommend at least two trials.3,5) Although HGS tends to increase gradually with repeated trials,10,25) studies have shown that only the difference between the first and second trial measurements is clinically meaningful in older adults, with minimal change thereafter—supporting the sufficiency of two trials for this group.25) Additionally, the Korea National Health and Nutrition Examination Survey switched from three measurements to two starting in 2022.26) Similarly, most respondents in this study preferred two trials, underscoring the need for further discussion on the optimal repetition number to be applied.

The recommended duration for HGS assessment varies across protocols, ranging from 3–5 seconds,13) to at least 3 seconds11) or instructions like “squeeze until the needle stops rising” 12) or “until you cannot squeeze any harder.” 14) However, prolonged contraction can elevate blood pressure and heart rate,27) increasing the risk of fatigue or tendon injury in frail older adults.28) Although no consensus exists on the optimal duration for isometric tension, 3–10 seconds is generally effective,29) with a maximal effort of 3–5 seconds recommended to minimize energy depletion,30) and 3 seconds considered appropriate for older adults to reduce fatigue.27) While the AWGS and KWGS guidelines do not specify a fixed duration,3,5) a 3–5 second measurement is considered suitable.

Recovery intervals vary across protocols. Some recommend at least 15 seconds for alternating-hand measurements,13) others suggest 60 seconds11,21) or a range of 15 seconds to 1 minute15) to prevent fatigue. Generally, short to moderate rest periods (60–120 seconds) are sufficient for maximizing muscular strength gains.31) The KWGS guidelines do not specify a time limit,5) whereas the AWGS guidelines suggest avoiding a fixed acquisition time.3) An interval of around 60 seconds between trials may be appropriate, although further discussion is needed.

Calf Circumference

The KWGS guidelines recommend measuring CC in a standing position using a non-elastic tape, with cutoff values of <34 cm for men and <33 cm for women. However, it does not specify laterality or whether to use the maximum or mean value.5) Sitting measurements can overestimate CC, and may lead to an underdiagnosing of sarcopenia, while right-side standing measurements show the strongest correlation with muscle mass and function, enhancing diagnostic accuracy.32) In this Delphi study, a consensus supported the use of the maximum value from bilateral standing measurements as the most appropriate approach.

Gait Speed

The AWGS 2019 guidelines recommend measuring the time taken to walk 6 m at a normal pace from a moving start, excluding deceleration, and computing the mean value of at least two trials.3) The KWGS guidelines permit both 4 m and 6 m tests with 1–1.5 m acceleration and deceleration phases, but do not define how values should be estimated.5) In this Delphi study, a consensus supported the 4-m test with consideration of acceleration and deceleration, although no agreement was reached on their exact length—likely due to variations in clinical settings. Since frail older adults may not reach a steady gait until around 2.5 m, a proper accounting of these phases is essential for accuracy and repeatability.33)

Regarding value estimation, most respondents favored using the mean value; however, a consensus was not reached. The maximal value may better reflect true capacity and account for variability in initial attempts, while the mean value reduces measurement error.33) Further discussion is needed to determine the optimal approach.

While manual stopwatches are more accessible and commonly used in clinical settings, automatic timing devices are being increasingly adopted,3) and both are considered appropriate for assessing gait speed. Studies have reported comparable results between the two methods over various distances, with minimal error margins.33) However, discrepancies exist—for instance, slower gait speeds have been recorded with stopwatches compared to automatic timers,34) and manual static-start protocols may overestimate slowness compared to dynamic-start protocols using automatic timers.35) Additionally, manual moving-start measurements tend to yield faster speeds than both manual standing-start and automatic methods.36) Despite a general agreement on the use of both methods, a consensus was not reached on whether their results are interchangeable, warranting further investigation.

400-m Walk Test

The KWGS guidelines recommend the 400-m walk test to assess physical performance, with completion times over 6 minutes indicating reduced function, in line with EWGSOP2 criteria.4,5) The test involves walking 20 laps of 20 m along a marked corridor after a 2-minute warm-up. Participants are instructed to walk as fast as possible without running, receiving standardized encouragement at each lap. Rest is allowed without pausing the timer.4,37,38)

Chair Stand Test and Timed Up-and-Go Test

Both the CST (30 s/5 repetitions) and TUG test require a chair, but the optimal chair type to be used varies, with inconsistencies in seat height and armrest used being prevalent across studies.39,40) A straight-back chair with a seat height of 43–46 cm is generally recommended.38,41,42) For the CST, arm use is typically restricted, with participants crossing their arms over the chest.38,41,42) However, this may not be suitable for older adults with reduced function, as the absence of armrests can increase the risk of falls.39) The TUG test, involving standing, walking 3 m, turning, and returning, usually recommends a chair with armrests38,43) and permits walking aids.38,40) While both tests assess physical performance, the CST focuses on lower body strength while the TUG test assesses overall mobility. In this study, experts favored a straight-back chair without armrests for both tests. Given the variability in clinical settings, assessments should be adapted to the condition of the patient and the environment.

Seat height significantly impacts test performance: lower seats increase difficulty, while higher ones reduce hip and knee effort.44) In the 30-second CST, participants performed best with chairs at 120% and 110% of their lower leg length, while performance did not differ significantly between the standard 43 cm chair and chairs at 90% or 80% of this value.45) In the 5-repetition CST, performance improved at 115% of knee height compared to 100%, with slower, though not significantly different, times achieved at 85% of knee height.44) Among Koreans aged 60 and older, the mean knee height value is approximately 38 cm,46) and accounting for 3 cm of footwear height, the effective seat height is 41 cm, which corresponds to approximately 89%–95% of the standard chair height (43–46 cm). Thus, current protocols may overestimate performance by using chairs 104%–112% of typical knee height. Given population-specific anthropometry, further research is needed to identify the optimal seat height for Koreans.

The CST ending position differs between guidelines: EWGSOP2 specifies a standing end,4) while KWGS allows both a sitting end (>11 seconds) and a standing end (>10 seconds).5) For the TUG test, EWGSOP2 guidelines recommend walking at a “comfortable, fast, and secure pace,”4) whereas KWGS advises a usual (comfortable) pace.5) Both usual and maximum effort TUG protocols are accepted, although a maximum effort regimen is preferred because of its faster mean time, lower between-study variance, and greater reliability.40) This Delphi study did not reach a consensus on either point, highlighting the need for further discussion.

This study has several limitations. Although the expert panel consisted of specialists in the care and research of older adults and sarcopenia, most were from rehabilitation medicine, potentially limiting the generalizability of the findings. The lack of face-to-face meetings may have limited nuanced discussion, and deeper consensus-building. The Delphi questionnaire had a limited scope; although based on prior studies and guidelines to address key gaps, it could not cover all aspects of physical performance assessment, and some issues may have been overlooked. The use of predominantly closed-ended questions may also have influenced responses. Further, the small sample size and lack of consensus on several items reflect ongoing debate or limited evidence in certain areas. These limitations underscore the need for further research, including larger expert panels and clinical trials, to support the standardization of muscle strength and physical performance assessments in sarcopenia.

In conclusion, despite updates to sarcopenia guidelines and numerous studies on muscle strength and physical performance assessments, measurement variability persists due to differences in tools and protocols. The results of this Delphi study highlight the ongoing need for standardization—while some components have been clarified, and inconsistencies remain. Accurate and consistent assessments are critical for a reliable diagnosis, risk identification, and outcome evaluation. However, the efforts to improve precision must be balanced with practicality to ensure accessibility and ease of use. In addition to technical issues such as calibration and cutoff values, the practical challenges—particularly variability across clinical and community settings—must be addressed. Continued efforts are warranted to standardize assessment protocols and enhance sarcopenia diagnosis and management.

Notes

The authors would like to thank all the participants for their valuable contributions to this Delphi study.

CONFLICT OF INTEREST

The authors declare no conflicts of interest.

FUNDING

This research was supported by a grant of Patient-Centered Clinical Research Coordinating Center (PACEN) funded by the Ministry of Health & Welfare, Republic of Korea (Grant No. RS-2020-KH095863).

AUTHOR CONTRIBUTIONS

Conceptualization, SKL, JYL; Formal analysis, SKL; Funding acquisition, JYL; Investigation & validation, SKL, JWB, SYL, KHM, JYL, SEB, YHC, JHC, JYC, JYH, HCJ, HWJ, KIK, YJK, YSK, JHL, JIL, SYL, KBL, BJO, SJP, GYS, WS, CWW, JIY, SDY; Methodology, SKL, JWB, SYL, KHM, JYL; Project administration, JYL; Supervision, SKL, JYL; Writing—original draft, SKL; Writing—review & editing, SKL, JWB, SYL, KHM, JYL. All authors read and agreed to the published version of the manuscript.

SUPPLEMENTARY MATERIALS

Supplementary materials can be found via https://doi.org/10.4235/agmr.25.0070.

Supplement A.

Delphi Round 1 Questionnaire

agmr-25-0070-Supplement-A.pdf
Supplement B.

Delphi Rount 2 Questionnaire

agmr-25-0070-Supplement-B.pdf

References

1. Kirk B, Cawthon PM, Arai H, Avila-Funes JA, Barazzoni R, Bhasin S, et al. The conceptual definition of sarcopenia: Delphi consensus from the Global Leadership Initiative in Sarcopenia (GLIS). Age Ageing 2024;53:afae052. 10.1093/ageing/afae052. 38520141.
2. Cruz-Jentoft AJ, Sayer AA. Sarcopenia. Lancet 2019;393:2636–46. 10.1016/s0140-6736(19)31138-9. 31171417.
3. Chen LK, Woo J, Assantachai P, Auyeung TW, Chou MY, Iijima K, et al. Asian Working Group for Sarcopenia: 2019 consensus update on sarcopenia diagnosis and treatment. J Am Med Dir Assoc 2020;21:300–7. 10.1016/j.jamda.2019.12.012. 32033882.
4. Cruz-Jentoft AJ, Bahat G, Bauer J, Boirie Y, Bruyere O, Cederholm T, et al. Sarcopenia: revised European consensus on definition and diagnosis. Age Ageing 2019;48:16–31. 10.1093/ageing/afz046. 30312372.
5. Baek JY, Jung HW, Kim KM, Kim M, Park CY, Lee KP, et al. Korean Working Group on Sarcopenia Guideline: expert consensus on sarcopenia screening and diagnosis by the Korean Society of Sarcopenia, the Korean Society for Bone and Mineral Research, and the Korean Geriatrics Society. Ann Geriatr Med Res 2023;27:9–21. 10.4235/agmr.23.0009. 36958807.
6. Hull H, He Q, Thornton J, Javed F, Allen L, Wang J, et al. iDXA, Prodigy, and DPXL dual-energy X-ray absorptiometry whole-body scans: a cross-calibration study. J Clin Densitom 2009;12:95–102. 10.1016/j.jocd.2008.09.004. 19028125.
7. Piccoli A, Rossi B, Pillon L, Bucciante G. A new method for monitoring body fluid variation by bioimpedance analysis: the RXc graph. Kidney Int 1994;46:534–9. 10.1038/ki.1994.305. 7967368.
8. Amaral JF, Mancini M, Novo Junior JM. Comparison of three hand dynamometers in relation to the accuracy and precision of the measurements. Rev Bras Fisioter 2012;16:216–24. 10.1590/s1413-35552012000300007. 22801514.
9. Kim M, Shinkai S. Prevalence of muscle weakness based on different diagnostic criteria in community-dwelling older adults: a comparison of grip strength dynamometers. Geriatr Gerontol Int 2017;17:2089–95. 10.1111/ggi.13027. 28517036.
10. Guerra RS, Amaral TF. Comparison of hand dynamometers in elderly people. J Nutr Health Aging 2009;13:907–12. 10.1007/s12603-009-0250-3. 19924352.
11. Nunez-Cortes R, Cruz BD, Gallardo-Gomez D, Calatayud J, Cruz-Montecinos C, Lopez-Gil JF, et al. Handgrip strength measurement protocols for all-cause and cause-specific mortality outcomes in more than 3 million participants: a systematic review and meta-regression analysis. Clin Nutr 2022;41:2473–89. 10.1016/j.clnu.2022.09.006. 36215867.
12. Roberts HC, Denison HJ, Martin HJ, Patel HP, Syddall H, Cooper C, et al. A review of the measurement of grip strength in clinical and epidemiological studies: towards a standardised approach. Age Ageing 2011;40:423–9. 10.1093/ageing/afr051. 21624928.
13. MacDermid J, Solomon G, Valdes K. Clinical assessment recommendations 3rd edth ed. Mount Laurel, NJ: American Society of Hand Therapists; 2015.
14. National Center for Health Statistics. National Health and Nutrition Examination Survey (NHANES): muscle strength procedures manual [Internet]. 2011. [cited 2025 Jun 17]. Available from: https://wwwn.cdc.gov/nchs/data/nhanes/public/2011/manuals/Muscle_Strength_Proc_Manual.pdf.
15. Mathiowetz V. Effects of three trials on grip and pinch strength measurements. J Hand Ther 1990;3:195–8. 10.1016/s0894-1130(12)80377-2.
16. Junger S, Payne SA, Brine J, Radbruch L, Brearley SG. Guidance on Conducting and REporting DElphi Studies (CREDES) in palliative care: recommendations based on a methodological systematic review. Palliat Med 2017;31:684–706. 10.1177/0269216317690685. 28190381.
17. Nasa P, Jain R, Juneja D. Delphi methodology in healthcare research: how to decide its appropriateness. World J Methodol 2021;11:116–29. 10.5662/wjm.v11.i4.116. 34322364.
18. Lawshe CH. A quantitative approach to content validity. Pers Psychol 1975;28:563–75. 10.1111/j.1744-6570.1975.tb01393.x.
19. Ha YC, Yoo JI, Park YJ, Lee CH, Park KS. Measurement of uncertainty using standardized protocol of hand grip strength measurement in patients with sarcopenia. J Bone Metab 2018;25:243–9. 10.11005/jbm.2018.25.4.243. 30574469.
20. Mathiowetz V, Weber K, Volland G, Kashman N. Reliability and validity of grip and pinch strength evaluations. J Hand Surg Am 1984;9:222–6. 10.1016/s0363-5023(84)80146-x. 6715829.
21. Mehmet H, Yang AW, Robinson SR. Measurement of hand grip strength in the elderly: a scoping review with recommendations. J Bodyw Mov Ther 2020;24:235–43. 10.1016/j.jbmt.2019.05.029.
22. Ha YC, Hwang SC, Song SY, Lee C, Park KS, Yoo JI. Hand grip strength measurement in different epidemiologic studies using various methods for diagnosis of sarcopenia: a systematic review. Eur Geriatr Med 2018;9:277–88. 10.1007/s41999-018-0050-6. 34654240.
23. Bohannon RW. Grip strength: a summary of studies comparing dominant and nondominant limb measurements. Percept Mot Skills 2003;96(3 Pt 1):728–30. 10.2466/pms.2003.96.3.728. 12831245.
24. Crosby CA, Wehbe MA, Mawr B. Hand strength: normative values. J Hand Surg Am 1994;19:665–70. 10.1016/0363-5023(94)90280-1. 7963331.
25. Lee SY, Jin H, Arai H, Lim JY. Handgrip strength: should repeated measurements be performed in both hands? Geriatr Gerontol Int 2021;21:426–32. 10.1111/ggi.14146. 33709458.
26. Korea Disease Control and Prevention Agency. Guidebook for Korea National Health and Nutrition Examination Survey database [Internet]. Osong, Korea: Korea Disease Control and Prevention Agency; 2021. [cited 2025 Apr 1] Available from: https://www.data.go.kr/data/15076556/fileData.do#.
27. Smith DA, Lukens SA. Stress effects of isometric contraction in occupational therapy. Occup Ther J Res 1983;3:222–42. 10.1177/153944928300300404.
28. Kent-Braun JA. Skeletal muscle fatigue in old age: whose advantage? Exerc Sport Sci Rev 2009;37:3–9. 10.1097/JES.0b013e318190ea2e. 19098518.
29. Durall CJ, Sawhney R. Strength. In : Huber FE, Wells CL, eds. Therapeutic exercise: treatment planning for progression St. Louis, MO: W.B. Saunders; 2006. p. 96–125.
30. Trossman PB, Li PW. The effect of the duration of intertrial rest periods on isometric grip strength performance in young adults. Occup Ther J Res 1989;9:362–78. 10.1177/153944928900900604.
31. Grgic J, Schoenfeld BJ, Skrepnik M, Davies TB, Mikulic P. Effects of rest interval duration in resistance training on measures of muscular strength: a systematic review. Sports Med 2018;48:137–51. 10.1007/s40279-017-0788-x. 28933024.
32. Rose Berlin Piodena-Aportadera M, Lau S, Chew J, Lim JP, Ismail NH, Ding YY, et al. Calf circumference measurement protocols for sarcopenia screening: differences in agreement, convergent validity and diagnostic performance. Ann Geriatr Med Res 2022;26:215–24. 10.4235/agmr.22.0057. 36031936.
33. Mehmet H, Robinson SR, Yang AW. Assessment of gait speed in older adults. J Geriatr Phys Ther 2020;43:42–52. 10.1519/jpt.0000000000000224. 30720555.
34. Maggio M, Ceda GP, Ticinesi A, De Vita F, Gelmini G, Costantino C, et al. Instrumental and non-instrumental evaluation of 4-meter walking speed in older individuals. PLoS One 2016;11e0153583. 10.1371/journal.pone.0153583. 27077744.
35. Kim M, Won CW. Combinations of gait speed testing protocols (automatic vs manual timer, dynamic vs static start) can significantly influence the prevalence of slowness: results from the Korean frailty and aging cohort study. Arch Gerontol Geriatr 2019;81:215–21. 10.1016/j.archger.2018.12.009. 30623866.
36. Oh SL, Kim DY, Bae JH, Jung H, Lim JY. Comparison of the use of a manual stopwatch and an automatic instrument for measuring 4-m gait speed at the usual walking pace with different starting protocols in older adults. Eur Geriatr Med 2019;10:747–52. 10.1007/s41999-019-00210-3. 34652694.
37. Enright PL, McBurnie MA, Bittner V, Tracy RP, McNamara R, Arnold A, et al. The 6-min walk test: a quick measure of functional status in elderly adults. Chest 2003;123:387–98. 10.1097/01823246-200314020-00019. 12576356.
38. Bennell K, Dobson F, Hinman R. Measures of physical performance assessments: Self-Paced Walk Test (SPWT), Stair Climb Test (SCT), Six-Minute Walk Test (6MWT), Chair Stand Test (CST), Timed Up & Go (TUG), Sock Test, Lift and Carry Test (LCT), and car task. Arthritis Care Res (Hoboken) 2011;63 Suppl 11:S350–70. 10.1002/acr.20538. 22588756.
39. Mehmet H, Yang AW, Robinson SR. What is the optimal chair stand test protocol for older adults?: a systematic review. Disabil Rehabil 2020;42:2828–35. 10.1080/09638288.2019.1575922. 30907166.
40. Kamide N, Takahashi K, Shiba Y. Reference values for the Timed Up and Go test in healthy Japanese elderly people: determination using the methodology of meta-analysis. Geriatr Gerontol Int 2011;11:445–51. 10.1111/j.1447-0594.2011.00704.x. 21554510.
41. Munoz-Bermejo L, Adsuar JC, Mendoza-Munoz M, Barrios-Fernandez S, Garcia-Gordillo MA, Perez-Gomez J, et al. Test-retest reliability of Five Times Sit to Stand Test (FTSST) in adults: a systematic review and meta-analysis. Biology (Basel) 2021;10:510. 10.3390/biology10060510. 34207604.
42. Furlanetto KC, Correia NS, Mesquita R, Morita AA, do Amaral DP, Mont’Alverne DG, et al. Reference values for 7 different protocols of simple functional tests: a multicenter study. Arch Phys Med Rehabil 2022;103:20–8. 10.1016/j.apmr.2021.08.009. 34516997.
43. Siggeirsdottir K, Jonsson BY, Jonsson H Jr, Iwarsson S. The timed ‘Up & Go’ is dependent on chair type. Clin Rehabil 2002;16:609–16. 10.1191/0269215502cr529oa. 12392335.
44. Ng SS, Cheung SY, Lai LS, Liu AS, Ieong SH, Fong SS. Five Times Sit-To-Stand test completion times among older women: Influence of seat height and arm position. J Rehabil Med 2015;47:262–6. 10.2340/16501977-1915. 25437142.
45. Kuo YL. The influence of chair seat height on the performance of community-dwelling older adults’ 30-second chair stand test. Aging Clin Exp Res 2013;25:305–9. 10.1007/s40520-013-0041-x. 23740582.
46. Korean Agency for Technology and Standards. tSKD [Internet]. Seoul, Korea: Size Korea; 2023 [cited 2023 Jun 6]. Available from: https://sizekorea.kr/human-info/meas-report?measDegree=8.

Article information Continued

Table 1.

Demographic information of the Delphi experts

Characteristic n (%)
Age (y)
 30–39 4 (15.4)
 40–49 12 (46.2)
 50–59 8 (30.8)
 ≥60 2 (7.7)
Area of expertise
 Rehabilitation medicine 14 (53.8)
 Orthopedic surgery 3 (11.5)
 Geriatric medicine 3 (11.5)
 Endocrine medicine 1 (3.8)
 Family medicine 3 (11.5)
 Exercise physiology 2 (7.7)
Work experience (y)
 <5 2 (7.7)
 5–9 4 (15.4)
 10–14 8 (30.8)
 15–19 3 (11.5)
 >20 9 (34.6)

Table 2.

Results of Rounds 1 and 2 Delphi survey: Likert-scale and multiple-choice items

No Statements Round 1
Round 2
Mean±SD Consensus CVR IQR Outcome Mean±SD Consensus CVR IQR Outcome
1 Is further standardization needed for handgrip strength measurement and physical performance assessment as presented in the AWGS and KWGS guidelines? 7.3±1.9 0.75 0.69 2.0 Consensus agreement - - - - -
2 Which measurement items do you think require further standardization or consensus? (Multiple selections allowed) - Handgrip strength (75%), gait speed (62.5%) Chair stand test (50%), calf circumference, 400-m walk test (37.5%), SPPB (29.22%) - - - - -
5 Handgrip strength measurement using the hydraulic hand dynamometer (Jamar) is appropriate to follow the existing protocol; otherwise, additional standardization is needed. 7.2±1.6 0.875 0.69 1.0 Consensus agreement - - - - -
6 Handgrip strength measurement using the mechanical hand dynamometer (Smedley) is appropriate to follow the existing protocol; otherwise, additional standardization is needed. 6.4±2.0 0.571 0.44 3.0 No consensus 7.9±1.0 0.75 0.867 2.0 Consensus agreement
13 The protocols of the AWGS and KWGS guidelines are appropriate for calf circumference measurement. 6.5±1.8 0.679 0.385 2.25 No consensus 7.5±1.6 0.750 0.733 2.0 Consensus agreement
25 The protocol of the KWGS guidelines is appropriate for the 400-m walk test. 6.7±1.5 0.964 0.62 0.25 Consensus agreement - - - - -
27 The KWGS guideline protocol is appropriate for measuring the chair stand test (30 s/5 reps). 7.3±1.3 0.875 0.77 1.0 Consensus agreement - - - - -
28 It is appropriate to follow the KWGS guidelines for the cutoff value of the chair stand test (30 s/5 reps). 7.0±1.3 0.857 0.62 1.0 Consensus agreement - - - - -
33 The protocol of the KWGS guidelines is appropriate for the TUG test. 7.0±1.5 0.857 0.69 1.0 Consensus agreement - - - - -

AWGS, Asian Working Group on Sarcopenia; KWGS, Korean Working Group on Sarcopenia; SPPB, Short Physical Performance Battery; TUG, timed up-and-go; CVR, content validity ratio; IQR, interquartile range.

Table 3.

Results of Rounds 1 and 2 Delphi survey on muscle strength and physical performance

No Statements Round 1
Round 2
Agreement (%) Outcome Reasons for no consensus Agreement (%) Outcome Reasons for no consensus
Handgrip strength
3 What is the appropriate dynamometer to be used for handgrip strength measurement? 80.8 Consensus agreement
- Both hydraulic and mechanical types are acceptable
4 Is it appropriate to apply the current cutoff values uniformly across all types of hand dynamometers? If not, what methods are needed? 69.2 No consensus A breakdown of cutoff values for different types of hand dynamometers is necessary (23.1%) 71.4 No consensus Set separate cutoff values for each hand dynamometer (28.6%)
- Calibration of measurements between different hand dynamometers is required
7 What is the correct posture for measuring handgrip strength using a hydraulic hand dynamometer (e.g., Jamar type)? 68.0 No consensus Both positions are possible (20%) 76.2 Consensus agreement
- Sitting position
8 What is the correct posture for measuring handgrip strength using a mechanical hand dynamometer (e.g., Smedley type)? 60.0 No consensus Sitting (20%), Both positions are possible (20%) 61.9 No consensus Both positions are possible (28.6%)
- Standing position
9 What is the appropriate number of repetitions for handgrip strength measurement (bilateral measurement - 1 trial)? 52.0 No consensus 3 times (44%) 71.4 No consensus 3 times (28.6%)
- 2 times
10 What is the appropriate value for handgrip strength measurement? 65.4 No consensus Maximum value of the dominant hand (11.5%) 90.5 Consensus agreement
- Maximum value of all measurements
11 What is the appropriate measurement time for using the hand dynamometer? 65.4 No consensus 2 seconds (15.4%) 76.2 Consensus agreement
- At least 3 seconds (less than 5 seconds)
12 What is the appropriate recovery interval between measurements when assessing handgrip strength in the same hand? 42.3 No consensus 60 seconds (19.2%), More than 60 seconds (23.1%) 57.1 No consensus At least 60 seconds (<2 minutes) (38.1%)
- At least 30 seconds (<60 seconds)
Calf circumference
14 What is the appropriate posture for measuring calf circumference? 60.0 No consensus Sitting position (28%) 90.5 Consensus agreement
- Standing position
15 Which leg should be measured for calf circumference? 68.0 No consensus Dominant foot side (16%), Any side of the two legs randomly chosen (12%) 95.2 Consensus agreement
- Both sides
16 What is the appropriate value for calf circumference measurement? 34.6 No consensus Maximum calf circumference measured on both sides in a sitting position (26.9%) 90.5 Consensus agreement
- Maximum calf circumference measured on both sides in a standing position
Gait speed
17 What is the appropriate distance for measuring gait speed (excluding acceleration and deceleration zones)? 61.5 No consensus 6 m (30.8%) 76.2 Consensus agreement
- 4 m
18 Is an acceleration/deceleration zone necessary for gait speed measurement? If so, what is the minimum required length? 65.4 No consensus 1.5 m (19.2%) 71.4 No consensus Either 1 m or 1.5 m can be applied (19.0%)
- 1 m
19 What is the appropriate speed for measuring gait speed? 69.2 No consensus Maximum possible speed (15.4%) 90.5 Consensus agreement
- Usual speed
20 How many trials are appropriate for measuring gait speed? 76.9 Consensus agreement
- 2 times
21 What is the appropriate value for gait speed measurement? 64.0 No consensus Maximum value of all measurements (36.0%) 66.7 No consensus Maximum value of all measurements (33.3%)
- Mean value of all measurements
22 Is the use of walking aids permitted during gait speed measurement? 88.5 Consensus agreement
- It is allowed
23 What is an appropriate measuring device for gait speed assessment? 80.7 Consensus agreement
- Both devices are possible
24 Are the results from different measurement methods (e.g., manual stopwatch vs. automated device such as accelerometer) interchangeable? 53.9 No consensus Yes (42.9%) 57.1 No consensus Yes (42.9%)
- No, calibration is required
400-m walk test
26 What is the appropriate speed for the 400-m walk test? 57.7 No consensus Usual speed (42.3%) 76.2 Consensus agreement
- Maximum possible speed
Chair stand test
29 What is the appropriate type of chair for the chair stand test? 65.3 No consensus A straight back chair with armrests (19.2%) 95.2 Consensus agreement
- A straight back chair without armrests
30 What is the appropriate end position for the 5-repetition chair stand test? 64.0 No consensus Standing position (36.0%) 66.7 No consensus Standing position (23.8%)
- Sitting position
31 What is the appropriate measurement value for the chair stand test? 69.2 No consensus Mean value of all measurements (30.8%) 81.0 Consensus agreement
- The fastest value from all measurements of the 5-repetition test or the maximum value from the 30-second test
32 What is the appropriate seat height for the chair used in the chair stand test? 34.6 No consensus Seat height corresponding to the mean sitting popliteal height of Koreans + 3 cm (mean shoe heel height) (30.8%)
- Seat height adjusted to the individual subject's sitting popliteal height + 3 cm (mean shoe heel height)
32 What is the appropriate seat height for the chair used in the chair stand test? 52.4 No consensus Seat height adjusted to the individual subject's sitting popliteal height + 3 cm (mean shoe heel height) (33.3%)
- Seat height corresponding to the mean sitting popliteal height of Koreans + 3 cm (mean shoe heel height)
TUG test
34 What is the appropriate type of chair for the TUG test? 64.0 No consensus A straight back chair with armrests (28.0%) 90.5 Consensus agreement
- A straight back chair without armrests
35 What is the appropriate speed for the TUG test? 52.0 No consensus Usual speed (48.0%) 71.4 No consensus Usual speed (28.6%)
- Maximum possible speed
36 What is the appropriate seat height for the chair used in the TUG test? 28.0 No consensus Seat height adjusted to the individual subject's sitting popliteal height + 3 cm (mean shoe heel height) (28.0%) 52.4 No consensus Seat height adjusted to the individual subject's sitting popliteal height + 3 cm (mean shoe heel height) (28.6%)
- Seat height corresponding to the mean sitting popliteal height of Koreans + 3 cm (mean shoe heel height)
37 What is the appropriate armrest height for the chair used in the TUG test or chair stand test? 68.0 No consensus Armrest height corresponding to the mean sitting elbow height of Koreans (20%) 85.7 Consensus agreement
- Armrests are not necessary
38 How many trials are appropriate for the TUG test? 64.0 No consensus 3 times (28%) 100.0 Consensus agreement
- 2 times
39 What is the appropriate measurement value for the TUG test? 58.3 No consensus Mean value of all measurements (41.7%) 85.7 Consensus agreement
- Maximum value of all measurements

TUG, timed up-and-go.

Table 4.

List of accepted statements from the Delphi study

No. Statements Agreement (%)
General aspects of standardizing physical performance assessments
1 Further standardization of handgrip strength measurement and physical performance assessment, as outlined in the AWGS and KWGS guidelines, is needed. 75.0
Handgrip strength
2 Both hydraulic and mechanical dynamometers are appropriate for handgrip strength measurement. 80.8
3 Handgrip strength measurement using the hydraulic hand dynamometer (e.g., Jamar) is appropriate to follow the existing protocol. 87.5
4 Handgrip strength measurement using the mechanical hand dynamometer (e.g., Smedley) is appropriate to follow the existing protocol. 75.0
5 Measurements using a hydraulic hand dynamometer (e.g., Jamar) should be taken in a sitting position. 76.2
6 Handgrip strength is measured using the maximum value from both hands. 90.5
7 Handgrip strength should be measured for at least 3 seconds but no more than 5 seconds. 76.2
Calf circumference
8 It is appropriate to apply the AWGS protocol and cutoff values for calf circumference measurement. 75.0
9 Calf circumference should be measured in a standing position. 90.5
10 Calf circumference should be measured on both sides. 95.2
11 The calf circumference measurement should be the maximum value of the calf circumference measured on both sides in a standing position. 90.5
Gait speed
12 A 4-meter distance, excluding acceleration and deceleration phases, is appropriate for measuring gait speed. 76.2
13 Gait speed should be measured at a usual pace. 90.5
14 Gait speed measurement should be performed twice. 76.9
15 The use of assistive devices is allowed during gait speed measurement. 88.5
16 Both manual and automated devices can be used for gait speed measurement. 80.7
400-m walk test
17 It is appropriate to follow the KWGS guideline for the 400-meter walk test. 96.4
18 The 400-meter walk test should be performed at the fastest possible pace. 76.2
Chair stand test
19 It is appropriate to apply the KWGS guideline for the chair stand test (30 s/5 reps). 87.5
20 It is appropriate to apply the KWGS guideline for the cut-off value of the chair stand test (30 s/5 reps). 85.7
21 A straight-back chair without armrests is used for the chair stand test. 95.2
22 The fastest value from both trials (for the 5-repetition test) or the maximum count (for the 30-second test) is used as the test result. 81.0
TUG test
23 It is appropriate to apply the KWGS guideline for the TUG Test. 85.7
24 A straight-back chair without armrests is used for the TUG test. 90.5
25 Armrests are not necessary for the chair used in the TUG test or Chair Stand Test. 85.7
26 The TUG test is performed twice. 100.0
27 The fastest time recorded among the trials is used as the TUG test result. 85.7

AWGS, Asian Working Group on Sarcopenia; KWGS, Korean Working Group on Sarcopenia; TUG, timed up-and-go.

Table 5.

List of rejected statements from the Delphi study and the most selected responses

No. Statements Agreement (%)
Handgrip strength
1 What is the appropriate cutoff value when measuring handgrip strength using the two commonly used dynamometers (hydraulic type and mechanical type)? 71.4
- Calibration of measurements between different hand dynamometers is required
2 What is the correct posture for measuring handgrip strength using a mechanical hand dynamometer (e.g., Smedley type)? 61.9
- Standing position
3 What is the appropriate number of repetitions for handgrip strength measurement (bilateral measurement - 1 trial)? 71.4
- 2 times
4 What is the appropriate recovery interval between measurements when assessing handgrip strength in the same hand? 57.1
- At least 30 seconds (<60 seconds)
Gait speed
5 Is an acceleration/deceleration zone necessary for gait speed measurement? If so, what is the minimum required length? 71.4
- 1 m
6 What is the appropriate value for gait speed measurement? 66.7
- Mean value of all measurements
7 Are the results from different measurement methods (e.g., manual stopwatch vs. automated device such as accelerometer) interchangeable? 57.1
- No, calibration is required
Chair stand test
8 What is the appropriate end position for the 5-repetition chair stand test? 66.7
- Sitting position
9 What is the appropriate seat height for the chair used in the chair stand test? 52.4
- Seat height corresponding to the mean sitting popliteal height of Koreans + 3 cm (mean shoe heel height)
TUG test
10 What is the appropriate speed for the TUG test? 71.4
- Maximum possible speed
11 What is the appropriate seat height for the chair used in the TUG test? 52.4
- Seat height corresponding to the mean sitting popliteal height of Koreans + 3 cm (mean shoe heel height)

TUG, timed up-and-go.

Table 6.

Summary of the proposed standardized measurement protocols for muscle strength and physical performance

Protocol items Characteristic
Handgrip strength
 Type of dynamometer Hydraulic (Jamar) and mechanical (Smedley) dynamometers
 Laterality of the tested hand Both hands
 Body position
  Hydraulic Seated with the elbow flexed at 90°, forearm supported, and arm in a neutral position
  Mechanical Standinga) with feet shoulder-width apart, elbow fully extended, and forearm in a neutral position/sitting, if standing is not feasible.
 Hand-adjustment
  Hydraulic Second handle position
  Mechanical 90° flexion at the second joint of index finger of each hand
 Repetitions Two (three) trials a)
 Duration of grip At least 3 but no more than 5 seconds
 Value estimation Maximum value
 Recovery intervals 30–60 seconds (At least 60 seconds)a)
 Verbal encouragement Yes
 Cutoff value M: <28 kg, F: <18 kg b)
Calf circumference
 Equipment A non-elastic tape
 Body position Standing with feet shoulder-width apart to evenly distribute body weight
 Laterality of the tested leg Both lower legs
 Measurement At the widest part of the calf, the tape was applied snugly, flat against the skin, and parallel to the floor, without compressing the muscle.
 Value estimation Maximum value
 Cutoff value M: <34 cm, F: <33 cm
Gait speed
 Equipment Manual and automated timing devices
 Walking distance 4 m excluding acceleration and deceleration phases
 Acceleration/deceleration phases 1 m (1 m or 1.5 m)a)
 Type of start Dynamic (moving) start
 Walking pace Usual pace
 Repetitions Two trials
 Value estimation Mean (maximum) valuea)
 Recovery intervals 60 seconds
 Use of assistive devices Permitted
 Cutoff value <1 m/sc)
400-m walk test
 Measurement Walking 20 laps of 20 m along a marked straight corridor following a 2-minute warm-up
 Walking pace Fastest possible pace without running
 Cutoff value Non-completion or ≥6 minutes for completion
 Rest Allowed without pausing the timer
Chair stand test
 Equipment A straight-back chair without armrests
 Seat height Based on mean Korean popliteal height or individual popliteal height plus shoe height (43–46 cm)a)
 Body position Seated on a chair, feet flat on the floor at least hip-width apart, arms crossed over the chest, and wearing shoes
 Start/end position Sitting/Sitting or standing
 Repetitions Two trials
 Value estimation 5-repetition test: fastest value
30-second test: maximum count
 Recovery intervals 60 seconds
 Cutoff value 5-repetition test: >10 seconds (standing end position), >11 seconds (sitting end position)
30-second test: M <17 repetitions, F <15 repetitions
TUG test
 Equipment A straight-back chair without armrests
 Seat height Based on mean Korean popliteal height or individual popliteal height plus shoe height (43–46 cm)a)
 Measurement Rise from a chair, walk 3 m to a floor mark, turn, walk back, and sit down
 Walking pace Maximum possible pace (usual or maximum pace)a)
 Repetitions Two trials
 Value estimation Fastest value
 Cutoff value ≥12 seconds

TUG, timed up-and-go.

a)

Indicates statements without consensus and the most selected responses; ( ) denotes protocols from other major guidelines.

b)

Interchangeability between different dynamometers is not permitted.

c)

Interchangeability between different devices is not determined.