Development and Psychometric Evaluation of the Flexilevel Scale of Shoulder Function
- 1 July 2003
- journal article
- clinical trial
- Published by Wolters Kluwer Health in Medical Care
- Vol. 41 (7), 823-835
- https://doi.org/10.1097/00005650-200307000-00006
Abstract
Existing measures of self-reported shoulder function fail to measure effectively the full range of shoulder functioning. The classic approach for improving the reliability of a scale is adding items, but a scale with a substantial number of items imposes a large response burden on participants. A more efficient approach is to use modern psychometric methods to construct an adaptive scale in which patients respond only to items that are targeted at their level of shoulder function. We developed a Flexilevel Scale of Shoulder Function (FLEX-SF). This scale includes three testlets that target low, medium, and high shoulder function. Scores on the testlets were equated to a common mathematical metric. We developed an initial pool of 68 items. This pool was administered to 400 patients, and responses were calibrated using a rating scale model. Subsets of items were identified for an easy, medium difficulty, and hard testlet. Properties of the scale were evaluated in a 3-month longitudinal study of 200 shoulder patients. The FLEX-SF exhibited high reliability at both the scale level (intraclass correlation coefficient [3,1] = 0.90) and specific trait levels. The validity of the FLEX-SF was supported by its internal and external responsiveness (Guyatt responsiveness index = 1.12) and the pattern of its associations with other health status measures. The FLEX-SF can be used as a primary endpoint in clinical trials even when there are relatively few people in each treatment group. The scale also has excellent properties for use in clinical settings tracking individual changes over time.Keywords
This publication has 17 references indexed in Scilit:
- The measurement level and trait-specific reliability of 4 scales of shoulder functioning: An empiric investigationArchives of Physical Medicine and Rehabilitation, 2001
- Response to Hays et al and McHorney and Cohen: Practical Implications of Item Response Theory and Computerized Adaptive TestingMedical Care, 2000
- Generic Health Measurement: Past Accomplishments and a Measurement Paradigm for the 21st CenturyAnnals of Internal Medicine, 1997
- Health status assessment for the twenty-first century: item response theory, item banking and computer adaptive testingQuality of Life Research, 1997
- A 12-Item Short-Form Health SurveyMedical Care, 1996
- A standardized method for the assessment of shoulder functionJournal of Shoulder and Elbow Surgery, 1994
- Adaptive Designs for Likert-Type Data: An Approach for Implementing Marketing SurveysJournal of Marketing Research, 1990
- A rating formulation for ordered response categoriesPsychometrika, 1978
- A Theoretical Study of the Measurement Effectiveness of Flexilevel TestsEducational and Psychological Measurement, 1971
- THE SELF‐SCORING FLEXILEVEL TEST1Journal of Educational Measurement, 1971