Sample size and power estimation for studies with health related quality of life outcomes: a comparison of four methods using the SF-36
Open Access
- 1 January 2004
- journal article
- Published by Springer Nature in Health and Quality of Life Outcomes
- Vol. 2 (1), 26
- https://doi.org/10.1186/1477-7525-2-26
Abstract
We describe and compare four different methods for estimating sample size and power, when the primary outcome of the study is a Health Related Quality of Life (HRQoL) measure. These methods are: 1. assuming a Normal distribution and comparing two means; 2. using a non-parametric method; 3. Whitehead's method based on the proportional odds model; 4. the bootstrap. We illustrate the various methods, using data from the SF-36. For simplicity this paper deals with studies designed to compare the effectiveness (or superiority) of a new treatment compared to a standard treatment at a single point in time. The results show that if the HRQoL outcome has a limited number of discrete values (< 7) and/or the expected proportion of cases at the boundaries is high (scoring 0 or 100), then we would recommend using Whitehead's method (Method 3). Alternatively, if the HRQoL outcome has a large number of distinct values and the proportion at the boundaries is low, then we would recommend using Method 1. If a pilot or historical dataset is readily available (to estimate the shape of the distribution) then bootstrap simulation (Method 4) based on this data will provide a more accurate and reliable sample size estimate than conventional methods (Methods 1, 2, or 3). In the absence of a reliable pilot set, bootstrapping is not appropriate and conventional methods of sample size estimation or simulation will need to be used. Fortunately, with the increasing use of HRQoL outcomes in research, historical datasets are becoming more readily available. Strictly speaking, our results and conclusions only apply to the SF-36 outcome measure. Further empirical work is required to see whether these results hold true for other HRQoL outcomes. However, the SF-36 has many features in common with other HRQoL outcomes: multi-dimensional, ordinal or discrete response categories with upper and lower bounds, and skewed distributions, so therefore, we believe these results and conclusions using the SF-36 will be appropriate for other HRQoL measures.Keywords
This publication has 16 references indexed in Scilit:
- Robustness and power of analysis of covariance applied to ordinal scaled data as arising in randomized controlled trialsStatistics in Medicine, 2003
- DESIGN AND ANALYSIS OF TRIALS WITH QUALITY OF LIFE AS AN OUTCOME: A PRACTICAL GUIDEJournal of Biopharmaceutical Statistics, 2001
- Costs and effectiveness of community postnatal support workers: randomised controlled trialBMJ, 2000
- Sample sizes for randomized trials measuring quality of life in cancer patientsQuality of Life Research, 1997
- ROBUSTNESS AND POWER OF ANALYSIS OF COVARIANCE APPLIED TO DATA DISTORTED FROM NORMALITY BY FLOOR EFFECTS: HOMOGENEOUS REGRESSION SLOPESStatistics in Medicine, 1996
- Sample size calculations for ordered categorical dataStatistics in Medicine, 1993
- Validating the SF-36 health survey questionnaire: new outcome measure for primary care.BMJ, 1992
- The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection.1992
- Estimating the Power of the Two-Sample Wilcoxon Test for Location ShiftBiometrics, 1988
- Robustness of the two independent samples t‐test when applied to ordinal scaled dataStatistics in Medicine, 1987