Sample size and power estimation for studies with health related quality of life outcomes: a comparison of four methods using the SF-36

Open Access

1 January 2004

journal article
Published by Springer Nature in Health and Quality of Life Outcomes

Vol. 2 (1), 26
https://doi.org/10.1186/1477-7525-2-26

Abstract

We describe and compare four different methods for estimating sample size and power, when the primary outcome of the study is a Health Related Quality of Life (HRQoL) measure. These methods are: 1. assuming a Normal distribution and comparing two means; 2. using a non-parametric method; 3. Whitehead's method based on the proportional odds model; 4. the bootstrap. We illustrate the various methods, using data from the SF-36. For simplicity this paper deals with studies designed to compare the effectiveness (or superiority) of a new treatment compared to a standard treatment at a single point in time. The results show that if the HRQoL outcome has a limited number of discrete values (< 7) and/or the expected proportion of cases at the boundaries is high (scoring 0 or 100), then we would recommend using Whitehead's method (Method 3). Alternatively, if the HRQoL outcome has a large number of distinct values and the proportion at the boundaries is low, then we would recommend using Method 1. If a pilot or historical dataset is readily available (to estimate the shape of the distribution) then bootstrap simulation (Method 4) based on this data will provide a more accurate and reliable sample size estimate than conventional methods (Methods 1, 2, or 3). In the absence of a reliable pilot set, bootstrapping is not appropriate and conventional methods of sample size estimation or simulation will need to be used. Fortunately, with the increasing use of HRQoL outcomes in research, historical datasets are becoming more readily available. Strictly speaking, our results and conclusions only apply to the SF-36 outcome measure. Further empirical work is required to see whether these results hold true for other HRQoL outcomes. However, the SF-36 has many features in common with other HRQoL outcomes: multi-dimensional, ordinal or discrete response categories with upper and lower bounds, and skewed distributions, so therefore, we believe these results and conclusions using the SF-36 will be appropriate for other HRQoL measures.

Keywords

This publication has 16 references indexed in Scilit:

Robustness and power of analysis of covariance applied to ordinal scaled data as arising in randomized controlled trials
Statistics in Medicine, 2003
DESIGN AND ANALYSIS OF TRIALS WITH QUALITY OF LIFE AS AN OUTCOME: A PRACTICAL GUIDE
Journal of Biopharmaceutical Statistics, 2001
Costs and effectiveness of community postnatal support workers: randomised controlled trial
BMJ, 2000
Sample sizes for randomized trials measuring quality of life in cancer patients
Quality of Life Research, 1997
ROBUSTNESS AND POWER OF ANALYSIS OF COVARIANCE APPLIED TO DATA DISTORTED FROM NORMALITY BY FLOOR EFFECTS: HOMOGENEOUS REGRESSION SLOPES
Statistics in Medicine, 1996
Sample size calculations for ordered categorical data
Statistics in Medicine, 1993
Validating the SF-36 health survey questionnaire: new outcome measure for primary care.
BMJ, 1992
The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection.
1992
Estimating the Power of the Two-Sample Wilcoxon Test for Location Shift
Biometrics, 1988
Robustness of the two independent samples t‐test when applied to ordinal scaled data
Statistics in Medicine, 1987

Cited by 122 articles