Abstract
Measures of health-related quality of life and other “soft” endpoints have appeal to clinical trialists because of their direct relevance to the patient. Unfortunately, while one can define “statistical significance” precisely, what constitutes “clinical significance” remains elusive. A very small difference in a scale, while statistically significant, may have little relevance to the individual patient. Cardiologists have developed a number of soft endpoints, for example, the Killip Scale and the New York Heart Association Score, that define easily recognizable and distinguishable scenarios. Clinical trials in cardiology have used these scores as entry criteria or, occasionally, as primary endpoints and the field has been able to interpret results clinically. Many other fields rely on scales that lack verbal tags to clinical scenarios. The clinical community often cannot interpret data from such scales, even though the scales themselves are quite reliable, precise, and sensitive to change. Without meaningful approaches to defining clinical significance, such scales are unlikely to become acceptable in clinical trials except, perhaps, as exploratory endpoints. This paper discusses several approaches to defining clinical significance, such as attaching changes in scale to changes in risk as defined epidemiologically, matching levels of scale to objective levels of function, using expert panels and groups of patients to calibrate the scales, and adopting the increasingly popular metric “reliable change.”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Huntington Study Group (1996). Unified Huntington’s Disease Rating Scale: reliability and consistency. Movement Disorders 11, 136–142.
Heald, S., Riddle, D. and Lamb, R. (1997). The Shoulder Pain and Disability Index: the construct validity and responsiveness of a region-specific disability measure. Physical Therapy 77, 1079–1089.
Gould, A. (1980). A new approach to the analysis of clinical drug trials with withdrawals. Biometrics 36, 721–727.
Little, R.J.A. and Rubin, D.B. (1987). Statistical Analysis with Missing Data. New York: John Wiley and Sons, Inc.
Bernhard, J. and Gelber, R., eds. (1998). Workshop on missing data in quality of life research in clinical trials: practical and methodological issues. Statistics in Medicine 17, 511–796.
Gelber, R., Gelman, R. and Goldhirsch, A. (1989). A quality-of-life-oriented endpoint for comparing therapies. Biometrics 45, 781–795.
Cole, B., Gelber, R. and Anderson, K. (1994). Parametric approaches to quality-adjusted survival analysis. Biometrics 50, 621–631.
Murray, S. and Cole, B. (2000). Variance and sample size calculations in quality-of-life-adjusted survival analysis (Q-TWiST). Biometrics 56, 173–182.
Lachenbruch P (2001). Comparisons of two-part models with competitors. Statistics in Medicine 20, 1215–1234.
Shih, W. and Quan, H. (1997). Testing for treatment differences with dropouts present in clinical trials-a composite approach. Statistics in Medicine 11, 1225–1239.
Stephens, S. (1957). On the psychophysical law. Psychological Reviews 64, 153–181.
Cox, D.R., Fitzpatrick, R., Fletcher, AE, Gore, SM, Spiegelhalter, D.J. and Jones, D.J. (1992). Quality of life assessment: can we keep it simple? Journal of the Royal Statistical Society, Series A 155, 353–393.
Sonis, S.T., Eilers, J.P., Epstein, J.B., LeVoque, F.G., Liggett, W.H., Jr., Mulagha, M.T., Peterson, D.E., Rose, A.H., Schubert, M.M., Spijkervet, F.K. and Wittes, J. (1999). Validation of a new scoring system for the assessment of clinical trial research of oral mucositis induced by radiation or chemotherapy. Cancer 85, 2103–2113.
Wittes, J., Lakatos, E. and Probstfield, J. (1989). Surrogate endpoints in clinical trials: cardiovascular diseases. Statistics in Medicine 8, 415–425.
The ALS CNTF Treatment Study (ACTS) Phase MI Study Group (1996). The Amyotrophic Lateral Sclerosis Functional Rating Scale. Assessment of activities of daily living in patients with amyotrophic lateral sclerosis. Archives of Neurology 53, 141–147.
Temkin, N.R., Heaton, R.K., Grant, I. and Dikonen, S.S. (1999). Detecting significant change in neuropsychological test performance: a comparison of four models. Journal of the International Neuropsychological Society 5, 357–369.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Wittes, J. (2002). The Use of Soft Endpoints in Clinical Trials: The Search for Clinical Significance. In: Mesbah, M., Cole, B.F., Lee, ML.T. (eds) Statistical Methods for Quality of Life Studies. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3625-0_11
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3625-0_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5207-3
Online ISBN: 978-1-4757-3625-0
eBook Packages: Springer Book Archive