Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

doi:10.1016/j.jclinepi.2003.08.009

. 2004 Mar;57(3):229-36.

doi: 10.1016/j.jclinepi.2003.08.009.

Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

Sara T Brookes¹, Elise Whitely, Matthias Egger, George Davey Smith, Paul A Mulheran, Tim J Peters

Affiliations

PMID: 15066682
DOI: 10.1016/j.jclinepi.2003.08.009

Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

Sara T Brookes et al. J Clin Epidemiol. 2004 Mar.

. 2004 Mar;57(3):229-36.

doi: 10.1016/j.jclinepi.2003.08.009.

Authors

Sara T Brookes¹, Elise Whitely, Matthias Egger, George Davey Smith, Paul A Mulheran, Tim J Peters

Affiliation

¹ Department of Social Medicine, University of Bristol, Whiteladies Road, Bristol, BS8 2PR, UK. sara.t.brookes@bristol.ac.uk

PMID: 15066682
DOI: 10.1016/j.jclinepi.2003.08.009

Abstract

Objective: Despite guidelines recommending the use of formal tests of interaction in subgroup analyses in clinical trials, inappropriate subgroup-specific analyses continue. Moreover, trials designed to detect overall treatment effects have limited power to detect treatment-subgroup interactions. This article quantifies the error rates associated with subgroup analyses.

Study design and setting: Simulations quantified the risks of misinterpreting subgroup analyses as evidence of differential subgroup effects and the limited power of the interaction test in trials designed to detect overall treatment effects.

Results: Although formal interaction tests performed as expected with respect to false positives, subgroup-specific tests were considerably less reliable: A significant effect in one subgroup only was observed in 7% to 64% of simulations depending on trial characteristics. Regarding power of the interaction test, a trial with 80% power for the overall effect had only 29% power to detect an interaction effect of the same magnitude. For interactions of this size to be detected with the same power as the overall effect, sample sizes should be inflated fourfold, increasing dramatically for interactions smaller than 20% of the overall effect.

Conclusion: Although it is generally recognized that subgroup analyses can produce spurious results, the extent of the problem may be underestimated.

PubMed Disclaimer

Cited by

Perspective: Planning and Conducting Statistical Analyses for Human Nutrition Randomized Controlled Trials: Ensuring Data Quality and Integrity.
Petersen KS, Kris-Etherton PM, McCabe GP, Raman G, Miller JW, Maki KC. Petersen KS, et al. Adv Nutr. 2021 Oct 1;12(5):1610-1624. doi: 10.1093/advances/nmab045. Adv Nutr. 2021. PMID: 33957665 Free PMC article.
Investigating change across time in prevalence or association: the challenges of cross-study comparative research and possible solutions.
Bann D, Wright L, Goisis A, Hardy R, Johnson W, Maddock J, McElroy E, Moulton V, Patalay P, Scholes S, Silverwood RJ, Ploubidis GB, O'Neill D. Bann D, et al. Discov Soc Sci Health. 2022;2(1):18. doi: 10.1007/s44155-022-00021-1. Epub 2022 Oct 27. Discov Soc Sci Health. 2022. PMID: 36317190 Free PMC article.
A systematic review and individual patient data meta-analysis of published randomized clinical trials comparing early versus interval appendectomy for children with perforated appendicitis.
Duggan EM, Marshall AP, Weaver KL, St Peter SD, Tice J, Wang L, Choi L, Blakely ML. Duggan EM, et al. Pediatr Surg Int. 2016 Jul;32(7):649-55. doi: 10.1007/s00383-016-3897-y. Epub 2016 May 9. Pediatr Surg Int. 2016. PMID: 27161128 Review.
Sample size requirements for detecting treatment effect heterogeneity in cluster randomized trials.
Yang S, Li F, Starks MA, Hernandez AF, Mentz RJ, Choudhury KR. Yang S, et al. Stat Med. 2020 Dec 10;39(28):4218-4237. doi: 10.1002/sim.8721. Epub 2020 Aug 21. Stat Med. 2020. PMID: 32823372 Free PMC article.
Estimating optimal decision trees for treatment assignment: The case of K > 2 treatment alternatives.
Sies A, Doove L, Meers K, Dusseldorp E, Van Mechelen I. Sies A, et al. Behav Res Methods. 2024 Dec;56(8):8259-8268. doi: 10.3758/s13428-024-02470-9. Epub 2024 Aug 20. Behav Res Methods. 2024. PMID: 39164562

See all "Cited by" articles

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Elsevier Science
Other Literature Sources
- H1 Connect

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

Affiliation

Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

Authors

Affiliation

Abstract

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Similar articles

Cited by

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Other Literature Sources