J Clin Epidemiol. 2004 Mar;57(3):229-36.
doi: 10.1016/j.jclinepi.2003.08.009.

Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test

Sara T Brookes et al. J Clin Epidemiol. 2004 Mar.

Abstract

Objective: Despite guidelines recommending the use of formal tests of interaction in subgroup analyses in clinical trials, inappropriate subgroup-specific analyses continue. Moreover, trials designed to detect overall treatment effects have limited power to detect treatment-subgroup interactions. This article quantifies the error rates associated with subgroup analyses.

Study design and setting: Simulations quantified the risks of misinterpreting subgroup analyses as evidence of differential subgroup effects and the limited power of the interaction test in trials designed to detect overall treatment effects.
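A simulation of this kind can be sketched in a few lines of Python. The version below is only an illustration under stated assumptions, not the authors' simulation code: it assumes two equal-sized subgroups, a true treatment effect that is identical in both (so no interaction exists), and z-statistics drawn directly from their normal approximation. The function name `simulate_one_subgroup_only` and all of its parameters are hypothetical.

```python
import random
from statistics import NormalDist


def simulate_one_subgroup_only(overall_power=0.80, alpha=0.05,
                               n_sims=20000, seed=0):
    """Estimate how often exactly one of two equal-sized subgroups shows a
    significant effect when the true effect is the same in both.

    Assumption (not from the abstract): each subgroup test is a two-sided
    z-test and the subgroups contain disjoint, equally many participants.
    """
    norm = NormalDist()
    z_crit = norm.inv_cdf(1 - alpha / 2)
    # Mean of the overall z-statistic that yields the requested power
    # (the negligible opposite tail is ignored here).
    ncp_overall = z_crit + norm.inv_cdf(overall_power)
    # Half the sample per subgroup inflates the standard error by sqrt(2),
    # so each subgroup z-statistic has mean ncp_overall / sqrt(2).
    ncp_sub = ncp_overall / 2 ** 0.5

    rng = random.Random(seed)
    one_only = 0
    for _ in range(n_sims):
        sig_a = abs(rng.gauss(ncp_sub, 1.0)) > z_crit
        sig_b = abs(rng.gauss(ncp_sub, 1.0)) > z_crit
        # Count runs where the subgroup-specific tests disagree.
        one_only += sig_a != sig_b
    return one_only / n_sims


if __name__ == "__main__":
    print(f"{simulate_one_subgroup_only():.3f}")
```

Under these assumptions each subgroup test has roughly even odds of reaching significance, so the two tests disagree in about half of the simulated trials even though the true effect is the same in both subgroups.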

Results: Although formal interaction tests performed as expected with respect to false positives, subgroup-specific tests were considerably less reliable: A significant effect in one subgroup only was observed in 7% to 64% of simulations depending on trial characteristics. Regarding power of the interaction test, a trial with 80% power for the overall effect had only 29% power to detect an interaction effect of the same magnitude. For interactions of this size to be detected with the same power as the overall effect, sample sizes should be inflated fourfold, increasing dramatically for interactions smaller than 20% of the overall effect.
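The power figures in the Results can be checked with a short normal-approximation calculation. The sketch below assumes a continuous outcome, equal allocation to the two arms, and two equal-sized subgroups, in which case the interaction estimate has twice the standard error of the overall effect estimate. These assumptions, and the function names `two_sided_power`, `interaction_power`, and `inflation_factor`, are illustrative and are not taken from the article.

```python
from math import sqrt
from statistics import NormalDist

_NORM = NormalDist()


def two_sided_power(ncp, alpha=0.05):
    """Power of a two-sided z-test whose statistic has mean `ncp`."""
    z_crit = _NORM.inv_cdf(1 - alpha / 2)
    return _NORM.cdf(ncp - z_crit) + _NORM.cdf(-ncp - z_crit)


def interaction_power(overall_power=0.80, ratio=1.0, alpha=0.05,
                      inflation=1.0):
    """Power of the interaction test in a trial sized for the overall effect.

    ratio     : interaction size divided by overall effect size
    inflation : factor by which the original sample size is multiplied
    Assumes two equal-sized subgroups, so SE(interaction) = 2 * SE(overall).
    """
    z_crit = _NORM.inv_cdf(1 - alpha / 2)
    ncp_overall = z_crit + _NORM.inv_cdf(overall_power)
    # Doubling the standard error halves the mean of the z-statistic;
    # a larger sample scales it back up by sqrt(inflation).
    ncp_interaction = ncp_overall * ratio / 2 * sqrt(inflation)
    return two_sided_power(ncp_interaction, alpha)


def inflation_factor(ratio=1.0):
    """Sample-size multiplier that gives the interaction test the same
    power as the overall test, under the same assumptions."""
    return 4.0 / ratio ** 2


if __name__ == "__main__":
    print(f"{interaction_power():.2f}")               # same-size interaction
    print(f"{interaction_power(inflation=4.0):.2f}")  # fourfold sample
    print(f"{inflation_factor(0.2):.0f}")             # interaction 20% of overall
```

With the defaults, `interaction_power()` returns roughly 0.29 and `interaction_power(inflation=4.0)` returns roughly 0.80, in line with the reported 29% power and fourfold inflation. Because the multiplier grows as `4 / ratio**2`, it rises steeply as the interaction shrinks relative to the overall effect.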

Conclusion: Although it is generally recognized that subgroup analyses can produce spurious results, the extent of the problem may be underestimated.
