Problems due to small samples and sparse data in conditional logistic regression analysis
- PMID: 10707923
- DOI: 10.1093/oxfordjournals.aje.a010240
Problems due to small samples and sparse data in conditional logistic regression analysis
Abstract
Conditional logistic regression was developed to avoid "sparse-data" biases that can arise in ordinary logistic regression analysis. Nonetheless, it is a large-sample method that can exhibit considerable bias when certain types of matched sets are infrequent or when the model contains too many parameters. Sparse-data bias can cause misleading inferences about confounding, effect modification, dose response, and induction periods, and can interact with other biases. In this paper, the authors describe these problems in the context of matched case-control analysis and provide examples from a study of electrical wiring and childhood leukemia and a study of diet and glioma. The same problems can arise in any likelihood-based analysis, including ordinary logistic regression. The problems can be detected by careful inspection of data and by examining the sensitivity of estimates to category boundaries, variables in the model, and transformations of those variables. One can also apply various bias corrections or turn to methods less sensitive to sparse data than conditional likelihood, such as Bayesian and empirical-Bayes (hierarchical regression) methods.
Comment in
-
Re: "Problems due to small samples and sparse data in conditional logistic regression analysis".Am J Epidemiol. 2000 Oct 1;152(7):688-9. doi: 10.1093/aje/152.7.688. Am J Epidemiol. 2000. PMID: 11032165 No abstract available.
Similar articles
-
Bias in Odds Ratios From Logistic Regression Methods With Sparse Data Sets.J Epidemiol. 2023 Jun 5;33(6):265-275. doi: 10.2188/jea.JE20210089. Epub 2022 Apr 1. J Epidemiol. 2023. PMID: 34565762 Free PMC article. Review.
-
Comparison of the missing-indicator method and conditional logistic regression in 1:m matched case-control studies with missing exposure values.Am J Epidemiol. 2004 Mar 15;159(6):603-10. doi: 10.1093/aje/kwh075. Am J Epidemiol. 2004. PMID: 15003965
-
Data augmentation priors for Bayesian and semi-Bayes analyses of conditional-logistic and proportional-hazards regression.Stat Med. 2001 Aug 30;20(16):2421-8. doi: 10.1002/sim.902. Stat Med. 2001. PMID: 11512132
-
Bias-reduced and separation-proof conditional logistic regression with small or sparse data sets.Stat Med. 2010 Mar 30;29(7-8):770-7. doi: 10.1002/sim.3794. Stat Med. 2010. PMID: 20213709
-
Methods for epidemiologic analyses of multiple exposures: a review and comparative study of maximum-likelihood, preliminary-testing, and empirical-Bayes regression.Stat Med. 1993 Apr 30;12(8):717-36. doi: 10.1002/sim.4780120802. Stat Med. 1993. PMID: 8516590 Review.
Cited by
-
Environmental determinants of bicycling injuries in Alberta, Canada.J Environ Public Health. 2012;2012:487681. doi: 10.1155/2012/487681. Epub 2012 Nov 28. J Environ Public Health. 2012. PMID: 23251192 Free PMC article.
-
Parental risk perception and influenza vaccination of children in daycare centres.Epidemiol Infect. 2014 Jan;142(1):134-41. doi: 10.1017/S0950268813000782. Epub 2013 Apr 18. Epidemiol Infect. 2014. PMID: 23594431 Free PMC article.
-
Low bone mineral density is associated with insulin resistance in bone marrow transplant subjects.Bone Marrow Transplant. 2009 Jun;43(12):953-7. doi: 10.1038/bmt.2009.70. Epub 2009 Apr 13. Bone Marrow Transplant. 2009. PMID: 19363530
-
Child outcomes after amnioinfusion compared with no intervention in women with second-trimester rupture of membranes: a long-term follow-up study of the PROMEXIL-III trial.BJOG. 2021 Jan;128(2):292-301. doi: 10.1111/1471-0528.16115. Epub 2020 Mar 4. BJOG. 2021. PMID: 31984652 Free PMC article. Clinical Trial.
-
Bias in Odds Ratios From Logistic Regression Methods With Sparse Data Sets.J Epidemiol. 2023 Jun 5;33(6):265-275. doi: 10.2188/jea.JE20210089. Epub 2022 Apr 1. J Epidemiol. 2023. PMID: 34565762 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources