Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens
- PMID: 29873099
- PMCID: PMC6191367
- DOI: 10.1002/sim.7844
Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens
Abstract
Dynamic treatment regimens (DTRs) are sequential treatment decisions tailored by patient's evolving features and intermediate outcomes at each treatment stage. Patient heterogeneity and the complexity and chronicity of many diseases call for learning optimal DTRs that can best tailor treatment according to each individual's time-varying characteristics (eg, intermediate response over time). In this paper, we propose a robust and efficient approach referred to as Augmented Outcome-weighted Learning (AOL) to identify optimal DTRs from sequential multiple assignment randomized trials. We improve previously proposed outcome-weighted learning to allow for negative weights. Furthermore, to reduce the variability of weights for numeric stability and improve estimation accuracy, in AOL, we propose a robust augmentation to the weights by making use of predicted pseudooutcomes from regression models for Q-functions. We show that AOL still yields Fisher-consistent DTRs even if the regression models are misspecified and that an appropriate choice of the augmentation guarantees smaller stochastic errors in value function estimation for AOL than the previous outcome-weighted learning. Finally, we establish the convergence rates for AOL. The comparative advantage of AOL over existing methods is demonstrated through extensive simulation studies and an application to a sequential multiple assignment randomized trial for major depressive disorder.
Keywords: Q-learning; SMARTs; adaptive intervention; individualized treatment rule; machine learning; outcome-weighted learning; personalized medicine.
Copyright © 2018 John Wiley & Sons, Ltd.
Figures




Similar articles
-
Use of personalized Dynamic Treatment Regimes (DTRs) and Sequential Multiple Assignment Randomized Trials (SMARTs) in mental health studies.Shanghai Arch Psychiatry. 2014 Dec;26(6):376-83. doi: 10.11919/j.issn.1002-0829.214172. Shanghai Arch Psychiatry. 2014. PMID: 25642116 Free PMC article.
-
Synthesizing independent stagewise trials for optimal dynamic treatment regimes.Stat Med. 2020 Dec 10;39(28):4107-4119. doi: 10.1002/sim.8712. Epub 2020 Aug 17. Stat Med. 2020. PMID: 32804414 Free PMC article.
-
Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.Biometrics. 2017 Mar;73(1):145-155. doi: 10.1111/biom.12539. Epub 2016 May 23. Biometrics. 2017. PMID: 27213913
-
Bayesian inference for optimal dynamic treatment regimes in practice.Int J Biostat. 2023 May 17;19(2):309-331. doi: 10.1515/ijb-2022-0073. eCollection 2023 Nov 1. Int J Biostat. 2023. PMID: 37192544 Review.
-
Artificial Intelligence in Precision Cardiovascular Medicine.J Am Coll Cardiol. 2017 May 30;69(21):2657-2664. doi: 10.1016/j.jacc.2017.03.571. J Am Coll Cardiol. 2017. PMID: 28545640 Review.
Cited by
-
Optimizing Contingency Management with Reinforcement Learning.medRxiv [Preprint]. 2024 Mar 29:2024.03.28.24305031. doi: 10.1101/2024.03.28.24305031. medRxiv. 2024. PMID: 38585900 Free PMC article. Preprint.
-
Discussion of Kallus (2020) and Mo et al (2020).J Am Stat Assoc. 2021;116(534):690-693. doi: 10.1080/01621459.2020.1833887. Epub 2021 Apr 1. J Am Stat Assoc. 2021. PMID: 34483404 Free PMC article.
-
Estimating individualized treatment rules for multicategory type 2 diabetes treatments using electronic health records.Stat Interface. 2023;16(4):505-515. doi: 10.4310/22-sii739. Epub 2023 Apr 14. Stat Interface. 2023. PMID: 38344146 Free PMC article.
-
Matched Learning for Optimizing Individualized Treatment Strategies Using Electronic Health Records.J Am Stat Assoc. 2020;115(529):380-392. doi: 10.1080/01621459.2018.1549050. Epub 2019 Apr 23. J Am Stat Assoc. 2020. PMID: 33041401 Free PMC article.
-
Composite interaction tree for simultaneous learning of optimal individualized treatment rules and subgroups.Stat Med. 2019 Jun 30;38(14):2632-2651. doi: 10.1002/sim.8105. Epub 2019 Mar 19. Stat Med. 2019. PMID: 30891797 Free PMC article.
References
-
- Lavori PW, Dawson R. A design for testing clinical strategies: biased adaptive within-subject randomization. J Royal Stat Soc Ser A Stat Soc. 2000;163(1):29–38.
-
- Thall PF, Sung HG, Estey EH. Selecting therapeutic strategies based on efficacy and death in multicourse clinical trials. J Am Stat Assoc. 2002;97(457):29–39.
-
- Lunceford JK, Davidian M, Tsiatis AA. Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials. Biometrics. 2002;58(1):48–57. - PubMed
-
- Rush AJ, Fava M, Wisniewski SR, et al. Sequenced treatment alternatives to relieve depression (STAR* D): rationale and design. Contemp Clin Trials. 2004;25(1):119–142. - PubMed
-
- Murphy SA. An experimental design for the development of adaptive treatment strategies. Stat Med. 2005;24(10):1455–1481. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous