Skip to main content
International Journal of Epidemiology logoLink to International Journal of Epidemiology
. 2016 Nov 18;45(6):1922–1926. doi: 10.1093/ije/dyw280

Author's reply: The role of potential outcomes thinking in assessing mediation and interaction

Tyler J VanderWeele
PMCID: PMC5841620  PMID: 27864414

I would like to thank the various commentators for their time and thoughts concerning my book ‘Explanation in Causal Inference: Methods for Mediation and Interaction’1 and its accompanying synopsis.2 Kaufman3 traces some of the history of the methodological developments on these topics; Oakes and Naimi4 discuss the relevance of the book’s content to social epidemiology; Pearce and Vandenbroucke5 raise various concerns principally about the scope of the book. Taken together, the commentaries provide a helpful overview of what the book does and does not accomplish, and of the extent and limits of the book’s scope.

There appear to be important tensions present when comparing the commentaries themselves. Oakes and Naimi speak of how important the phenomena of mediation, interaction, and spillover are within social epidemiology and how helpful the methods will be in assessing these; Pearce and Vandenbroucke, in contrast, repeatedly state that they believe the methods will only ever be applicable to a very narrow set of questions. Kaufman and Oakes and Naimi praise the book for its clarity of exposition of difficult ideas that will now be widely accessible; Pearce and Vandenbroucke state that it is too technical and esoteric (‘should not ‘try at home’’). Kaufman fears that the methods, having now been made accessible with easy-to-use software will result in an explosion of applications, some of which will be carried out without a thorough understanding of the assumptions and limitations. Pearce and Vandenbroucke believe that the methods ‘are unlikely to be applied routinely by researchers who use epidemiologic tools in their studies of public health or clinical problems.’ Some of the differences in perspective may reflect differences across the sub-disciplines of the commentators. Some of the differences may reflect something of a generational divide. Some are likely due to natural variation that is to be expected and experienced across different readers. As stated in the book’s preface, some of the material covered in the book is indeed challenging. I have attempted to make difficult concepts and methods as accessible as possible, but I acknowledged in the preface itself, my own limitations in my capacity to do so. Whether the methods are used, or prove useful within public health, and whether the book is found accessible or not, I suppose time will tell.

An important point touched upon by Kaufman and by Oakes and Naimi, but perhaps not entirely appreciated by Pearce and Vandenbroucke concerns the relation between the new methods for mediation analysis, and the traditional methods (sometimes referred to as the ‘difference method’, or the ‘Baron and Kenny method’,6 effectively fitting two separate regression models, alternately with and without the mediator). As noted in the book, and in my synopsis, the newer methods effectively subsume the more traditional ones. As noted by Kaufman, in the absence of interaction between the exposure and the mediator, the methods coincide. The assumptions (for example, about adequate control for mediator-outcome confounding, discussed by all commentators) also coincide. However there was often not an awareness of what those assumptions were in many applications of the traditional approaches. What the newer methods do then is relax the assumptions, about the absence of interaction, about functional forms, and, through sensitivity analysis,7–15 about confounding and measurement error. Whenever the assumptions of the newer methods are not applicable, the assumptions of the traditional methods will not be either. This is an important point. It is not that the newer methods require stronger assumptions. It is that they point out the assumptions required by both the newer and the traditional methods (that have often been simply ignored when the traditional methods have been used). In the book and in my teaching I have emphasized the conditions under which the newer approaches are not needed and, in such cases, I have encouraged the use of the traditional approaches as they are easier to understand, interpret, and present. However, the advantages of the traditional approaches are simply ease of interpretation and use; they are not are making weaker assumptions nor are they sometimes applicable when the newer approaches are not. Again, whenever the newer methods are not applicable, the traditional methods will not be either.

Thus when Pearce and Vandenbroucke write that ‘In practice, very few studies will (even hypothetically) have data available of sufficient accuracy, and including sufficient variables, to enable us to use these methods with any confidence,’ this statement, contrary to what they seem to suggest elsewhere in their commentary, applies also to the traditional methods. In the context of mediation, it is effectively an abandonment of any attempt to statistically assess the importance of a mechanism. Indeed, in part because of the strengths of the assumptions required for mediation, this position of abandoning attempts at estimating direct and indirect effects has essentially been adopted by Donald Rubin.1,16,17 I am not entirely unsympathetic to the view: the assumptions are strong. My own perspective, however, differs insofar as I have worked on at least a handful of examples in which I think we can draw at least reliable qualitative conclusions about the importance, or lack thereof, of a particular pathway.1,18–20 What enables robust inference in these settings is sensitivity analysis to evaluate how conclusions might or might not be altered when assumptions are violated.

Pearce and Vandenbroucke repeatedly raise concerns about the scope of the book. They complain that it does not provide an overarching theory of causality, or of all types of scientific evidence, and of how to integrate this within epidemiology. However, it seems that they have misunderstood the intended scope of the text. The intended scope is relatively clearly laid out in the book’s preface: to address the topics of mediation and interaction from a counterfactual-based perspective. Much of their discussion is thus I think out of place. The book is not trying to present a grand theory of causality; the book is not intended as a comprehensive epidemiologic textbook; and the book is not intended as a broad philosophy of science. All of those undertakings would be important; material of that sort should be included in education; but none of this was the intent of the book. Their aphorism from Aquinas ‘Timeo hominem unius libri - I fear a man of one book’ is thus also out of place as a critique of the book.

They claim the book’s scope ought to have been broader on the grounds of its title: ‘Explanation in Causal Inference: Methods for Mediation and Interaction.’ They claim otherwise the book ought to have been titled: ‘Statistical Methods for Mediation Analysis and Interaction Assessment in Observational Studies which involve Exposures that are Potentially Manipulable.’ I do wonder what Oxford University Press’ marketing department would make of their proposal. Titles of books are often more pithy; one need only turn to the book’s preface for clarification of scope. I had in fact initially proposed as the title the even more compact ‘Methods for Mediation and Interaction’. Perhaps Pearce and Vandenbroucke would have thought this more accurate. One of the book’s initial reviewers suggested the expanded title on the grounds of placing the methods in the context of the proposed purpose of it all, and this indeed seemed reasonable; but again the second part of the title ‘Methods for Mediation and Interaction’, clarifying the scope, remained. What Pearce and Vandenbroucke would like to see, it seems, is a text that integrates different views of causality, comprehensive epidemiologic theory, and different forms of evidence, including animal models and qualitative research. I agree this would be desirable. It is an ambitious undertaking. It is not the task I set myself to undertake. Pearce and Vandenbroucke are better positioned to write such a text than I, and I very much hope that they will do so, and move beyond complaints that others have not supplied what they desire.

With regard to their comments on causality, Pearce and Vandenbroucke have, as also elsewhere,21 mis-represented my views, suggesting that I take the position that we can only speak of causes when something is subject to human manipulation. This is not a position I hold. Such a view is also contradicted in my book in my providing, in section 16.2, what I take to be a set of sufficient conditions for attributing causation or the truth of counterfactual claims, applicable even in the absence of human intervention (p. 450, 453, 454). Remarkably, this is in the very section from which they cherry pick their quotations to seemingly attribute to me a view I do not hold. It is not clear to me whether Pearce and Vandenbroucke have not carefully read the relevant section or have simply intentionally falsely represented my views. This was pointed out in a previous exchange,22 and still they persist in misrepresentation. I consider this poor scholarly practice in general and especially so for those of the intellectual stature of Pearce and Vandenbroucke.

Pearce and Vandenbroucke do raise important issues of the place of the potential outcomes framework within epidemiology. The potential outcomes approach is certainly only one tool among many that are important within epidemiology. There has been an entire discussion, to be published in the International Journal of Epidemiology, on the role and limits of potential outcomes in epidemiology. Both they and I have contributed to that exchange and so I will refer the interested reader to some of the relevant papers and commentaries21–26 and will not repeat here my own views on the matter. I do think it is an important discussion. I do not think that there are easy answers. Apparently, my book offered another occasion for them to raise these issues again.

Pearce and Vandenbroucke do raise some important, more specific, points about the actual intended content of the text. One concerns measurement and misclassification. Even if the modern methods of mediation are applicable, they will only give estimates concerning the mechanistic importance of the actual measured construct. If smoking intensity is measured by average cigarettes per day then the mediated effect estimate, even if measurement error correction methods are employed, will only be with respect to average cigarettes smoking per day, not to other aspects of smoking intensity, such as depth of inhalation. This is an important point. It is acknowledged in the book (p. 45, 235), but it probably ought to have been given greater emphasis.

Pearce and Vandenbroucke suggest that problems of unmeasured mediator-outcome confounding may only generate substantial bias in quite extreme situations and question whether this really occurs in practice. Section 3.4 of the book gives a dramatic example of such bias in a randomized trial of cognitive behavioral therapy, dramatic bias that emerges if such issues are ignored in assessing direct and indirect effects. In the example, the qualitative conclusions are entirely incorrect due to such biases. Ignoring such mediator-outcome confounding has been a central problem in the assessment of mechanisms in epidemiology and in the social sciences. Whether the comments of Pearce and Vandenbroucke were an attempt to justify past practice that had ignored the issue I do not know, but this point on needing to control for mediator-outcome confounding when assessing mediation is what I have tried to most centrally emphasize in my writing and teaching. Ignoring it imperils scientific assessment of mechanisms with epidemiologic data.

This example also raises another important point. In their proposed alternative book title, ‘Statistical Methods for Mediation Analysis and Interaction Assessment in Observational Studies which involve Exposures that are Potentially Manipulable,’ there appears to be the suggestion that the methods are relevant only in the context of observational studies. This is not so. The methods are relevant in the context of randomized trials as well if questions of mediation and mechanisms are of interest. Moreover, when assessing mediation, the same problems of mediator-outcome confounding can plague estimates of direct and indirect effects as in observational studies, as clearly illustrated in the book in the randomized cognitive behavioral therapy example. In a randomized trial, the exposure has been randomized, but the mediator generally has not been.

Pearce and Vandenbroucke criticize the book for what they claim as placing more emphasis on the potential problems and assumptions of Mendelian randomization than with mediation analysis. I find this surprising. The strong assumptions required to carry out mediation analysis are pointed out repeatedly, and, as noted in the commentary by Kaufman, there is continual emphasis on the need to employ sensitivity analysis techniques to evaluate the robustness of conclusions to the assumptions. My hope, and one of the goals of the book, is to see mediation analysis, when it is carried out, done more rigorously. I would like to see the same with Mendelian randomization analyses. My issues are not with the idea of Mendelian randomization (I have seen some very fine applications), but with its typical use in practice. In actual application, the assumptions are sometimes not stated; when they are stated, there is often little discussion of their plausibility or as to what potential violations might be (such as those documented in the book); until recently, there has been little effort to carry out sensitivity analysis to the assumptions. Some of these same criticisms of course hold true with the use of mediation analysis techniques. All of this I would like to see changed. The book itself provides a number of tools for sensitivity analysis for mediation; the book also proposes some sensitivity analysis for Mendelian randomization.1,27 A more recent proposal, subsequent to the book’s publication, for sensitivity analysis in Mendelian randomization, 28 I believe holds considerable promise for improving the quality and the robustness of inferences made using Mendelian randomization approaches and has somewhat altered my outlook on the value of the approach. However, I still want to see assumptions stated, discussed in the context of the application, and potential violations explored through sensitivity analysis. I think that this needs to happen both for mediation analysis and for Mendelian randomization analyses. I do not see things differently across these two methodological approaches. The potential outcomes framework has helped clarify the assumptions underlying both.1,29

More generally, the potential outcomes framework has brought tremendous clarity as to how to go about assessing more nuanced types of causal effects1,30–43, including those related to mechanisms with observational and experimental data, and under what assumptions such inferences are warranted. I agree with Pearce and Vandenbroucke that the framework does not address, and is not particularly helpful, for answering numerous other important questions. But with regard to assessing mechanisms from epidemiologic data, I see no reason to return to unclear thinking, less accurate inferences, and the ignoring of assumptions. Progress has been made. One can debate how central or important this progress is to the overall project of epidemiology, but the book was intended only to address one particular subset of questions within the field, and to do this better than was done in the past.

My goal was to take a few topics in epidemiology – mediation, interaction, and spillover effects – that are often discussed but for which concepts and methodology had been inadequate, and provide a more formal foundation for these. Considered within the entire scope of epidemiology, it is a modest undertaking. It was not in any way intended to provide a comprehensive theory for the field, or for causation. Whether I have achieved my more modest goals, the reader can judge and, as I wrote in the synopsis2, how useful any of this will be for the field depends on the extent to which the epidemiologic questions of greatest significance truly require quantitative assessment of mediation and interaction to address. Oakes and Naimi think often they will; in contrast, Pearce and Vandenbroucke believe that their usefulness will ‘often be limited.’ My hope is simply that when such mechanistic assessments are useful, they will now be done more carefully and more accurately.

Funding

This research was funded by United States NIH grant R01 ES017876.

Conflict of interest: None declared.

References

  • 1. VanderWeele TJ. Explanation in Causal Inference: Methods for Mediation and Interaction. New York, NY: Oxford University Press, 2015. [Google Scholar]
  • 2. VanderWeele TJ. Explanation in causal inference: developments in mediation and interaction. Int J Epidemiol 2016;45:1904–08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Kaufman JS. The epidemiology of two things considered together. Int J Epidemiol 2016;45:1909–11. [DOI] [PubMed] [Google Scholar]
  • 4. Oakes MJ, Naimi AI.. Mediation, interaction, interference for social epidemiology. Int J Epidemiol 2016;45:1912–14. [DOI] [PubMed] [Google Scholar]
  • 5. Pearce N, Vandenbroucke JP.. Causation, mediation, and explanation. Int J Epidemiol 2016;45:1915–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Baron RM, Kenny DA.. The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. J Pers Soc Psychol 1986;51:1173–82. [DOI] [PubMed] [Google Scholar]
  • 7. VanderWeele TJ. Bias formulas for sensitivity analysis for direct and indirect effects. Epidemiology 210;21:540–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Imai K, Keele L, Tingley D.. A general approach to causal mediation analysis. Psychol Methods 2010;15:309–34. [DOI] [PubMed] [Google Scholar]
  • 9. Hafeman D. Confounding of indirect effects: a sensitivity analysis exploring the range of bias due to a cause common to both the mediator and the outcome. Am J Epidemiol 2011;174:710–17. [DOI] [PubMed] [Google Scholar]
  • 10. VanderWeele TJ. Unmeasured confounding and hazard scales: sensitivity analysis for total, direct and indirect effects. Eur J Epidemiol 2013;28:113–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Jiang Z, VanderWeele TJ.. Causal mediation analysis in the presence of a mismeasured outcome. Epidemiology 2015;26:e8–9. [DOI] [PubMed] [Google Scholar]
  • 12. Valeri L, Lin X, VanderWeele TJ.. Mediation analysis when a continuous mediator is measured with error and the outcome follows a generalized linear model. Stat Med 2014;33:4875–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Valeri L, d VanderWeele TJ.. The estimation of direct and indirect causal effects in the presence of a misclassified binary mediator. Biostatistics 2014;15:498–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Tchetgen Tchetgen EJ, Shpitser I.. Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness, and sensitivity analysis. Ann Stat 2012;40:1816–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Ding P, VanderWeele TJ.. Sharp sensitivity bounds for mediation under unmeasured mediator-outcome confounding. Biometrika 2016;103:483–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Rubin DB. Discussion of: Experimental designs for identifying causal mechanisms (Imai ., Tingley D, Yamamoto T). J R Stat Soc A 2013;176:45. [Google Scholar]
  • 17. Rubin DB. Direct and indirect effects via potential outcomes. Scand J Stat 2004;31:161–70. [Google Scholar]
  • 18. VanderWeele TJ, Asomaning K, Tchetgen Tchetgen EJ. et al. Genetic variants on 15q25.1, smoking and lung cancer: An assessment of mediation and interaction. Am J Epidemiol 2012;175:1013–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Oberg AS, VanderWeele TJ, Malmros CA, Hernández-Díaz S. Pregnancy complications following fertility treatment - disentangling the role of multiple gestation. Submitted to Obstetrics and Gynecology. in press. [DOI] [PMC free article] [PubMed]
  • 20. Jackson JW, VanderWeele TJ, Viswanathan A, Blacker D, Schneeweiss S.. The explanatory role of incident stroke as a mediator of the mortality difference between older adults who initiate first vs. second generation antipsychotics. Am J Epidemiol 2014;180:847–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. VanderWeele TJ, Hernán MA, Tchetgen Tchetgen EJ, Robins JM.. Re: Causality and causal inference in epidemiology: the need for a pluralistic approach. Int J Epidemiol 2016;45:2199–200. [DOI] [PubMed] [Google Scholar]
  • 22. Vandenbroucke JP, Broadbent A, Pearce N.. Causality and causal inference in epidemiology: the need for a pluralistic approach. Int J Epidemiol 2016;45:1776–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Krieger N, Davey Smith G.. The tale wagged by the DAG: broadening the scope of causal inference and explanation for epidemiology. Int J Epidemiol 2016;45:1787–808. [DOI] [PubMed] [Google Scholar]
  • 24. VanderWeele TJ. On causes, causal inference, and potential outcomes. Int J Epidemiol 2016;45:1809–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Vansteelandt S, Daniel RM, De Stavola BL.. The formal approach to quantitative causal inference in epidemiology: misguided or misrepresented? Int J Epidemiol 2016;45:1817–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Robins JM, Weissman MB.. Counterfactual causation and streetlamps. What is to be done? Int J Epidemiol 2016;45:1830–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. VanderWeele TJ, Tchetgen Tchetgen EJ, Cornelis M, Kraft P.. Methodological challenges in Mendelian randomization. Epidemiology 2014;25:427–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Bowden J, Davey Smith G, Burgess S.. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol 2015;44:512–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Didelez V, Sheehan N.. Mendelian randomization as an instrumental variable approach to causal inference. Stat Methods Med Res 2007;16:309–30. [DOI] [PubMed] [Google Scholar]
  • 30. Hernán MA, Robins JM.. Causal Inference. Boca Raton, FL: Chapman Hall, forthcoming. [Google Scholar]
  • 31. Robins JM, Hernán MA, Brumback B.. Marginal structural models and causal inference in epidemiology. Epidemiology 2000;11:550–60. [DOI] [PubMed] [Google Scholar]
  • 32. Robins JM. A new approach to causal inference in mortality studies with sustained exposure period – application to control of the healthy worker survivor effect. Math Model 1986;7:1393–512. [Google Scholar]
  • 33. Robins JM, Greenland S.. Identifiability and exchangeability for direct and indirect effects. Epidemiology 1992;3:143–55. [DOI] [PubMed] [Google Scholar]
  • 34. Pearl J. Direct and indirect effects In: Proceedings of the Seventeenth Conference on Uncertainty and Artificial Intelligence. San Francisco, CA: Morgan Kaufmann, 2001. [Google Scholar]
  • 35. Hudgens MG, Halloran ME.. Towards causal inference with interference. J Am Stat Assoc 2008;103:832–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Tchetgen Tchetgen EJ, VanderWeele TJ.. On causal inference in the presence of interference. Stat Methods Med Res 2012;21:55–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. VanderWeele TJ, Tchetgen Tchetgen EJ.. Effect partitioning under interference for two-stage randomized vaccine trials. Stat Probab Lett 2011;81:861–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. VanderWeele TJ. Surrogate measures and consistent surrogates (with Discussion). Biometrics 2013;69:561–681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. VanderWeele TJ. On the distinction between interaction and effect modification. Epidemiology 2009;20:863-71. [DOI] [PubMed] [Google Scholar]
  • 40. VanderWeele TJ, Robins JM.. Empirical and counterfactual conditions for sufficient cause interactions. Biometrika 2008;95:49-61. [Google Scholar]
  • 41. Chiba Y, VanderWeele TJ.. A simple method for principal strata effects when the outcome has been truncated due to death. Am J Epidemiol 2011;173:745–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Frangakis CE, Rubin DB.. Principal stratification in causal inference. Biometrics 2002;58:21–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Angrist JD, Imbens GW, Rubin DB.. Identification of causal effects using instrumental variables (with discussion). J Am Stat Assoc 1996;91:444–72. [Google Scholar]

Articles from International Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES