Concepts (C)

ML in causal inference

In comparative effectiveness studies, researchers typically use propensity score methods. However, propensity score methods have known limitations in real-world scenarios, when the true data generating mechanism is unknown. Targeted maximum likelihood estimation (TMLE) is an alternative estimation method with a number of desirable statistical properties. It is a doubly robust method, enabling the integration of machine learning approaches within the framework. Despite the fact that this method has been shown to perform better in terms of statistical properties (e.g., variance estimation) than propensity score methods in a variety of scenarios, it is not widely used in medical research as the implementation details of this approach are generally not well understood. In this section, we will explain this method in details.

Reading list

Key reference: (Karim and Frank 2021)

Optional reading: (Frank and Karim 2023)

Video Lessons

Machine learning

The workshop was first developed for R/Medicine Virtual Conference https://r-medicine.org/ 2021, August 24th. What is included in this Video Lesson/workshop:

Chapter 1 RHC data description 4:22
Chapter 2 G-computation 23:13
Chapter 3 G-computation using ML 45:02
Chapter 4 IPTW 1:18:50
Chapter 5 IPTW using ML 1:30:11
Chapter 6 TMLE 1:36:41
Chapter 7 Pre-packaged software 1:58:05
Chapter 8 Final Words 2:14:36

The timestamps are also included in the YouTube video description.

Links

The original workshop materials are available here.

References

Frank, Hanna A, and Mohammad Ehsanul Karim. 2023. “Implementing TMLE in the Presence of a Continuous Outcome.” Research Methods in Medicine & Health Sciences, 26320843231176662.

Karim, Ehsan, and Hanna Frank. 2021. “ehsanx/TMLEworkshop: R Guide for TMLE in Medical Research.” Zenodo. https://doi.org/10.5281/zenodo.5246085.