Propensity Score Matching
Summary of topics discussed
Propensity Score (PS)
A propensity score is the probability that a unit (e.g., a person) is assigned to a particular treatment, given a set of observed covariates. It is used to reduce confounding bias in observational studies where treatment assignment is not random. The propensity score is typically written as \(\Pr(A=1 \mid L)\), i.e., the probability of receiving treatment \(A\) given the observed covariates \(L\).
Modeling Propensity Score
Modeling PS involves estimating the probability of treatment assignment based on observed covariates. Common methods include:
- Logistic Regression: Often used to estimate the PS, where the dependent variable is the treatment assignment and the independent variables are the covariates (a minimal code sketch follows this list).
- Machine Learning Methods: Approaches such as decision trees, random forests, or support vector machines can also be used, especially when the relationship between covariates and treatment assignment is complex.
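To make the logistic-regression approach concrete, here is a minimal sketch in Python. The DataFrame `df`, the treatment column `A`, and the covariates `age`, `sex`, and `comorbidity` are hypothetical names used only for illustration.

```python
import pandas as pd
import statsmodels.formula.api as smf

def estimate_ps(df: pd.DataFrame) -> pd.Series:
    # Treatment assignment "A" is the dependent variable; the covariates are predictors.
    ps_model = smf.logit("A ~ age + sex + comorbidity", data=df).fit(disp=0)
    # The fitted probabilities Pr(A = 1 | L) are the propensity scores.
    return ps_model.predict(df)

# df["ps"] = estimate_ps(df)
```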
Choosing Variables
Variables to be included in a PS model should be those that are:
- Confounders
- Risk factors for the outcome
We usually avoid instruments and noise variables. Looking at the outcome data while modelling the PS is usually discouraged.
PS Approaches
PS Matching: Involves pairing treated and untreated units with similar PS to create a balanced comparison group. Matching can be done through various methods, such as nearest-neighbour matching or caliper matching. PS matching estimates the ATT (average treatment effect in the treated).
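A minimal sketch of 1:1 nearest-neighbour matching on the PS with a caliper. It assumes `df` already contains a binary treatment column `A` and an estimated PS column `ps` (hypothetical names); the 0.2-SD-of-logit caliper is a common rule of thumb, not a requirement.

```python
import numpy as np
import pandas as pd

def nearest_neighbor_match(df: pd.DataFrame, caliper: float = 0.2) -> pd.DataFrame:
    # Rule of thumb: caliper width = 0.2 standard deviations of the logit of the PS.
    logit = np.log(df["ps"] / (1 - df["ps"]))
    max_dist = caliper * logit.std()
    treated = df[df["A"] == 1]
    controls = df[df["A"] == 0].copy()
    matched = []
    for i in treated.index:
        if controls.empty:
            break
        dist = (logit.loc[controls.index] - logit.loc[i]).abs()
        j = dist.idxmin()
        if dist.loc[j] <= max_dist:        # accept the pair only if within the caliper
            matched.extend([i, j])
            controls = controls.drop(j)    # 1:1 matching without replacement
    return df.loc[matched]
```

The ATT can then be estimated by comparing outcomes between the treated units and their matched controls in the returned sample.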
PS Weighting: Involves assigning a weight to each unit based on its PS to create a synthetic sample in which the distribution of observed covariates is independent of treatment assignment. The most common method is inverse probability of treatment weighting (IPTW). Both the ATE (average treatment effect) and the ATT can be estimated with PS weighting. When PS matching would substantially reduce the sample size, PS weighting may be preferable because it uses the entire sample.
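A corresponding sketch of IPTW weights, under the same assumed column names. The ATE weights reweight both groups toward the full population; the ATT weights leave the treated group as-is and reweight the untreated group to resemble the treated.

```python
import pandas as pd

def add_iptw_weights(df: pd.DataFrame) -> pd.DataFrame:
    df = df.copy()
    # ATE weights: 1/PS for treated units, 1/(1 - PS) for untreated units.
    df["w_ate"] = df["A"] / df["ps"] + (1 - df["A"]) / (1 - df["ps"])
    # ATT weights: 1 for treated units, PS/(1 - PS) for untreated units.
    df["w_att"] = df["A"] + (1 - df["A"]) * df["ps"] / (1 - df["ps"])
    return df
```

Propensity scores near 0 or 1 produce extreme weights; stabilizing or truncating the weights is common, but truncation can itself bias results, as the comparison table below notes.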
Other PS approaches:
- Stratification by PS deciles and
- covariate adjustment using the PS (on its own or along with other useful confounders)
are other PS approaches that are often used. We do not cover them in detail; see the required reading [Austin 2011] for further details, and a brief stratification sketch below.
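For completeness, a minimal stratification sketch (quintiles here; the deciles mentioned above work the same way). It assumes `df` has an outcome column `Y` in addition to `A` and `ps` (all hypothetical names) and reports a stratum-size-weighted difference in mean outcomes.

```python
import pandas as pd

def stratified_effect(df: pd.DataFrame, n_strata: int = 5) -> float:
    df = df.assign(stratum=pd.qcut(df["ps"], q=n_strata, labels=False))
    effects, sizes = [], []
    for _, g in df.groupby("stratum"):
        if g["A"].nunique() < 2:
            continue  # skip strata with no overlap between treated and untreated
        effects.append(g.loc[g["A"] == 1, "Y"].mean() - g.loc[g["A"] == 0, "Y"].mean())
        sizes.append(len(g))
    # Average the within-stratum differences, weighting by stratum size.
    return float(sum(e * s for e, s in zip(effects, sizes)) / sum(sizes))
```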
| PS Method | Advantage | Disadvantage |
|---|---|---|
| Matching | Directly gives ATT. Balance checking possible. | May reduce sample size a lot. Balance may not be achieved. Iterative process. |
| Weighting | Directly gives ATT or ATE. Uses full data. Balance checking possible. | Weights can be extreme. Truncation may bias results. Iterative process. |
| Stratification | Helpful in assessing effect modification. Balance checking possible. | May suffer from small sample sizes within each stratum (typically 5-10 strata are used). Overlap may be hard to assess within each stratum. |
| As a covariate | No balance assessment necessary. | Method associated with the most bias. Unclear what the estimand is. |
Four Steps of PS Modeling
- Specify Covariates and Fit Model: Identify and select relevant covariates, and fit a model (e.g., logistic regression) to estimate PS.
- Match Subjects by Estimated PS: Use the estimated PS to match treated and untreated units, ensuring they are comparable.
- Covariate Balance Checking: Assess whether the matching or weighting has successfully balanced the covariates across treatment groups (a balance-checking sketch follows this list).
- Estimation of Treatment Effect: Analyze the outcome using the matched or weighted sample to estimate the causal effect of the treatment.
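A minimal balance-checking sketch using standardized mean differences (SMDs), applicable to both matched and weighted samples; an absolute SMD below roughly 0.1 is a common rule of thumb for adequate balance. The column names and the `covariates` list are illustrative assumptions.

```python
from typing import Optional
import numpy as np
import pandas as pd

def smd(df: pd.DataFrame, covariates: list, weights: Optional[pd.Series] = None) -> pd.Series:
    w = pd.Series(1.0, index=df.index) if weights is None else weights
    t, c = df["A"] == 1, df["A"] == 0
    out = {}
    for x in covariates:
        m1 = np.average(df.loc[t, x], weights=w[t])
        m0 = np.average(df.loc[c, x], weights=w[c])
        v1 = np.average((df.loc[t, x] - m1) ** 2, weights=w[t])
        v0 = np.average((df.loc[c, x] - m0) ** 2, weights=w[c])
        # Standardized mean difference: difference in (weighted) means over pooled SD.
        out[x] = (m1 - m0) / np.sqrt((v1 + v0) / 2)
    return pd.Series(out)

# e.g. compare smd(df, covs) with smd(matched, covs), or with smd(df, covs, weights=df["w_ate"]).
```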
PS vs. Regression
PS methods are used to balance covariates across treatment groups, creating a quasi-experimental design, whereas regression directly models the relationship between treatment and outcome while adjusting for covariates. Both PS and regression methods assume that all relevant covariates are observed (no hidden bias, no unmeasured confounding), and both require correct specification of the functional form of the relationships among covariates, treatment, and outcome.
When comparing the estimates of Odds Ratios (OR) and Confidence Intervals (CI) from PS matching and regression models, they might not differ significantly. However, there are several reasons why researchers might still opt for PS matching:
Intuitive Comparison: PS matching allows for an intuitive and straightforward comparison between two groups that are similar in observed covariates, facilitating easier interpretation and explanation of results.
Ease of Diagnostics: Checking the balance of covariates across matched groups (balance checking) can be more straightforward and intuitive than evaluating regression diagnostics, such as residual plots or influence diagnostics.
Separation of Models: PS matching separates the model for the exposure (treatment assignment) from the model for the outcome, which can be beneficial in ensuring that the method for estimating treatment effects is not too dependent on the specification of the outcome model.
Flexibility with Assumptions: PS methods, especially those utilizing machine learning approaches, can relax the linearity assumption in estimating PS, providing more flexibility in handling complex relationships between covariates and treatment assignment.
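As a small illustration of that flexibility point, the PS can be estimated with a more flexible learner such as gradient boosting rather than a linear-in-the-covariates logistic model. This is only a sketch under the same hypothetical column names used earlier.

```python
from sklearn.ensemble import GradientBoostingClassifier

def estimate_ps_flexible(df, covariates):
    # Fit a flexible classifier for treatment assignment; no linearity assumption needed.
    model = GradientBoostingClassifier().fit(df[covariates], df["A"])
    # predict_proba returns [P(A=0), P(A=1)] per row; the second column is the PS.
    return model.predict_proba(df[covariates])[:, 1]
```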
References (Optional)
- Karim, ME (2021) Understanding Propensity Score Matching