19  Reporting

19.1 Analysis Information

Many reporting guideline already exists about what to report in a propensity score analysis. Most of the reporting guideline should be applicable in the hdPS context as well. On top of those, we also need to consider reporting the following information about hdPS for transparency.

19.2 Information about hdPS

Information Description Our example
Proxy data dimensions The number of data dimensions (p) used. p = 1 as only one proxy data dimension (dx) was available from medication usage
What was done to remove proxies that are problematic Usually proxies of outcome, exposure as well as those identified as IV or mediator or collider are discarded. obesity and diabetes related codes removed
Proxy feature parameters The parameters used to select proxy features, including granularity [g], prevalence filter [n], and the minimum number of patients [m] g = 3, n = 200, m = 20. This resulted in 126 empirical covariates.
Recurrence parameters How many recurrence variables per code [r] and the covariate assessment period [CAP] r = 3, CAP = 30 days. This resulted in 143 distinct recurrence covariates.
Prioritization process The process used to prioritize proxy features, such as machine learning (ML), Bross, or hybrid methods We used all of these, but used Bross formula for hdPS to calculate absolute log of the multiplicative bias, and then ranked based on magnitude to select / prioritize recurrence covariates.
Selected proxies The number of proxies selected (k) for the model k = 100 for the hdPS
Software The software used to perform the analysis R: autoCovariateSelection package

19.3 Diagnostics

Information Description Our example
Diagnostics used to assess the model Standardized mean differences (SMD) Within 0.1 in hdPS analysis
Weight (IPW) summary assessment Somewhat reasonable (maximum approximately 54) within hdPS analysis
Comparison of propensity score distributions between each exposure group Overlapping (common support) does not seem to be an issue.
Assess distribution of absolute log bias Most bias multiplier values are close to null (0), only a few values seem to deviate from null.
Comparison with regular propensity score Estimates slightly towards null

19.4 Sensitivity analysis

Information Description Our example
Sensitivity analysis Varying the number of selected proxies [k]. OR estimates stabilizes around 1.5, shows variability below k = 50 and above 110
Sensitivity analysis Varying the prevalence filter [n]. OR estimates stabilizes around 1.5 for above n = 60.