Q1. Consider the following matching formula: formula.match <- as.formula(diabetes ~ sex + age + race + smoking'). Which R codes can be used to match one exposed to one unexposed based on their propensity score, where propensity score is defined as the probability of being exposed to diabetes?)

A. matchit(formula.match, data = dat.analytic, method = 'exact', ratio = 1)
B. matchit(formula.match, data = dat.analytic, method = 'full')
C. matchit(formula.match, data = dat.analytic, method = 'nearest')
D. matchit(formula.match, data = dat.analytic, method = 'exact')

Q2. What is the key difference between Zanutto (2006), DuGoff et al. (2014), and Austin et al. (2018) methods in propensity score analysis with complex survey data?

A. Different in terms of outcome modelling
B. Different in terms of assessing covariate balance
C. Different in terms of exposure/treatment modelling
D. There is no difference

Q3. Say, we are exploring the association between diabetes (exposure) and CVD (outcome) using NHANES 2017-18 dataset. Age, sex, race, income, diet, smoking, and hypertension are the confounders. The name of the survey features are: psu, strata, survey.weight. Which of the following code can be used to estimate the propensity score using Austin et al. (2018) approach?

A. ps.austin <- glm(diabetes ~ age + sex + race + income + diet + smoking + hypertension, data = dat.analytic, family = binomial)$fitted.values
B. ps.austin <- glm(cvd ~ age + sex + race + income + diet + smoking + hypertension, data = dat.analytic, family = binomial)$fitted.values
C. ps.austin <- svyglm(diabetes ~ age + sex + race + income + diet + smoking + hypertension, design = survey.design, family = binomial)$fitted.values
D. ps.austin <- glm(diabetes ~ age + sex + race + income + diet + smoking + hypertension + psu + strata + survey.weight, data = dat.analytic, family = binomial)$fitted.values

Q4. For Q3, we match an exposed subject (having diabetes) with a control subject (no diabetes) without replacement within the caliper of 0.2 times the standard deviation of the logit of the propensity score. The name of the matched dataset is 'mydata.match'. The name of the survey design on full data is 'design.full', and the subset of the design on the matched dataset is 'design.match'. What could be the outcome model for DuGoff et al. (2014) method when we are interested in population-level estimates?

A. fit <- glm(cvd ~ diabetes, data = mydata.match, family = binomial)
B. fit <- glm(cvd ~ diabetes + age + sex + race + income + diet + smoking + hypertension, data = mydata.match, family = binomial)
C. fit <- svyglm(cvd ~ diabetes, design = design.match, family = binomial)
D. fit <- svyglm(cvd ~ diabetes + age + sex + race + income + diet + smoking + hypertension, design = design.full, family = binomial)

Q5. What could be your outcome model when you perform propensity score matching with multiple imputation?

A. fit[[i]] <- glm(cvd ~ diabetes, data = mydata.match, family = binomial)
B. fit[[i]] <- glm(cvd ~ diabetes + age + sex + race + income + diet + smoking + hypertension, data = mydata.match, family = binomial)
C. fit[[i]] <- svyglm(cvd ~ diabetes, design = design.match, family = binomial)
D. fit[[i]] <- svyglm(cvd ~ diabetes + age + sex + race + income + diet + smoking + hypertension, design = design.full, family = binomial)