df_uc_em_sel {multibias} | R Documentation |
Simulated data with uncontrolled confounding, exposure misclassification, and selection bias
Description
Data containing three sources of bias, three known confounders, and
100,000 observations. These data are obtained by sampling with replacement,
with probability = S, from df_uc_em_sel_source and then removing
the columns X, U, and S. The resulting data correspond
to what a researcher would see in the real world: a misclassified exposure,
Xstar; missing data on a confounder, U; and missing data for
those not selected into the study (S = 0). As seen in
df_uc_em_sel_source, the true, unbiased exposure-outcome
odds ratio = 2.
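The sampling step described above can be sketched in base R. This is a minimal illustration, not the package's own data-generating code: `toy_source` is a hypothetical stand-in for df_uc_em_sel_source filled with arbitrary random values, and S is assumed to be a 0/1 selection indicator, so that only rows with S = 1 can be drawn.

```r
set.seed(1)
n <- 1000

# Hypothetical stand-in for df_uc_em_sel_source (values are arbitrary)
toy_source <- data.frame(
  X     = rbinom(n, 1, 0.5),  # true exposure
  Xstar = rbinom(n, 1, 0.5),  # misclassified exposure
  Y     = rbinom(n, 1, 0.3),  # outcome
  C1    = rbinom(n, 1, 0.5),  # known confounders
  C2    = rbinom(n, 1, 0.2),
  C3    = rbinom(n, 1, 0.8),
  U     = rbinom(n, 1, 0.5),  # confounder that will be dropped
  S     = rbinom(n, 1, 0.6)   # selection indicator (assumed 0/1)
)

# Sample row indices with replacement, weighting each row by S
idx <- sample(seq_len(n), size = n, replace = TRUE, prob = toy_source$S)

# Keep the sampled rows and drop the columns a researcher would not observe
toy_observed <- toy_source[idx, !(names(toy_source) %in% c("X", "U", "S"))]
```

Because rows with S = 0 have zero sampling weight, they never appear in `toy_observed`, which mirrors the selection bias present in df_uc_em_sel.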
Usage
df_uc_em_sel
Format
A data frame with 100,000 rows and 5 columns:
- Xstar: misclassified exposure, 1 = present and 0 = absent
- Y: outcome, 1 = present and 0 = absent
- C1: 1st confounder, 1 = present and 0 = absent
- C2: 2nd confounder, 1 = present and 0 = absent
- C3: 3rd confounder, 1 = present and 0 = absent
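As a hedged illustration of how these five columns might be used, the sketch below fits a logistic regression that adjusts only for the observed confounders and returns the odds ratio for Xstar. The helper name `naive_or` is hypothetical and not part of multibias. Because the model ignores the three biases, its estimate would not generally be expected to equal the true odds ratio of 2.

```r
# Hypothetical helper: naive exposure-outcome odds ratio, adjusting only for
# the observed confounders and ignoring all three sources of bias
naive_or <- function(data) {
  fit <- glm(Y ~ Xstar + C1 + C2 + C3,
             family = binomial(link = "logit"),
             data = data)
  unname(exp(coef(fit)["Xstar"]))
}

# Usage, assuming the multibias package is installed and exports this dataset:
# naive_or(multibias::df_uc_em_sel)
```

Any data frame with binary columns Xstar, Y, C1, C2, and C3 can be passed to the helper, so the same call could be reused on a bias-adjusted dataset for comparison.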