Simulate cross-fitted predictive data
simulate_crossfit_data.RdGenerates a synthetic regression/classification dataset with covariates, outcomes, and out-of-fold predictions from a K-fold cross-fitted GLM.
Usage
simulate_crossfit_data(
n = 20000,
p = 5,
family = stats::binomial(),
K = 5,
seed = 1
)Arguments
- n
Number of observations to simulate.
- p
Number of covariates (columns in the design matrix).
- family
GLM family (defaults to
stats::binomial();stats::gaussian()gives a Gaussian response).- K
Number of folds used for cross-fitting.
- seed
RNG seed for reproducibility.