I am running analyses on a dataset with missing values. Running Multiple Imputation, MI, in SPSS is easy enough and quite handy My dilemma comes with imputing data on one variable that shows a conditional effect upon my main independent variable of interest (substantive model post imputation). It seems to me that the value in running MI, filling in missing data while producing accurate standard errors in regression models, is lost here. Is it correct to say that MI is appropriate for independent variables, dependent variable, and covariates only? But not for variables showing an interaction effect (conditional effect)?

One variable (binary) shows an interaction with my independent variable. In the original data there is a strong effect in one subgroup with no effect in the other. This variable is missing 18% of its values, so imputing these results in considerable misclassification. For inclusion as a simple covariate, this misclassification would be addressed and the standard errors for the regression coefficient would be accurate.

When pursing analyses to illustrate this interaction term, my effects seen in the original complete data now are washed out. In the subgroup that had showed a strong, clear effect, it now is not strong and no longer statistically significant. I think that is due to the misclassification, not due to having a fuller more accurate dataset.

Your thoughts? Any references? Need clarification?

More Karen Buck's questions See All
Similar questions and discussions