Hello researchers!

I am doing a multivariate linear regression model, 4 of my predicting variables are "yes/ no/ I don't know" questions (analyzed as dummy variables), I decided to code the I "don't know" as missing values, but now regression analysis will do listwise deletion and I will lose about 25% of my data (each of my 4 variables have 13% to 2% missing data, combined they involve 25% of the sample).

Should I continue with listwise deletion and lose that much of my sample? or should I do multiple imputation? and if so, do I need to do it for the 4 variables or I can for example do multiple imputation for 3 variables and listwise deletion for one? (i.e., 2% listwise deletion is not a big deal, but I am not sure if this will bias my results).

Note: my final regression model contains more variables than just these 4, and these other variables contain no missing data.

More Husam Aldean Mahdi Abuhayyeh's questions See All
Similar questions and discussions