Hello!
I'm currently working on the LDQ detection for my thesis. I have read so many things about missing values and how to deal with them that unfortunately I am now even more confused than ever before.
I am working with SPSS 26 and I have several categorical as well as continuous variables which are important for my research question.
Initially I followed the procedure Analyze menu > Missing Value Analysis
The range of my n is between 145 and 158 (so not a really huge sample size). In the univariate statistics I get percent missing around 7.6% to 8.2%.
I ran Little's MCAR test which became non-significant with Chi²(15) = 1.390 and p = 1.000
I made a mistake in the construction of my questionnaire because participants were not able to skip questions - looking at the data set most of my missing values is simply because after a certain amount of time people simple dropped out and every question that would have followed was considered "missing". The "initial" sample size without the drop outs was around 350 participants.
I ready that missing data analysis would be important especially to look what type of missing it is, that you're dealing with in your data set.
To me (or my really limited understanding of those things compared to experts) it makes complete sense that I have higher amounts than the recommended 5% of missings where some sort of imputation or replacement of the missing values should be considered because mine are mostly due to drop outs. Also the p = 1.000 at the Little's test is highly irritating for me.
I have read a ton of articles the past few days but even though they all give really good explanations for missing data and multiple imputation and which method is best - I found no answer what I should do in my specific case where the missing data is because people stopped at one point to answer the questionnaire and closed it (I already stated in my limitations that preventing them from skipping questions should be handled in a different way in the future).
Can someone please help me out and give me a recommendation?
I found an article from D. A. Bennett (1999) that is called "How can I deal with missing data in my study?" and gives a cut off of 10% of missing values but gives no information if it is considered for each variable or the overall data set and I found no way to calculate the missing values for the complete data set instead of "just" columns or rows.
I hope I somehow was able to explain my issue with that missing values due to drop out and my confusion how to handle them and especially how to argue in my thesis WHY I handled them the way I did.
I really hope that some expert finds time to give me a recommendation - I'm happy to answer any further questions. I'm simply overwhelmed atm with the amount of information I read and suddenly nothing makes sense anymore.
Thanks everyone in advance!