I'm conducting a study measuring happiness across 4 time points, aiming to determine if there's an increase in overall happiness. The required sample size is 24 for four time points and 28 for three. However, I need help with participant retention. Here's the breakdown:
Survey 1: 24 participants
Survey 2: 36 participants (8 returned from Survey 1)
Survey 3: 60 participants (24 returned from Survey 2, none from Survey 1)
Survey 4: 100 participants (some overlap with previous surveys) To analyze the data, I'm considering only those who completed at least two surveys from Surveys 2, 3, or 4. This resulted in 36 participants and about 20% missing data. I've excluded Survey 1 participants due to more than 50% missing data.
My questions are:
Is it statistically valid to include only participants who completed at least two surveys from Surveys 2-4? Should I impute missing data and use repeated measures ANOVA, or would a linear mixed-effects model better handle the missing data? My model is straightforward:
Happiness score is a dependent variable, and time is an independent variable.
Any advice would be greatly appreciated! I appreciate any help you can provide.