I wonder whether it is reliable to conduct an imputation of full scales/questionnaires in our study.

We conducted a longitudinal data collection of 3 waves. There's a time of 3-4 years between each wave (wave 2 was collected 3 years after wave 1, wave 3 was collected 3-4 years later after wave 2). However, some participants collected in wave 3 left some of the questionnaires unanswered, which means that those answers are missing.

Something important to add is that all the participants of the three waves were emerging adults, which means the participants are undergoing numerous and significant personal, social, work-related, and other life changes over the years (and wave 2 was collected during Covid lockdown). Longitudinal analyses between waves 1 and 2 have shown that the relative stability of most scales and subscales ranges from 0.3 to 0.6, which is not especially high. Additionally, some participants were not present in wave 2, meaning that the data we have for the imputation of the responses of these participants would come from only one collection. Is it methodologically reliable to impute responses to entire questionnaires based on the responses provided by those same participants in previous waves to those same questionnaires? (and given the characteristics of the developmental stage in which they are). And, if it is possible, which would be the more robust methods to do so?

More Paula Domínguez-Alarcón's questions See All
Similar questions and discussions