Little and Rubin (2002) introduced three major missing data mechanisms. If the cause of missingness is independent of data, missingness is called missing completely at random (MCAR).  On the other hand, in missing at random (MAR) mechanism, missingness depends on data which is observed yet independent of the unobserved data. Finally, third mechanism termed missing not at random (MNAR) because the pattern of missing data is non-random and depends on the missing variable.

According to the literature, you cannot ignore missing data which is "Missing not at random" (MNAR). My question is: given a dataset, how do you identify the missingness mechanism?

Similar questions and discussions