I have a set of ecological data that includes extinct species, and I have filled in the missing values with predicted medians. Unfortunately, I have not yet found a suitable way to validate this imputation. If I jackknife the data set (removing known values one at a time and re-imputing them with the median), the imputed values are highly unlikely to match the ones I removed, so the model validation comes out very weak.
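To make the jackknife idea concrete, here is a minimal sketch of a leave-one-out check that scores the median imputation by how far off it is on average, rather than by whether it reproduces each removed value exactly. The function name `loo_median_validation`, the choice of error summaries, and the example numbers are all assumptions for illustration, not part of my actual data or a recommended method.

```python
from statistics import mean, median


def loo_median_validation(values):
    """Leave-one-out check of median imputation on known values.

    Each known value is hidden in turn, imputed as the median of the
    remaining values, and compared with the value that was hidden.
    """
    if len(values) < 2:
        raise ValueError("need at least two known values")

    errors = []
    for i, actual in enumerate(values):
        rest = values[:i] + values[i + 1:]   # data set with one value removed
        predicted = median(rest)             # the imputed value
        errors.append(predicted - actual)    # signed error for this case

    return {
        "mae": mean(abs(e) for e in errors),         # typical size of the miss
        "rmse": mean(e * e for e in errors) ** 0.5,  # penalises large misses
        "bias": mean(errors),                        # systematic over/under-estimate
    }


# Hypothetical trait values for species with known data (illustrative only)
known = [2.0, 3.0, 4.0, 5.0, 100.0]
result = loo_median_validation(known)
print(result)
```

Error summaries like these can then be compared against another imputation approach run through the same loop, which may be more informative than counting exact matches.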
