Hello Everyone! I am trying to run cluster analysis on a set of large mammal data in order to try and recreate ecological profiles for extinct species. However, as one would expect with these data sets there are missing values. I have been trying to run k-means cluster analysis in R however the missing data limits this significantly. Is there a way of filling in this missing data (something similar to nearest neighbor) in order to assign species with missing data to clusters? Unfortunately there is different pieces of data missing form different species.

More Christopher Brooke's questions See All
Similar questions and discussions