I'm working on a set of clinical data and am eager to know if there is any method that can help me to decide whether to keep a variable (feature) or discard it because of the percentage of its missing data. For example, is there any method that can lead us to this conclusion that because there is 50% of missing data in variable X, this variable should be discarded?

I have to note that I'm coding in MATLAB and am not using statistical tools such as SPSS.

Thank you all in advance.

Similar questions and discussions