I am building a classification model using Random Forrest, Decision Trees & Logistic regression. 

For one of my critical variable i have some missing values. In some earlier models during similar situation i have used mean of that variable for the missing entries. 

I see that few analysts use median or even a 0 for the missing value. What is the rational behind it and does it vary from one use to another. 

Thank you. 

More Shivi Bhatia's questions See All
Similar questions and discussions