I partitioned my data into train and test sets. I did outlier detection and missing value imputation in my training set. Then, I built a model on this set. I am not sure what I should do about the missing values and outliers in my test set? For example, in case of missing values, should I ignore them in the test set? Impute them using the same algorithm? Fill them with a simple estimate, like mean (mean estimated from the test set or train set??)

More Farideh Bagherzadeh Khiabani's questions See All
Similar questions and discussions