Hi All.
I am building a churn model and using a host of models - logistic, Decision Tree & Random Forrest.
Currently i am in the initial phase of my first model - Logistic Regression.
There are multiple variables of which only 3-4 are numeric. If the outcome variable would have been numeric [Linear regression] we could have used VIF & tolerance to study which one to keep or remove, whereas the outcome in the case is dichotomous [ 1/0]. There is a bit of confusion as in some of the research articles people are still recommending using VIF for binary outcome variables while in some cases they are not.
Please advice.