24 November 2017 2 4K Report

Hi All.

I am building a churn model and using a host of models - logistic, Decision Tree & Random Forrest.

Currently i am in the initial phase of my first model - Logistic Regression.

There are multiple variables of which only 3-4 are numeric. If the outcome variable would have been numeric [Linear regression] we could have used VIF & tolerance to study which one to keep or remove, whereas the outcome in the case is dichotomous [ 1/0]. There is a bit of confusion as in some of the research articles people are still recommending using VIF for binary outcome variables while in some cases they are not.

Please advice.

More Shivi Bhatia's questions See All
Similar questions and discussions