Hi all,
I am dealing with data with sevaral features and many of them are highly correlated with each other as well as with dependent variable.
In my research on this topics, I found that multicolinearity is harmful for regression problem and may not end up with good model. I got some suggestion that if the features are highly correlated then we have to remove them using VIF criterion.
But, logically when I think of removing correlated features from my analysis how can expect better model as I am not considering all the available information.
Is there any logical explaination or mathematical explaination is available for the above question?
Also, I am thinking that each features are somehow related to any of the other (May be nonlinearly) in that case, do we have problem of multicolinearity ?