Hi there,

I am planning to run some classification models and found that several of my 53 predictors are count variables with a 5-6 extreme outlying values (out of N=1,500)

My questions

1) I tend to eliminate those but would like to know how outliers in predictors can have any effect on the modeling (perhaps affect stabilty, variance, convergence?)

2) Are there systematic approaches how to detect outliers *in count variables*. I find that difficult as e.g. poisson distributions would not enable it to conceptualize outliers at all (as the vlaues run to infinity theoretically).

3) Should I remove or trim the outlying values? And if I should trim, what would be the approach?

4) Am I hysterical and do I make a fuzz about nothing? :D

Best,

Holger

More Holger Steinmetz's questions See All
Similar questions and discussions