I'm analyzing some data and having the issue that I found with a steam&leaf analysis that there are several outliers in the data.
One the one hand I would like to put the data out, but on the other hand I think the participants have understood the question and answered it correctly?
I read some articles in the past and I thought both ways are feasible. Can anybody help me with some significant articles to underpin my approach? Or does anybody know articles who discuss the need of keeping outliers in the Data? In my case I have some extreme outliers in annual income and the mean with taking the outliers out of my data is four times higher.