We all know the pivotal role pre-processing plays in text mining tasks and NLP. But I couldn't find any paper or scientific blog post which thoroughly discusses about pre-processing of textual data and it's impact on results.
So is there any paper or scientific blog post which covers this topic??