Regarding the subject of handling imbalanced data (specifically in NLP):
1. Is there any benefit to balancing nearly-balanced classes?
(say, majority to minority data ratio of 60:40 or even 55:45)
2. Can such a procedure cause more harm than good?
Thank You.