I am working on Sentiment Analysis and have a dataset that has 48000 tweets with labels. 7000 Negative, 17000 Neutral and 24000 Positive Tweets. Is my dataset is balanced and if not how can i balanced it?
Balancing is not necessary, however., if you are still interested in handling balancing issues, I suggest you to follow https://towardsdatascience.com/handling-imbalanced-datasets-in-machine-learning-7a0e84220f28
On your DS the amount of positive is 3.4 times more, that amount of negative - so it is evident, that DS is imbalanced. To make balanced Ds it is possible to use different techniques - random under-sampling (RUS), random over-sampling (ROS), SMOTE, etc.