I have collected more than 2.5 million Arabic news headlines posted on Twitter by newswire accounts (BBC, CNN, Aljazeera, AlArabyia, etc.) from 2008 to the present.
What kinds of tasks can be performed on a dataset this large?
I am mainly interested in classification, clustering, and opinion analysis.
Please guide me.
Thanks,
Ali