My thesis is about online review fraud detection. I need a dataset that contains ground truth about whether a review is genuine or not. Is such a dataset available?
Try this link: http://odds.cs.stonybrook.edu/yelpzip-dataset/
The dataset includes 608,598 reviews of 5,044 restaurants by 260,277 reviewers. To get the datasets with ground truth, please email: [email protected]
The dataset I used in my previous research contains labels indicating whether each review is fake or real.
You can contact the owner of the dataset to request access. Her publication is the conference paper "Collective Opinion Spam Detection".