Hi, are you aware of any open data set on crowdsourcing, which include information such as profiles of workers/tasks, records of behaviors, completion status of tasks? Or are there any recommended method for obtaining such data set quickly? Thanks!
You are probably not going to find a data set in the public domain because those pieces of information represent proprietary information that has a competitive aspect to it, that could give one company a competitive advantage over another that was less forthcoming. The best way to get that type of information if it is at all possible is to approach the industry leader and offer a non-disclosure agreement if they will let you look at their proprietary dataset.
Joanna is right. Also, most researchers who created their own datasets would be happy to share their data. Just search for research papers that are close to your domain and try to identify if their data would be useful for you. If you find one, just send an email and ask them if they can share their data. If their data is not somehow classified/private/closed, they can share it.
Thank you very much for all of your suggestions and comments. I have checked some websites such as crowdsourcing.org recemmended by Joanna, but cannot find those types of dataset that we are interested in. We will consider crawl data from some crowdsourcing platforms.
If you're looking for datasets which contains labels and anonymized worker ID, you may access the NIST TREC Crowdsourcing Track website (https://sites.google.com/site/treccrowd/). Since 2011, TREC Crowdsourcing Track annually distributes crowdsourcing datasets publicly to encourage academic research. In addition, you can find some other datasets from the following links: Dr. Matt Lease' lab in UT Austin (http://ir.ischool.utexas.edu/square/data.html) and Dr. Panos Ipeirotis' lab in NYU (https://github.com/ipeirotis/Get-Another-Label).
Thank you very much for your detailed answers, Hyunjoon! I will try the links you provided and see if I can find some appropriate informations through them.