I am looking for an open MOOC dataset (OLI, Coursera, Edx, Udacity etc.) which includes discussion forum data (text-based). If this has a classification of topics (coarse-grained is fine) and separation of threads and replies would be great.
"How can I obtain datasets for sessions offered at other universities?
Currently, Coursera’s agreements with partner institutions only permit Coursera to share data from sessions with researchers at the institution sponsoring that class. To obtain data for a session sponsored by a different partner institution, researchers should directly contact the data coordinator at that institution. Contact information for data coordinators may be obtained through CourseOps."
The first year edX MOOC data published by Harvard and MIT researchers is quantitative. It is still interesting to get demographic variables overall of MOOC participants. However, you have to keep in mind that the data is a subset of the actual data due to the process of de-identification. The process of de-identification necessitated removing on an average 21% of the records from the open dataset, and as high as half of the records from at least one course.
The researchers themselves have shown how that might affect validity of analysis conducted. See: Daries, J. P., et al. (2014). "Privacy, anonymity, and big data in the social sciences." Communications of the ACM 57(9): 56-63.
Thanks for the Coursera information Safwan and the datashop information Llanos.
The Harvard Dataset doesn't seem to have much data on discussion forums. Can anyone please suggest where to find data on discussion boards of learning management systems or MOOCs.
Hi Ashish Dutt , Please can you tell me how download the data of Harvard and MIT university.I try the link (ttp://papers.ssrn.com/sol3/papers.cfm?abstract_id=2381263).it is allow me just to download paper with many thanks