Due to a recent "BigData" hype, I've recently tried to analyze the possible sources of such data for research projects. The interesting finding is: there does not appear to be such a huge amount of "usable" data in the open space. But I would love to be wrong.
The definition of "usable" in this context is roughly:
What I see is:
To cut it short, the amount of "useful" open data on the web (see definition of useful above) appears to be quite small. And much of the company-owned data cannot be explored because of the privacy concerns (unless you work for NSA). Besides, the companies such as Google and Facebook base their business on exclusive access to this data.
Does this mean that open data research is essentially about companies digging through the data they somehow collect on their own (+satellites) or am I missing the point?