In the wake of the Snowden leaks, the public has learned that the sheer technical scale of surveillance goes beyond what has usually been thought possible, such as a "full take" of internet communication. The picture emerges that intelligence agencies are prominent users and drivers of "big data" technology. But from the media reporting, capabilities remain rather unclear, certainly also because of the variety of different surveillance programs, the complexity of the technology, and of course secrecy.

I would like to know more about what the technologies employed are and how far technical capabilities go. Since it may be a vast topic, let me narrow the question down: I would like to learn about data analysis capabilities, not about the use of cryptology or computer security research for surveillance. The focus is on mass surveillance rather than spying on specific target persons. I am interested in efforts to make society as a whole "machine-readable" by using techniques such as data mining, machine learning, social network analysis etc.

Especially interesting are things like:

- the intent to build a comprehensive social graph of large population segments [1]

- the use of social media and other data streams to anticipate political events such as protests, civil unrest etc., e.g. [2]

Can you point to publications? Work done in the service of surveillance and contributions of computer scientists to the critical debate around the issue would be equally interesting.

[1]: http://qz.com/129425/how-the-nsa-builds-social-graphs-of-americans-with-phone-email-and-location-records/

[2]: http://arxiv.org/abs/1402.7035

Similar questions and discussions