I'm aware of some projects in sociolinguistics and historical linguistics that share their data either as open access, without any substantial restrictions or delays, or without any "application" process as long as the work is for non-profit purposes. The idea is that anything beyond a simple "Safeguard" letter hinders the maximal exploitation of limited and valuable resources.
These best practice examples, which make (often publicly funded) data collections available to the public, deserve recognition. While I can think of many historical data collections, such as the Helsinki Corpora Family or the BYU corpora, the more contemporary the data get, the fewer resources are publicly accessible. On the more contemporary end, the exceptions I can think of are
* the Linguistic Atlas Project (http://www.lap.uga.edu)
and our own
* J. K. Chambers Dialect Topography database (http://dialect.topography.chass.utoronto.ca)
* Dictionary of Canadianisms on Historical Principles (www.dchp.ca/dchp2).
Which other projects that actively share their data do you know of?
I'd appreciate your input for a list of Best Practice Data Collections that I'm preparing.
Best wishes,
Stefan D.