Documents published by companies and governmental agencies could contain rich metadata. For example, the names of the people involved in the redaction and their roles in the organization. This kind of information is very useful to perform a phishing attack. A good tool for perform automated data mining over the meta data contained in documents is: http://blog.elevenpaths.com/2013/12/foca-final-version-ultimate-foca.html