The tools are usually associated with the environment you are with, for example if you are in CRM SAP or Oracle they have there now set of tools to do the cleaning up, reformatting, export to BI etc.. If you speak in general, this is dependant of the DB or file system you have and how structure your data are. If DB most for most of them SQL do a pretty serious job, if you are on Hadoop Hbase then you have all the open source including Hive type of SQL Like.
If you have files type CSV and the size is in MG Bytes then many script languages could do the job, as mentioned SAS, R, Python will do.
These are great resources/answers, thank you! As of this moment, we are trying to determine the uniqueness of our product (at least compared to what is commercially available). Has anyone seen anything similar to the CRM Cleanup algorithms we use for our customer?