I am currently working on a bibliometric analysis based on a scientific literature corpus extracted from Scopus and treated using the text mining software VantagePoint. I want to "purge" the institutional affiliations list, since there is a lot of incomplete or diversely spelled university's and research center's names included in the articles subject to the analysis. The List Cleanup tool included in VantagePoint seems quite difficult to use in order to save time and have optimum results when dealing with more than 3 thousand records. I feel a bit impatient about it because I thought it might take a while to do this task but the functioning of the tool doesn't seem so advanced and I feel all of the energy put on it might be wasted. Any advice?