I want to use Thyroid data-set(UCI repository) for my machine learning approach. The data-set having 21 attributes and 7200 instances. 15 instances out of 21 are binary and rest are continuous. Please suggest me how to pre-process this data so that i can use it for my work.