I have some questions on classification using Support Vector Mahine Recursive Feature Extraction with WEKA. RFE gives a final ranking for the attributes however one needs a method to decide which featuers to select from the final ranking. i am dealing with binary classification and have a set of features to discriminate between patients and healthy individuals but my dataset is unbalanced (have a few samples from patients) and need some help regarding these questions:

- Should i normilize the data or not? when and why we normilize the data?

- using SVM-RFE what is the best method to decide which features should be condidered to build the classifier and which not (sensitivity, accuracy, kappa statistic,....) considering that i am dealing with unbalanced data

Similar questions and discussions