I am trying to build a classification based machine learning model using efficient feature selection methods that predicts a dependent variable based on few independent variables.

The objective of feature selection algorithms is to generate discriminating features to accurately distinguish the type of independent variable. However, I am struggling to find efficient solution around this problem related to generating discriminating features (currently the features belonging to different labels highly overlap with each other) for each of the unique labels of the independent variable.

In addition, there is a variation observed among the independent variables across different datasets. Hence, there is recommendation required around the suitable supervised machine learning algorithms that perform learning and prediction based on overall generalizations of the data and not instance based.

Any thoughts or guidance towards the potential solutions around the highlighted problems would be greatly appreciated.

* The independent variables hold numerical floating point values

More Saurabh Malgaonkar's questions See All
Similar questions and discussions