What are the different types of feature selection techniques, and where does each apply? In particular, can these feature selection techniques be used for both discrete and continuous data?
Feature selection methods can be classified as filters and wrappers. Filters rank features independently of any classifier; one can use Weka to obtain such rankings with the InfoGain, Chi-square, and CFS methods. Wrappers, on the other hand, use a learning algorithm such as SVM or Random Forests to search for and report an optimal feature subset. For example, one may use a genetic algorithm (GA) to build a population of random solutions (feature subsets). Each subset yields a reduced dataset, which is fed to an SVM that returns a 10-fold cross-validation classification accuracy (CVA); that CVA serves as the subset's fitness. Once the population is built, the GA takes over and keeps improving the fitness landscape. After a number of iterations, one may hope to obtain an optimal feature subset.
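A minimal sketch of such a GA wrapper, assuming a scikit-learn SVC and the built-in breast-cancer dataset; the population size, generation count, and mutation rate are arbitrary illustrative choices, not tuned values:

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X, y = load_breast_cancer(return_X_y=True)
n_features = X.shape[1]

def fitness(mask):
    """10-fold CV accuracy of an SVM trained on the selected columns."""
    if not mask.any():               # empty subsets get the worst score
        return 0.0
    return cross_val_score(SVC(), X[:, mask], y, cv=10).mean()

# Initial population: random feature subsets encoded as boolean masks.
pop = rng.random((20, n_features)) < 0.5
scores = np.array([fitness(m) for m in pop])

for generation in range(15):
    # Tournament selection: keep the fitter of two random individuals.
    i, j = rng.integers(len(pop), size=2), rng.integers(len(pop), size=2)
    parents = np.where((scores[i] > scores[j])[:, None], pop[i], pop[j])
    # Uniform crossover between the two tournament winners.
    cross = rng.random(n_features) < 0.5
    child = np.where(cross, parents[0], parents[1])
    # Bit-flip mutation with a small per-gene probability.
    child ^= rng.random(n_features) < 0.05
    # Replace the worst individual if the child improves on it.
    child_score = fitness(child)
    worst = scores.argmin()
    if child_score > scores[worst]:
        pop[worst], scores[worst] = child, child_score

best = pop[scores.argmax()]
print(f"best CV accuracy: {scores.max():.3f} with {best.sum()} features")
```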
First, consider whether a 'wrapper' or a 'filter' method is better suited to your situation.
- Filter methods evaluate features outside the context of your model, checking them against some fixed threshold or criterion for inclusion.
- Wrappers are essentially search algorithms that add/remove features and try to optimize a feature set. I commonly use 'recursive feature elimination'; both styles are sketched right after this list.
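A rough illustration of both styles in scikit-learn; the dataset and the choice of k=10 are placeholders:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import RFE, SelectKBest, mutual_info_classif
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

# Filter: rank features by mutual information (an information-gain
# analogue) and keep the top 10, with no model in the loop.
X_filtered = SelectKBest(mutual_info_classif, k=10).fit_transform(X, y)

# Wrapper: recursive feature elimination with a linear SVM, dropping the
# weakest feature each round until 10 remain.
rfe = RFE(SVC(kernel="linear"), n_features_to_select=10).fit(X, y)
X_wrapped = X[:, rfe.support_]
```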
Filter methods are simpler and faster, but wrapper methods can give you a set of features optimized for your model. In both cases, be careful to use only the training data when doing feature selection, since it really is part of building the model.
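One common way to enforce this in scikit-learn is to put the selector inside a Pipeline, so each cross-validation fold selects features from its own training split only. A sketch, with arbitrary selector and classifier choices:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)
pipe = Pipeline([
    ("select", SelectKBest(mutual_info_classif, k=10)),
    ("clf", SVC()),
])
# cross_val_score refits the whole pipeline per fold, so the selector
# never sees that fold's test data.
print(cross_val_score(pipe, X, y, cv=10).mean())
```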
Finally:
- There are several models that have feature selection "built in", for example random forest. Training a random forest model will typically also output "importance" values for each feature (see the sketch after this list).
- You should consider creating dummy variables from your categorical variables; the sketch below shows this step too.
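A small sketch covering both points, with made-up column names: dummy-encode a categorical column with pandas, then read the importance values a random forest reports.

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

df = pd.DataFrame({
    "age": [25, 40, 31, 58, 46, 33],
    "city": ["NY", "LA", "NY", "SF", "LA", "SF"],  # categorical
    "bought": [0, 1, 0, 1, 1, 0],
})

# One 0/1 dummy column per category level.
X = pd.get_dummies(df[["age", "city"]], columns=["city"])
y = df["bought"]

forest = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
for name, imp in zip(X.columns, forest.feature_importances_):
    print(f"{name}: {imp:.3f}")
```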