You could invoke a search method, such as "Best First" or "GeneticSearch", and then use an evaluator such as "Correlation-based Feature Subset Selection", a wrapper method, etc. to evaluate the worth of each candidate feature subset. Google for the associated references; a rough sketch of the idea follows below.
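As an illustration of that search-plus-evaluator pattern, here is a minimal Python sketch: a greedy forward search (a simplified stand-in for best-first search) scored with a CFS-style merit function. The function names, the stopping rule, and the use of Pearson correlation are assumptions of the sketch, not Weka's implementation.

```python
import numpy as np

def cfs_merit(X, y, subset):
    """CFS-style merit: k * r_cf / sqrt(k + k*(k-1) * r_ff), where r_cf is
    the mean |feature-class correlation| and r_ff the mean |feature-feature
    correlation| within the subset."""
    k = len(subset)
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if k == 1:
        return r_cf
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = np.mean([abs(np.corrcoef(X[:, a], X[:, b])[0, 1]) for a, b in pairs])
    return k * r_cf / np.sqrt(k + k * (k - 1) * r_ff)

def forward_search(X, y, max_features=10):
    """Greedily add the feature that most improves the merit; stop when no
    candidate improves it."""
    selected, remaining, best = [], list(range(X.shape[1])), -np.inf
    while remaining and len(selected) < max_features:
        merit, j = max((cfs_merit(X, y, selected + [j]), j) for j in remaining)
        if merit <= best:
            break
        best = merit
        selected.append(j)
        remaining.remove(j)
    return selected

# Toy usage: two informative features out of 30.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 30))
y = (X[:, 0] + X[:, 1] > 0).astype(float)
print("selected:", forward_search(X, y))
```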
We evaluated the performance of different classifiers in combination with various feature selection methods in the context of NMR-based metabolomics. Just have a look at the attached publication.
Article: Performance Evaluation of Algorithms for the Classification ...
This is quite a general question. Do you have a particular problem to which you want to apply feature selection? The choice of method may depend on many factors. Is it for regression or classification? Two classes or more? Noisy data? Highly redundant data? Time series or not? ...
Anyway, here is a nice introduction to feature selection:
You can also find descriptions of some well-known methods on Wikipedia:
https://en.wikipedia.org/wiki/Feature_selection
Concerning supervised selection, if you do not have a lot of data, you may take a look at filter methods, such as minimal-redundancy-maximal-relevance (mRMR). If you have a very large amount of data and not too many features, an embedded method or a wrapper, such as recursive feature elimination with an SVM (embedded), regularized trees (embedded), or a genetic algorithm (wrapper), may be more relevant. A sketch of the SVM variant is given below.
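As a concrete illustration of the embedded option mentioned above, here is a minimal scikit-learn sketch of recursive feature elimination with a linear SVM (SVM-RFE); the synthetic data and every parameter value are assumptions for the example, not recommendations.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE
from sklearn.svm import LinearSVC

# Synthetic two-class problem standing in for real data.
X, y = make_classification(n_samples=200, n_features=1000, n_informative=20,
                           random_state=0)

# At each iteration, drop the 10% of features with the smallest |SVM weight|
# until 20 features remain.
rfe = RFE(estimator=LinearSVC(C=1.0, dual=False), n_features_to_select=20,
          step=0.1)
rfe.fit(X, y)
print("selected feature indices:", list(rfe.get_support(indices=True)))
```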
If you are coding in MATLAB, here is a feature selection library with a wide variety of methods:
After reading my question again, I think I can be more specific. My label has only two classes, the number of samples is 200, and the number of features is 15,000. So I am interested in feature selection, not feature extraction. I do not know whether this additional information changes your answers?
We can broadly classify feature selection algorithms into two main categories, namely, wrapper methods and filter methods.
Filter methods study the relationship between the features and the class label in order to rank the features before feeding the top-ranked ones into your classification method. They include correlation criteria, mutual information, the Fisher criterion, etc. (a small example follows below).
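As a small example of the filter idea, a mutual-information ranking in scikit-learn might look as follows; the synthetic data and the choice of k = 20 are arbitrary assumptions for the example.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, mutual_info_classif

X, y = make_classification(n_samples=200, n_features=500, n_informative=15,
                           random_state=0)

# Rank every feature by its estimated mutual information with the class
# label and keep only the 20 best-ranked ones.
selector = SelectKBest(score_func=mutual_info_classif, k=20)
X_top = selector.fit_transform(X, y)
print(X_top.shape)  # (200, 20)
```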
Wrapper methods use the prediction performance of a model to select the feature subset: the model is wrapped in a search algorithm that looks for the subset giving the highest classification score in your case. Examples include sequential forward/backward selection, recursive feature elimination, and genetic algorithms (see the sketch below).
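And a minimal wrapper counterpart: greedy forward selection in which each candidate subset is scored by the cross-validated accuracy of the wrapped classifier. The logistic-regression model and all parameter values here are placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=50, n_informative=10,
                           random_state=0)

# The search adds one feature at a time, keeping the addition that gives
# the best 5-fold cross-validated accuracy of the wrapped model.
sfs = SequentialFeatureSelector(LogisticRegression(max_iter=1000),
                                n_features_to_select=10, direction="forward",
                                scoring="accuracy", cv=5)
sfs.fit(X, y)
print("selected feature indices:", list(sfs.get_support(indices=True)))
```

Note how the wrapper refits the model for every candidate subset, which is why wrappers are far more expensive than filters on wide data.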
A good review of feature selection approaches is given in Chandrashekar, G., & Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering, 40(1), 16–28.
More generally, you have to keep in mind that feature selection is about two things: the relevance of your features and their redundancy. In your specific case (15,000 features), I would suggest doing a PCA first. In fact, PCA can be used as a feature selection approach: it will tell you which features are linearly dependent or independent (for PCA-based selection, see https://stats.stackexchange.com/questions/27300/using-principal-component-analysis-pca-for-feature-selection, and the sketch below).
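A minimal sketch of that PCA-based selection, along the lines of the linked discussion: fit a PCA, then keep the original features with the strongest loadings on the leading components. The numbers of components and features retained are arbitrary choices for the example.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA

X, _ = make_classification(n_samples=200, n_features=500, random_state=0)

pca = PCA(n_components=10).fit(X)
# Score each original feature by its largest absolute loading across the
# 10 retained components, then keep the 20 highest-scoring features.
loading_score = np.abs(pca.components_).max(axis=0)
top = np.argsort(loading_score)[::-1][:20]
print("features with the strongest loadings:", sorted(top.tolist()))
```

Unlike a supervised filter, this selection never looks at the labels, so it captures redundancy but not relevance.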
Van der Maaten, L., Postma, E., & Van den Herik, J. (2009). Dimensionality reduction: A comparative review. Journal of Machine Learning Research, 10(1–41), 66–71.
Sorzano, C. O. S., Vargas, J., & Pascual-Montano, A. (2014). A survey of dimensionality reduction techniques. arXiv preprint, 1–35.
In our work, we included a supervised method in the overall structure of the classifier that selects the features yielding the highest accuracy rates for electromyography signal pattern recognition to predict human movement. I believe this is exactly what you are looking for.
Our paper is available at http://ieeexplore.ieee.org/document/8036844/
A collection of very simple techniques to begin with (e.g., computing the chi-squared statistic between each predictive attribute and the class attribute) can be found in the following paper: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.32.9956 (a short example follows below).
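For instance, that chi-squared filter takes only a few lines in scikit-learn; the synthetic data (rescaled to be non-negative, as the chi-squared test requires) is an assumption of the example.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import chi2
from sklearn.preprocessing import MinMaxScaler

X, y = make_classification(n_samples=200, n_features=100, random_state=0)
X = MinMaxScaler().fit_transform(X)  # chi2 needs non-negative feature values

# One chi-squared statistic (and p-value) per feature against the class.
scores, p_values = chi2(X, y)
top10 = sorted(range(X.shape[1]), key=lambda j: -scores[j])[:10]
print("top 10 features by chi-squared score:", top10)
```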