Hi, I want to use both a feature selection and a feature extraction method before classification. Do I apply feature selection first and then feature extraction, or the reverse? Please help me.
First you should extract some candidate features. Afterwards you can apply feature selection methods to determine whether all the features you extracted are really valuable, and to find out which of them are.
I propose the following steps for your classification process (a minimal code sketch follows the list):
1- Extract Features
2- Select relevant features (e.g. using the MRMR method)
3- Apply PCA, which lets you analyse all the components and determine the ones that contribute the most, i.e. those that explain the maximum variability. The eigenvalues tell you how much of the total variability a given set of factors explains, e.g. whether F1, F2, ..., F8 together explain 87% of it.
4- Apply a classifier (perform the classification) to find out how many groups/clusters you obtain and which samples belong to which group/cluster.
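To make the steps concrete, here is a minimal Python/scikit-learn sketch of this pipeline, assuming the features have already been extracted into a matrix X. Note that scikit-learn has no built-in MRMR, so SelectKBest with mutual information stands in for step 2; the sample sizes, k=50, and the 87% variance threshold are illustrative choices, not prescriptions.

```python
# Sketch of the extract -> select -> PCA -> classify pipeline.
# Assumes features are already extracted into X (n_samples x n_features).
# SelectKBest with mutual information is only a stand-in for real MRMR.
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, mutual_info_classif
from sklearn.decomposition import PCA
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

# Placeholder for your own extracted features
X, y = make_classification(n_samples=300, n_features=200, n_informative=20,
                           random_state=0)

pipe = Pipeline([
    ("select", SelectKBest(mutual_info_classif, k=50)),  # step 2: relevance-based selection
    ("pca", PCA(n_components=0.87)),                     # step 3: keep components explaining ~87% of variance
    ("clf", SVC(kernel="rbf")),                          # step 4: classifier
])

print(cross_val_score(pipe, X, y, cv=5).mean())
```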
Hi Mamadou Bamba, You are right. I have applied MRMR first and then PCA, and it works better than applying PCA first and then MRMR. My question: Why does PCA perform well after MRMR?
It works better because the features that PCA combines are class-relevant: they have been selected with respect to a class-dependent criterion. Conversely, the principal components you try to select may not be class-relevant, since the PCA criterion (total covariance) is not class-dependent.
An alternative you should try is a simple LDA (Linear Discriminant Analysis) or any derivative (Quadratic DA, Regularized DA, etc.), which also performs dimension reduction by combining features according to within-class and between-class criteria (this is where the class relevancy comes in).
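For illustration, a minimal scikit-learn sketch of LDA used as a supervised dimension reducer; the iris data is just a stand-in, and note that LDA yields at most n_classes - 1 components:

```python
# Sketch: LDA as a supervised dimension reducer (at most n_classes - 1 components).
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)   # 3 classes -> at most 2 LDA components
lda = LinearDiscriminantAnalysis(n_components=2)
X_lda = lda.fit_transform(X, y)     # features combined by between/within-class criteria
print(X_lda.shape)                  # (150, 2)
```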
Hi C. Frelicot, Thanks for your nice explanation. I am dealing with a two-class problem with 4096 features, so LDA provides only one dimension, which is not good enough for my task: going from 4096 dimensions to one makes the average precision zero. How do I apply LDA to my two-class problem to get a better result?
First, feature extraction. But feature extraction involves some transformation. PCA extracts linear combinations of the original variables, searching for maximum variability in the data, but it is not class-dependent. You could try PLS, ICA and kernel PCA to extract features using different optimization criteria and compare.
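As a sketch of that comparison, assuming scikit-learn and synthetic data in place of your own features (the component counts are arbitrary):

```python
# Sketch: three extraction criteria side by side on the same data.
from sklearn.datasets import make_classification
from sklearn.cross_decomposition import PLSRegression  # class-dependent criterion
from sklearn.decomposition import FastICA, KernelPCA   # class-independent criteria

X, y = make_classification(n_samples=300, n_features=50, random_state=0)

X_pls, _ = PLSRegression(n_components=5).fit_transform(X, y)        # maximizes covariance with y
X_ica = FastICA(n_components=5, random_state=0).fit_transform(X)    # statistical independence
X_kpca = KernelPCA(n_components=5, kernel="rbf").fit_transform(X)   # variance in kernel space
print(X_pls.shape, X_ica.shape, X_kpca.shape)
```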
I would like to add a general comment on feature selection which I have stated in several blogs.
There are many feature selection methods available, like LDA, Fisher's Discriminant with the Rayleigh coefficient, intra-class minimizers, etc. What usually does not work is: PCA! Why? PCA tries, based on a Gaussian process (assumption), to measure the variance and sort the eigenvalues, which are proportional to the variances, in descending order. The assumption is that the main eigenvalues (EWs) contain most of the information, and therefore we use the main components for data reduction. So far so good. But applying this approach for feature reduction is risky, because it assumes that the feature itself is stable (invariant) AND that the feature's variance contains all the information needed for classification.
This assumption is wrong!
Real-world features contain artefacts (noise, etc.), and therefore the main components PCA generates, the EWs with the highest variance, can be dominated by that noise. Hence, you generate "new" features which are not stable.
Only if you can prove that your features are completely artefact-free and follow a Gaussian process might PCA work; in all other cases PCA is, as I said, very risky.
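A toy illustration of this risk: below, a high-variance noise dimension dominates the first principal component even though it carries no class information (the data is synthetic and the scales are chosen only to make the effect obvious):

```python
# Toy illustration: a high-variance noise dimension dominates the first PC,
# even though it carries no class information.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
n = 500
signal = np.where(rng.random(n) < 0.5, -1.0, 1.0)   # class-relevant, low variance
noise = rng.normal(scale=10.0, size=n)              # class-irrelevant, high variance
X = np.column_stack([signal + 0.1 * rng.normal(size=n), noise])

pca = PCA(n_components=2).fit(X)
print(pca.explained_variance_ratio_)  # first component is almost entirely the noise axis
print(pca.components_[0])             # loadings: dominated by the noise feature
```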
First of all, you have to find out the kind of objective you have and its characteristics (linear or non-linear). PCA is a good way for linear systems; for non-linear systems I use artificial neural networks and do a sensitivity analysis afterwards. This has fit perfectly well for all of my applications over the last 20 years, and it has the benefit of a more generic approach if you don't know much about your system.
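The exact sensitivity analysis is not specified above; as one common variant, here is a sketch that fits a small neural network and then ranks inputs by permutation importance (scikit-learn, with illustrative synthetic data):

```python
# Sketch: fit a small neural network, then rank input features by a
# permutation-based sensitivity analysis (one of several possible variants).
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=400, n_features=20, n_informative=5,
                           random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

net = MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000,
                    random_state=0).fit(X_tr, y_tr)
result = permutation_importance(net, X_te, y_te, n_repeats=10, random_state=0)
print(result.importances_mean.argsort()[::-1][:5])  # most sensitive inputs first
```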