I think it depends on the size of the dataset. If the dataset is small, you can initialize the population with features selected by a filter approach. In this case, you start the optimization process with relatively good solutions.
Another approach you can use with large datasets: apply a filter approach (e.g., mutual information) to rank all the features in the dataset, then select the top X features (X may be 20, 30, ...). The number of features to select depends on the dataset, so you need to run some experiments to see which number of features contributes best to the classification accuracy. After selecting the highly ranked features, update the dataset to contain only those features, then apply the wrapper approach to the new dataset.
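As a rough sketch of that filter step, here is a minimal pure-Python example. The toy dataset and the helpers `mutual_information` and `top_x_features` are illustrative assumptions, not from any particular library; in practice you would use a library implementation (e.g., scikit-learn's `mutual_info_classif`) on your real data.

```python
import math
from collections import Counter

def mutual_information(feature, labels):
    """Estimate I(feature; labels) for discrete values, in nats."""
    n = len(labels)
    pf = Counter(feature)              # marginal counts of the feature
    pl = Counter(labels)               # marginal counts of the labels
    pj = Counter(zip(feature, labels)) # joint counts
    mi = 0.0
    for (f, l), c in pj.items():
        p_fl = c / n
        mi += p_fl * math.log(p_fl / ((pf[f] / n) * (pl[l] / n)))
    return mi

def top_x_features(X, y, x):
    """Rank columns of X by mutual information with y; return the top-x column indices."""
    n_features = len(X[0])
    scores = [(mutual_information([row[j] for row in X], y), j)
              for j in range(n_features)]
    scores.sort(reverse=True)
    return [j for _, j in scores[:x]]

# Toy dataset: feature 0 perfectly predicts the label, feature 1 is noise.
X = [[0, 1], [0, 0], [1, 1], [1, 0], [0, 1], [1, 0]]
y = [0, 0, 1, 1, 0, 1]

selected = top_x_features(X, y, 1)                       # -> [0]
reduced = [[row[j] for j in selected] for row in X]      # dataset restricted to the top features
```

The `reduced` dataset is what you would then feed to the wrapper phase.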
The wrapper method has a gap that many researchers are working to solve: improving the search phase of the wrapper method. So you can choose a suitable optimization algorithm and then run experiments (the baseline wrapper vs. your new improved wrapper).
Please read my conference paper:
A Novel Chaotic Chicken Swarm Optimization Algorithm for Feature Selection
As I mentioned in my first answer, the number of features to select depends on the dataset, so you need to run some experiments to see which number of features contributes best to the classification accuracy. Select the top 20 features and test the algorithm, then select the top 30 features and test the algorithm; you might try around 5 different values. Then select the number of features that gives you the highest accuracy.
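That tuning loop can be sketched as follows. This is a toy, self-contained example: the synthetic dataset, the assumed feature ranking, and the leave-one-out 1-NN evaluator all stand in for your real data, filter ranking, and classifier.

```python
import random

random.seed(0)

# Hypothetical setup: 10 features, of which only the first two are
# informative (they encode the label); the rest are random noise.
y = [i % 2 for i in range(40)]
X = [[lbl, 1 - lbl] + [random.random() for _ in range(8)] for lbl in y]

# Assume the filter step already ranked the features best-first.
ranking = list(range(10))

def loo_1nn_accuracy(X, y):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier."""
    correct = 0
    for i in range(len(X)):
        best = min((sum((a - b) ** 2 for a, b in zip(X[i], X[j])), j)
                   for j in range(len(X)) if j != i)
        correct += y[best[1]] == y[i]
    return correct / len(X)

# Try several candidate sizes X and keep the one with the best accuracy.
results = {}
for x in (2, 4, 6, 8):
    cols = ranking[:x]
    reduced = [[row[c] for c in cols] for row in X]
    results[x] = loo_1nn_accuracy(reduced, y)

best_x = max(results, key=results.get)  # smallest x wins ties
```

On real data you would replace the leave-one-out 1-NN with cross-validation of whatever classifier your wrapper uses.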
You need to select a filter FS approach and apply it to the original dataset. This step aims to exclude the redundant and irrelevant features. In the second phase, to further explore the reduced feature subset and identify a subset of informative features, you may employ a metaheuristic algorithm with a learning algorithm such as KNN or SVM.
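A minimal sketch of that second (wrapper) phase is below. To keep it self-contained, a simple bit-flip hill climb stands in for the metaheuristic (GA, PSO, Bat Algorithm, ...), a leave-one-out 1-NN classifier stands in for KNN/SVM, and the toy dataset pretends the filter phase already reduced it to 6 features:

```python
import random

random.seed(1)

# Assumed phase-1 result: dataset reduced by a filter to 6 features,
# of which feature 0 is informative and the rest are noise.
y = [i % 2 for i in range(30)]
X = [[lbl] + [random.random() for _ in range(5)] for lbl in y]

def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out 1-NN accuracy using only the features where mask[j] == 1."""
    cols = [j for j, m in enumerate(mask) if m]
    if not cols:
        return 0.0
    correct = 0
    for i in range(len(X)):
        best = min((sum((X[i][c] - X[j][c]) ** 2 for c in cols), j)
                   for j in range(len(X)) if j != i)
        correct += y[best[1]] == y[i]
    return correct / len(X)

# Phase 2: bit-flip hill climb over feature subsets, fitness = wrapper accuracy.
n = len(X[0])
mask = [random.randint(0, 1) for _ in range(n)]
init_fit = loo_1nn_accuracy(X, y, mask)
best_fit = init_fit
for _ in range(100):
    candidate = mask[:]
    candidate[random.randrange(n)] ^= 1          # flip one feature in/out
    fit = loo_1nn_accuracy(X, y, candidate)
    # Accept strictly better subsets, or equally good but smaller ones.
    if fit > best_fit or (fit == best_fit and sum(candidate) < sum(mask)):
        mask, best_fit = candidate, fit
```

A real metaheuristic maintains a population of such masks and uses its own update rules instead of single bit flips, but the fitness evaluation (train/evaluate the learner on the masked features) is the same.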
You may refer to this paper; they used the same approach you are asking about: the MRMR filter approach followed by the Bat Algorithm with an SVM classifier as the wrapper approach.
Article MRMR BA: A hybrid gene selection algorithm for cancer classification