I'm looking for results about robustness or stability of feature selection algorithms. I know some results for LASSO associated with differentially private model selection. Are there any theorems, e.g. for MDR?
Just a brief related comment: consider the Bayesian methodology of structure selection (also found under the label of hypothesis testing). It essentially balances model complexity, the amount of processed data, and predictive ability. Searching the huge space of possible hypotheses then becomes an independent problem, solvable by randomly restarted local search.
Stability of variable selection is defined as the insensitivity of the feature selection algorithm to variations of the training set.
Variable selection approaches can express feature preferences in three kinds of representations; in practice, the different methods usually provide their outcome in one of the following forms:
- a weighting or scoring vector
- a ranking vector
- an n-dimensional binary vector, where each component is associated with a feature and its value (0 or 1) indicates, respectively, the absence or presence of that variable in the selected subset.
In order to evaluate the stability of a variable selection method, a suitable similarity measure must be defined for each of the three representations.
As already mentioned, the stability of a feature selection algorithm is usually assessed by measuring its insensitivity to variations in the training set: that is, if you apply your method to two datasets A and B, will it select the same subset of features [1]?
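The comparison above can be sketched as a subsampling protocol: run the selector on several random subsamples of the training set and average the pairwise Jaccard similarity of the selected subsets. Here `select` is a hypothetical stand-in for any feature selection routine that returns a list of feature indices:

```python
import random

def jaccard(a, b):
    """Jaccard similarity between two feature subsets (lists of indices)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)

def stability_by_subsampling(select, X, y, k_runs=10, frac=0.8, seed=0):
    """Estimate selection stability: run `select` on random subsamples of
    (X, y) and average the pairwise Jaccard similarity of the results."""
    rng = random.Random(seed)
    n = len(X)
    subsets = []
    for _ in range(k_runs):
        idx = rng.sample(range(n), int(frac * n))
        subsets.append(select([X[i] for i in idx], [y[i] for i in idx]))
    sims = [jaccard(subsets[i], subsets[j])
            for i in range(k_runs) for j in range(i + 1, k_runs)]
    return sum(sims) / len(sims)
```

A perfectly stable selector scores 1.0; the lower the average similarity, the more sensitive the method is to training-set variation.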
In this regard, a quite popular measure is the Kuncheva Stability Index [2]. This measure also accounts for subtle issues such as the correction for chance. Furthermore, it has been used to measure the robustness of feature selection algorithms in adversarial environments [3].
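For two equal-size subsets A and B drawn from n features, Kuncheva's index corrects the raw overlap r = |A ∩ B| by its chance expectation k²/n. A minimal sketch:

```python
def kuncheva_index(subset_a, subset_b, n_features):
    """Kuncheva's consistency index for two feature subsets of equal size k.
    Returns 1 for identical subsets; values near 0 mean chance-level overlap."""
    a, b = set(subset_a), set(subset_b)
    k = len(a)
    if len(b) != k or not 0 < k < n_features:
        raise ValueError("need equal-size subsets with 0 < k < n_features")
    r = len(a & b)
    expected = k * k / n_features  # overlap expected by chance alone
    return (r - expected) / (k - expected)
```

Unlike raw Jaccard similarity, the index is not inflated when k is large relative to n, which is exactly the correction for chance mentioned above.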
[1] A Stability Index for Feature Selection
[2] Measuring the Stability of Feature Selection with Applications to Ensemble Methods
[3] Is Feature Selection Secure against Training Data Poisoning?
How can stability and robustness of feature selection be measured? I think they can be obtained from the results of machine learning, such as ROC curves.
Feature selection is key for practical work. At present, in research on Chinese medicine, this work mainly depends on experts, which I would like to avoid. Some statistical, machine learning, and evolutionary computing methods have been shown to be effective for feature selection.
In order to assess how robust a feature selection method is, you could use the Kuncheva stability index or the Jensen-Shannon criterion, as we did in our paper; please see "Infinite Feature Selection" among my publications. We analysed the stability of the ranking as a function of the number of available samples; some feature ranking techniques may suffer when the amount of samples is reduced.
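One way to apply a Jensen-Shannon-type criterion (this framing is my assumption; the paper's exact formulation may differ) is to normalize two feature-score vectors into probability distributions and compute their divergence, which with base-2 logarithms lies in [0, 1]:

```python
from math import log2

def jensen_shannon(p, q):
    """Jensen-Shannon divergence (base-2 logs, result in [0, 1]) between two
    non-negative feature-score vectors, normalized to distributions first."""
    sp, sq = sum(p), sum(q)
    p = [x / sp for x in p]
    q = [x / sq for x in q]
    m = [0.5 * (a + b) for a, b in zip(p, q)]

    def kl(a, b):
        # Kullback-Leibler divergence; 0 * log(0) is treated as 0
        return sum(x * log2(x / y) for x, y in zip(a, b) if x > 0)

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)
```

A divergence of 0 means the two runs scored the features identically; 1 means the score mass fell on disjoint features.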
Moreover, if the goal is to study the ranking, Spearman's correlation coefficient can also be used.
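For tie-free rankings (each a permutation of 0..n-1), Spearman's rho reduces to the closed form 1 - 6 Σ d² / (n(n² - 1)), where d is the per-feature rank difference between two runs. A minimal sketch:

```python
def spearman_rho(rank_a, rank_b):
    """Spearman's rank correlation between two tie-free rankings of the same
    n features (rank_a[i] is the rank assigned to feature i in run A)."""
    n = len(rank_a)
    d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))
```

A value of 1 means the two runs rank the features identically, -1 means the rankings are exactly reversed, and values near 0 indicate an unstable ranking.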
There are multivariate methods and statistical diagnostic tests able to measure the stability and robustness of feature selection. Of course, it is important to consider the application requirements.