Feature Selection - Before or After Normalization?

More Gustavo Sousa's questions See All

How to properly dilute RNA for cDNA?

Hi all, I have a question regarding proper dilutions and conversions of my RNA. According to my protocol, my cDNA kit is optimized for 1ug of RNA and calls for this amount. I would like to...

07 July 2024 9,206 4 View

Can I use BSA for a Histological Blocking Solution?

Hi all, I would like to know if it is possible to use Bovine Serum Albumin (BSA) in my blocking solution for immunofluorescence muscle fiber typing. The primary antibodies' host is mouse. The...

07 July 2024 7,967 3 View

Why do we use Horse Serum for C2C12 cell differentitation?

I have seen some papers that differentiate C2C12 myoblasts into myotubes by changing the growth medium that contains 10%FBS by a medium containing 2%Horse serum/calf serum + 1% NEAA. What's the...

04 July 2024 4,732 1 View

Problems to find a region of a Virus gene?

We have a couple of primers but when we make a blast for those primers we see two results: When I use Blast from NCBI, I found many lineages which anniling with many lineages. But when I want to...

01 July 2024 7,644 3 View

What is your preference regarding Artificial Intelligence apps/methods/platforms for image analysis?

Please, let me know the apps, platforms, or methodologies based on AI to analyze images, such as radiographic or histology images. Tell me your experience of using AI in assessing patterns,...

30 June 2024 4,430 4 View

JCR 2024 Impact Factor Download File-Factors caused decline in IF are?

Why the IF of almost all journals declined so much? For the file download go to https://www.researchgate.net/post/JCR_Impact_Factor_2024-Factors_causes_decline_in_IF

21 June 2024 3,358 0 View

Why PLLA deposited by gap electrospinning lose piezoelectricity?

I tried to prepare aligned PLLA nanofibers by gap electrospinning method. But after the deposition, when I tried to measure its piezoelectric sensitivity, it actually did not exhibit any...

17 June 2024 9,763 0 View

Can I use frozen tumor fragments in flow cytometry?

Can performing flow cytometry evaluation on cells from the digestion of tumor samples frozen in liquid nitrogen or at -80ºC negatively influence the evaluation of cell cycle and tumor stem...

17 June 2024 7,129 0 View

About mineral exploration?

Are there any techniques at ore exploration that could replace the compass-tape measure of mineralized structures?

14 June 2024 3,254 2 View

How to normalize fluorescence measurements from different microplate runs?

I'm currently doing an experiment where I'm following the expression of a reporter gene (GFP) along growth using S. cerevisiae. The cultivations last around 48 hours. I observed that the results...

10 June 2024 5,941 3 View

Hello researchers Is this a random laser or just fluorescence?

I am using Rhodamine6G as gain medium and silver nanoparticles as scatterers on a microscope slide and laser input 532 nm comes from above.

09 August 2024 9,894 2 View

Is the black sediment consider as sand? If not, how can I filter it out?

Hi researchers! I'm working on soil texture analysis, and the end result for sand is doubtful because there is black sediment appearing after drying, as shown in the figure. Is it considered sand?...

30 July 2024 557 2 View

Can a photocatalytic degradation of methylene blue from red mud be pseudo- zero order kinetics?

My photocatalyst from solid waste red med. Dye is methylene blue My all parameter study is showing zero order. How to prove it further that the reaction in zero order?

29 July 2024 7,404 1 View

How to determine method detection limit in an analytical method?

I know the difference between instrumental LOD and method LOD but my query is - in case of any sample whose concentration is zero or not detected by the instrumental LOD, is it possible to get...

24 July 2024 6,592 5 View

Is it redundant to use both Random Forest and Decision Tree algorithms in the same regression project?

I am currently working on a regression model for a project and considering using both Random Forest and Decision Tree algorithms. Given that Random Forest is essentially an ensemble of Decision...

23 July 2024 4,306 3 View

Can we isolate microplastic from sludge sample without using vacuum filter..?

isolation of microplastic from sludge sample using centrifugation ..

23 July 2024 6,418 0 View

Radar Detection Probabilities using beta distributed Scattering Cross section?

Currently I need to calculate detection probabilities (PD) from radar cross section (RCS) data. Beta distribution parameters for this RCS data are calculated and will be used in Swerling0...

22 July 2024 868 0 View

What is the physical meaning of the magnetic scalar potential?

In cases where the rotational of the magnetic field H is zero, we can define this field as the gradient of a scalar function defined as the magnetic scalar potential (similar to the electric...

21 July 2024 9,633 4 View

How to determine the position of occupancy of the dopant? - whether it is doped in tetrahedral or octahedral site?

Suppose a material "A" has both tetrahedral and octahedral sites and we are doping another material "B" - usually an ion into it. How can we detect if the dopant has occupied the octahedral site...

17 July 2024 4,299 4 View

How to model a non-minimum phase system?

In deriving a model of an LTI dynamic system, the model of the system does have no RHP zeros, however, when inspecting the step response, it reveals a non-minimum phase system represented by a...

14 July 2024 6,355 2 View

Koen Van de Moortel

What is a variance "close to zero"??? 0.1, 0.01, 0.000001??? Any value is an infinity away from zero. Orders of magnitude are not important; they will reflect in the parameters your model will fit.

Instead of just blindly shooting in the wild, you should first reflect about which kinds of relationships between your variables you might expect.

Examples here:

https://www.lerenisplezant.be/teksten/FittingKVdm-Manual.pdf

Mrutyunjaya Hiremath

Variance-based feature selection is a simple and effective way to start the feature selection process. If a feature has a near-zero variance, it's almost constant across all samples and is likely, not informative.

Regarding the sequence of preprocessing steps:

Normalization/Standardization: When considering variance filtering, the order of magnitude can have a significant impact. If you directly apply variance thresholds without considering the scale, you may inadvertently drop or keep features based on their original scales. Therefore, it's a good idea to standardize the data (i.e., [x - mean(x)] / var(x)) before applying variance-based feature selection. This will ensure all variables are on a comparable scale and variance thresholds can be set more uniformly.

Variance Filtering: After standardizing your data, apply the variance filtering. Features with variances close to zero can be eliminated since they have little or no information.

Build Models: Once the initial uninformative features are removed, build your models.

Mahmut Baydas

So how accurate is the normalization/scaling/conversion technique we use?

Different Normalization techniques sometimes do not affect the data at all, but sometimes reverse the results. Therefore, I think the innocence of these techniques should be questioned. Depending on the data type and problem type, the choice of normalization techniques may also differ. Even the number of alternatives and criteria influences fair selection. These are such volatile techniques that there are some who claim that they sometimes cause “ranking reversal” in MCDM rankings. There is no one-size-fits-all normalization. Min-Max is often preferred in artificial intelligence and machine learning applications, which is wrong. I wonder why the alternatives (vector, Max, sum, logarithmic) are not used? Let's not forget that normalization homogenizes different units, but does not always guarantee a fair conversion. Sometimes what transforms turns away from the original and turns into something completely different. Sometimes things go well when the water evaporates, but sometimes the coal turns into ash.

Ramadoss .R

The decision regarding whether to conduct feature selection before or after normalization is contingent upon the particular scenario and the objectives of your data preprocessing workflow.

Sina Jamalzadegan

Hi, please check this recent research in this field:

Article Abaxial leaf surface-mounted multimodal wearable sensor for ...

Check the PCA methods for normalization