K-Means clustering: Is feature scaling a necessary pre-processing step ?

More Titas De's questions See All

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

How to develop an academic literacy program for engineering at the higher education level?

Information literacy in higher education integration with curricula engineering

04 August 2024 5,368 3 View

How can i generate a CRISPR knockin mutation zebrafish model with a reporter?

Hey! I aim to generate a transgenic knockin zebrafish line that mimetizes a genetic condtition that leads to a certain disease on human. To do so, I need to insert a codon for mutagenic aminoacid...

14 July 2024 6,240 0 View

What should be the best Lumens range for T8 (120cm) full spectrum LED lamp tubes?

Please (for Arabidopsis), what could be a good Lumens and color range (Kelvin) range for full spectrum LED lamp tubes size T8 (120cm) for each shelve measuring 130x50 cm (length x width) and 60 cm...

11 July 2024 6,078 1 View

Cross Attention in Transformers: Standard applications of the same ?

What are the standard applications of Cross Attention in Transformer Architectures ?

09 July 2024 9,310 2 View

Time Series Analysis: Has Recurrent Neural Networks (RNN) ever been used on Time Series Analysis ?

Are there standard RNN architectures been applied for Time Series Analysis, forecasting and anomaly detection problems ?

30 June 2024 3,169 8 View

LSTM on Time Series: Has LSTM architectures ever been applied to Time-Series Forecasting ?

Have we ever used LSTM architectures on Time-Series Forecasting and Analysis, and gotten a decent result ?

30 June 2024 6,924 3 View

What could be causing these smears in my PCR electrophoresis gel?

I am new to running PCR gels. I loaded this gel and I thought it was fine, meaning I saw/felt no apparent punctures or spillovers to neighboring wells (see picture 1). When the gel started to run,...

30 June 2024 4,107 4 View

What are the typical applications of Large Vision Models (LVMs) ?

Where are large vision models typically used ?

25 June 2024 4,113 0 View

Are there standard libraries/frameworks for doing RLHF for training LLMs ?

When it comes to Re-inforcement Learning with Human Feedback, are there standard libraries/frameworks for training LLMs ?

25 June 2024 1,121 0 View

How to use evolutionary algorithms with real parameters in ryu sdn controller with large scale?

Hi, I wanna to implement evolutionary algorithms in ryu sdn controller in mininet, i have some challenges, how i can run the big scale topo with one sdn contoller??? and another question is to...

21 July 2024 246 2 View

Reversed flow at outlet due to the release of DFBI?

Hi everyone, I am working on a simulation involving restricted canal with ship using DFBI. I am facing reversed flow in my outlet boundaries as the DFBI is released (In 1.25s). Is there any...

17 July 2024 7,032 1 View

How can I begin quantum computing on my computer or laptop?

I am interested in designing, developing, and testing algorithms on my laptop or local machine. Do I require any specialized quantum hardware or an online quantum computing service? Is it possible...

10 June 2024 2,917 3 View

Where can I find a reliable(peer reviewed) source code for the QKD BB84 protocol?

I'm trying to implement BB84 on a network, however I don't have a source code that is backed by any organization or a peer reviewed paper. Any help would be appreciated. Thanks!

09 June 2024 5,786 1 View

How are surrogates integrated in evolutionary algorithms?

I am interested in understanding how surrogates are effectively integrated into evolutionary algorithms (EAs). Specifically, I would like guidance on how to handle the approximation function when...

08 May 2024 2,579 0 View

Can someone please guide on cluster analysis to make identity statuses?

I am using the DIDS scale by Koen Luckyx and I am confused about how to do cluster analysis and then use K means to make identity statuses. and how to convert the scores into z scores.

14 April 2024 6,635 4 View

How do quantum algorithms, such as quantum support vector machines or quantum neural networks, differ from their classical counterparts?

25 March 2024 2,307 1 View

How do quantum algorithms, such as quantum support vector machines or quantum neural networks, differ from their classical counterparts?

25 March 2024 4,842 1 View

What is the most effective method for fine-tuning PID controllers, including techniques like Ziegler-Nichols, Genetic Algorithms (GA),PSO, ACO,WOA ?

Which tuning method is optimal for adjusting PID controller parameters, such as Ziegler-Nichols (ZN), Genetic Algorithms (GA), Particle Swarm Optimization (PSO), Ant Colony Optimization (ACO), and...

18 March 2024 5,283 1 View

What is the script for running protein cluster by using DBSCAN?

Im trying to run dbscan.py by using vmd (dcd and pdb) files but the script is showing error. Its not generating cluster it's only generating noise from the trajectory file. How to solve this issue...

03 March 2024 6,414 0 View

Samer Sarsam

Yes, in general, attribute scaling is important to be applied with K-means. Most of the time, the standard Euclidean distance is used (as a distance function of K-means) with the assumption that the attributes are normalized.

HTH.

Dr. Samer Sarsam

Titas De

Thanks Samer Sarsam

Mohamed Elhadad

Yes, to make sure that your calculations will not be biased either to the very high or to the very low values. In other words, to make sure that all your data are at the same level. you could use any normalization technique to do this, and I recommend this:

Xi(new)= (Xi-mean(all X values))/standard deviation

Thanks Mohamed Elhadad

Jiayin Lin

In most cases yes. But the answer is mainly based on the similarity/dissimilarity function you used in k-means. If the similarity measurement will not be influenced by the scale of your attributes, it is not necessary to do the scaling job.

Thank you Jiayin Lin