How to see the data points of most similar cluster(s) to another cluster in k-means clustering?

More Parva Jain's questions See All

How to apply dynamic attributes clustering algorithm?

Think of a scenario where there's a dataset with 15 attributes(i.e. columns heading). I want to apply the clustering algorithm on that dataset but not taking all 15 attributes but taking any of...

03 June 2020 5,780 10 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Why does my protein refolded to beta sheet during thermal denaturation analysis?

Hi! So i attempted to understand a novel protein behavior towards heat application by analyzing its secondary structure change. I subjected the protein to a thermal denaturation analysis using...

06 August 2024 1,989 3 View

Samer Sarsam

Parva Jain

If I understand you correctly, then here is my suggestion:

You can merge the 2 datasets and run a clusterer method on the resulting data. Then, explore the instances of each cluster to see which data they originally belong to.

HTH.

Dr. Samer Sarsam

Jianfei Pang

K-means will generate the central coordinates of each cluster, and you can calculate the distance between the sample and each cluster.

Javad Pourmostafa Roshan Sharami

Once clusters have been identified, you would able to find the corresponding sample's cluster! right.

To find the top m neighbors' data points within the cluster, you can set a marginal distance. It should be less than your minimum cluster intra-similarity (less than farthest data points coordinates). Then design a plain local search based on the aforementioned criteria and find the distance between central coordinates and data points caught by local search. Finally, order them descendingly and select the top m points WRT the desired search intensity.

B.K. Tripathy

You can find the inter-cluster distances using any distance formula. The ones which provide the smallest value(s) are the closer ones. Similarly the other way.