Is it advantageous to cluster a data set with some clustering algorithm first and then agglomerate the resulting clusters by applying a hierarchical clustering algorithm?
You first need to decide how you want to use the result. Hierarchical clustering is typically used when there are few cases (fewer than about 150), because you will want to interpret the levels of the hierarchy yourself; if you have many cases, don't use hierarchical clustering directly. Support vector clustering (an SVM-based method) is another algorithm you could use.
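As a minimal sketch of why the "few cases" advice matters, assuming SciPy and matplotlib and using synthetic data (all names and parameters here are illustrative): the dendrogram is the interpretable output of hierarchical clustering, and it is only readable when there are few cases to inspect.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, dendrogram
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
# ~90 cases in three synthetic groups: small enough to read the dendrogram
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(30, 2)) for c in (0, 5, 10)])

Z = linkage(X, method="ward")   # linkage matrix over all cases
dendrogram(Z)                   # each leaf is one case: unreadable for large n
plt.title("Dendrogram of ~90 cases")
plt.show()
```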
You usually do this when you think that hierarchical agglomerative clustering will give you a good "high-level" view of your data (depending on the linkage you use, hierarchical clustering can follow complex shapes at a high level), but either you fear that the hierarchical clustering will get lost at the beginning of the process (linkage functions and noise do not always mix well), or you have so many individuals to start with that you face algorithmic/performance problems when computing the linkages (randomly subsampling would also be a possibility).

In that case, you start with a "standard" clustering, say k-means, so as to bring the number of "individuals" fed to the hierarchical clustering into a manageable range, 1000 for instance: you run k-means with k = 1000, and the "individuals" of the hierarchical step are then the resulting cluster centroids (whether they are weighted by the population of their cluster or not is another matter).

(In the process, you also hope that this first step will "smooth" the data somewhat, so that the linkage function will not drive the hierarchical clustering into the void at the first "noise bump". A sketch of this two-stage approach follows.)
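Here is a sketch of the two-stage approach described above, assuming scikit-learn and SciPy; the data, the choice of Ward linkage, and the cut into 5 top-level groups are all illustrative assumptions, not part of the original answer.

```python
import numpy as np
from sklearn.cluster import KMeans
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(42)
# 20k points: already too many for direct linkage (pairwise distances ~ n^2/2)
X = rng.normal(size=(20_000, 8))

# Stage 1: k-means reduces 20k points to 1000 "individuals" (the centroids).
km = KMeans(n_clusters=1000, n_init=1, random_state=0).fit(X)
centroids = km.cluster_centers_

# Stage 2: hierarchical clustering on the 1000 centroids is cheap.
Z = linkage(centroids, method="ward")
top_level = fcluster(Z, t=5, criterion="maxclust")  # e.g. 5 high-level groups

# Map every original point to its high-level group via its stage-1 centroid.
point_labels = top_level[km.labels_]
```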
Clustering the clusters relies on the assumption that the importance of the first-level clusters is roughly equal across your entire sample set, just as the first level itself assumes that any given sample is as important as any other. Once that first level of clustering is complete, you have cluster centers. If the number of samples in each cluster is approximately the same, then a second-level clustering is reasonable; if the populations of the first-level clusters vary widely, however, the second level has the potential to produce poor-quality centers.

For example, suppose you are clustering colored balls (reds, blues, yellows) and the first-level clustering gives you three nice groupings that correspond to the colors. A second clustering would likely bring the second-level cluster to the center of the color space, which may be what is intuitively expected. But when the three level-1 clusters contain widely differing numbers of samples, uncertainty starts to affect the location of the second-level cluster: a small number of samples in a given cluster will "move" its level-1 center around the parameter space to a much greater degree than can be compensated for by the smoothing effect of a large number of samples. This "noise" in estimating the level-1 centers is then reflected in the second-level clustering more strongly by the lightly populated first-level clusters than by the well-smoothed centers of the others. Rather than mixing the colors in the example above, if there are only a couple of red balls, one of which is "off-color" (noise in the sample), the red cluster center from level 1 will be "pulled" away from true red, and the second-level cluster will in turn be pulled away from the center of the color space. One way to mitigate this, sketched below, is to weight each level-1 center by its population at the second level.
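A sketch of that mitigation, assuming scikit-learn (whose `KMeans.fit` accepts a `sample_weight` argument); the data and cluster counts are illustrative assumptions: weighting each first-level centroid by its cluster population means sparsely populated, noisily estimated centroids pull the second-level centers around less.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(7)
X = np.vstack([
    rng.normal(0.0, 0.3, size=(500, 2)),  # large, well-estimated group
    rng.normal(5.0, 0.3, size=(500, 2)),  # large, well-estimated group
    rng.normal(2.5, 1.5, size=(5, 2)),    # tiny, noisy group
])

# Level 1: many small clusters, each center estimated from its own samples.
level1 = KMeans(n_clusters=50, n_init=5, random_state=0).fit(X)
counts = np.bincount(level1.labels_, minlength=50)  # population per cluster

# Level 2: cluster the level-1 centers, weighting each by its population,
# so a center backed by 5 samples counts far less than one backed by 500.
level2 = KMeans(n_clusters=3, n_init=5, random_state=0).fit(
    level1.cluster_centers_, sample_weight=counts
)
```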