31 May 2018

Hi there,

I have about 1 million compounds to cluster. While calculating the fingerprints was relatively fast using Canvas on a desktop computer, I found clustering (by K-Means) very slow. For example, clustering them into 128 clusters took ~10 hrs, and I wonder whether the run time would increase exponentially as I increase the number of clusters.
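To make the scaling concern concrete, here is a rough back-of-the-envelope sketch. It assumes the standard (Lloyd's) K-Means algorithm, where each iteration compares every compound to every centroid, so the work per iteration grows linearly, not exponentially, with the number of clusters. Whether Canvas implements K-Means this way is an assumption, and so is holding the number of iterations fixed (in practice it can change with the number of clusters). The function names are hypothetical and only for illustration.

```python
def lloyd_distance_evals(n_points: int, n_clusters: int, n_iters: int) -> int:
    """Point-to-centroid distance evaluations in standard Lloyd's K-Means.

    Each iteration compares every point to every centroid once.
    """
    return n_points * n_clusters * n_iters


def extrapolate_hours(known_hours: float, known_k: int, new_k: int) -> float:
    """Naive linear extrapolation of run time to a new cluster count.

    Assumes the same data set, the same number of iterations, and that
    run time is dominated by distance evaluations (all assumptions).
    """
    return known_hours * new_k / known_k


if __name__ == "__main__":
    # Observation from the post: 128 clusters took roughly 10 hours.
    for k in (128, 256, 512, 1024):
        print(f"k={k}: ~{extrapolate_hours(10.0, 128, k):.0f} h (rough estimate)")
```

Under these assumptions, doubling the number of clusters roughly doubles the run time, so the growth is steep for large cluster counts but not exponential.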

So, are there alternative or faster methods for clustering a library of compounds into an arbitrary number of clusters?

Thanks!
