How do I determine whether a database is cluster-friendly, so that I can be confident in using an algorithm such as k-means (for example) to discover the structure of the database?
Note: this question is not about whether the database can easily be distributed across many machines.