I always wondered how to test for the significance of clustering results. Now I have an urgent need. I want to show that when my input is organized in a biologically meaningful way. For that, I want to show that clusters are tighter and/or fewer than expected by chance (e.g. shuffling). I cluster the data after sub-grouping samples in what I want to prove is a meaningful way. I than shuffle, clustering data randomly (maintaining cluster sizes). I now need a good measure of clustering score, in the absence of a target classification (i.e. clusters are REALLY unknown to me). The score must be ignorant to the number of clusters, since I want to optimize the number of clusters (I can't assume to know it).