I have a large metabolomic dataset collected from a patient population that shows considerable heterogeneity. To deal with this, I would like to split the population into two equal-sized groups that are statistically matched on the most troublesome variable, so that I end up with homogeneous sub-groups.

I can't seem to find an easy way to do this - any suggestions?
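
To make the goal concrete, here is a minimal sketch of one possible approach, matched-pair randomization: sort the patients by the troublesome variable, take adjacent patients as pairs, and randomly send one member of each pair to each group. This is only an illustration under assumptions: the function name `matched_split`, the use of age as the covariate, and the sample values are all hypothetical, and it handles a single numeric variable with an even number of patients.

```python
import random
from statistics import mean


def matched_split(values, seed=0):
    """Split subject indices into two equal-sized groups balanced on `values`.

    Subjects are sorted by the covariate, adjacent subjects are paired,
    and one member of each pair is randomly assigned to each group.
    """
    if len(values) % 2 != 0:
        raise ValueError("need an even number of subjects for two equal groups")
    rng = random.Random(seed)  # seeded so the split is reproducible
    order = sorted(range(len(values)), key=lambda i: values[i])
    group_a, group_b = [], []
    for k in range(0, len(order), 2):
        pair = [order[k], order[k + 1]]
        rng.shuffle(pair)  # random assignment within the matched pair
        group_a.append(pair[0])
        group_b.append(pair[1])
    return group_a, group_b


# Hypothetical covariate values (e.g. age) for 10 patients; not real data.
ages = [23, 35, 41, 29, 62, 58, 47, 33, 51, 66]
a, b = matched_split(ages, seed=42)
mean_a = mean(ages[i] for i in a)
mean_b = mean(ages[i] for i in b)
print("group A:", sorted(a), "mean covariate:", mean_a)
print("group B:", sorted(b), "mean covariate:", mean_b)
```

Because each pair contributes one patient to each group, the two groups are always the same size and their covariate distributions stay close. Whether the groups are matched well enough would still need checking, for example with a two-sample test on the covariate.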
