My lab has recently generated some medium-sized data sets that describe gene expression (qRT-PCR data) as "Delta-Ct" values for each patient. As you'd expect, the data set isn't complete: for some patients IL-2 (say) might be undetectable, so there is no numeric value for the Delta-Ct. Also, the patients fall into more than one category - sick, healthy, severe, mild and so on.

I want to use R (or a better package - I'm open to ideas) to perform some hypothesis-neutral analysis to find clusters or expression associations, if they in fact exist in the data.

My question is about cleaning up the dataset so that R won't throw an error when it tries to run a hypothesis-neutral clustering. I've tried entering "na" for the missing values, but that isn't helping.
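In case it helps anyone reproduce this, here is a minimal sketch of what I think is going on. The values are made up and the patient IDs and column names are hypothetical, not my real data. As far as I can tell, R treats lowercase "na" as ordinary text, so the whole column stops being numeric; I'm assuming the second read, which uses `na.strings`, is closer to what R expects, but I'm not sure it is enough for clustering.

```r
# Made-up Delta-Ct values; patient IDs and gene columns are hypothetical.
csv_text <- "patient,group,IL2,IFNG
P1,sick,5.2,3.1
P2,healthy,na,2.8
P3,sick,6.0,na"

# Read as-is: lowercase "na" is kept as text, so the gene columns
# are no longer numeric.
raw <- read.csv(text = csv_text)
is.numeric(raw$IL2)    # FALSE

# Declaring "na" as a missing-value marker keeps the columns numeric,
# with the undetectable entries stored as NA.
clean <- read.csv(text = csv_text, na.strings = c("NA", "na"))
is.numeric(clean$IL2)  # TRUE
sum(is.na(clean$IL2))  # 1
```

What I still don't know is how best to handle those NA entries in the clustering step itself.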

So, over to The Community!

Many thanks

G
