The ultimate goal is to look for co-expressed genes by using some clustering,

Let = [x1 x2 x3 ... xn] be a vector representing the raw genetic expression of a certain gene over (n) time points as taken from a single-channel microarray. The log2 of each element is taken THEN the mean is subtracted and THEN it was divided by the standard deviation. This makes the profile (zero-mean unity-std).

In two-channel microarrays, the same thing is done over the log2 of the ratio between Cy3 and Cy5.

One concern is that dividing by the standard deviation eliminates the spread of the expression. Another suggestion was that dividing by the mean is better.

What do you suggest?

Similar questions and discussions