Subject: How can I handle 5 different time series gene expression pattern for the yeast gene SAM3 to infer gene functions for similarities between microarray time series gene expression patterns?
Hi
I am trying to analyze the series GSE3431. Its title is "Logic of the yeast metabolic cycle". Its URL is https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE3431.
I am trying to infer gene functions from similarities and dissimilarities between the trajectories between the expression trajectories of genes.
To trust my results I must understand why I am getting five different expression patterns for the yeast gene with the common name "SAM3".
I tried in vain to find the nucleotide gene sequences for the 5 different SAM3 expression pattern. Their prob-set IDs are
2597_at,
2598_g_at,
2599_s_at,
2600_s_at,
8021_at
The information in each of the 5 lines is identical, except for the last line, which also lists the systematic name of SAM3. How can I chose the correct expression pattern from these 5 possible options? What's the reason that generated these 5 options?
I have attached the pdf file with the gene expression time series trajectories for SAM3. The difference in the magnitude between their expression levels is too big to simply take the average. What is the best way to select the best out of these 5 possible options?
The platform / chip of this dataset is
GPL90 [YG_S98] Affymetrix Yeast Genome S98 Array
Thanks a lot
Thomas Hahn
(Bioinformatics graduate student)
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE3431