gene    C5h    C10h   C15h   T5h    T10h   T15h
gene1   1      2      3      30     32     29
gene2   10     12     13     20     21     23
gene3   100    103    105    200    205    208
gene4   1000   1100   1200   1600   1600   1650

I have RPKM values for the members of a gene family (encoding an enzyme), as shown above (C: control; T: treatment; h: hours). How can I predict the importance of these members?
I have tried PCA (principal component analysis) to find members with a representative expression pattern, i.e. a high absolute correlation coefficient with PC1, which captures most of the variance. However, the members selected this way often have low RPKM.
So I am confused: is there a method that selects members with both high RPKM and a clear difference between the control and treatment groups?
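To make the two criteria concrete, here is a minimal Python sketch using the table above. It is only an illustration of what I mean, not an established method: for each gene it computes the mean RPKM, the absolute control/treatment difference, and a log2 fold change. The pseudocount of 1 and the function name `summarize` are my own assumptions, and treating the three time points as replicates is a simplification.

```python
import math

# RPKM values copied from the table above; the first three columns are
# control (C5h, C10h, C15h) and the last three are treatment (T5h, T10h, T15h).
rpkm = {
    "gene1": [1, 2, 3, 30, 32, 29],
    "gene2": [10, 12, 13, 20, 21, 23],
    "gene3": [100, 103, 105, 200, 205, 208],
    "gene4": [1000, 1100, 1200, 1600, 1600, 1650],
}

PSEUDOCOUNT = 1.0  # assumption: damps the fold change of very low RPKM values


def summarize(values):
    """Return (mean RPKM, treatment minus control, log2 fold change)."""
    control, treatment = values[:3], values[3:]
    mean_c = sum(control) / len(control)
    mean_t = sum(treatment) / len(treatment)
    mean_all = sum(values) / len(values)
    abs_diff = mean_t - mean_c
    log2_fc = math.log2((mean_t + PSEUDOCOUNT) / (mean_c + PSEUDOCOUNT))
    return mean_all, abs_diff, log2_fc


summary = {gene: summarize(values) for gene, values in rpkm.items()}

# List genes from highest to lowest mean RPKM with both measures of change.
for gene, (mean_all, abs_diff, log2_fc) in sorted(
    summary.items(), key=lambda item: item[1][0], reverse=True
):
    print(f"{gene}: mean={mean_all:.1f} diff={abs_diff:.1f} log2FC={log2_fc:.2f}")
```

On this table the two measures disagree: gene1 has the largest fold change but the lowest RPKM, while gene4 has the largest absolute difference and the highest RPKM but the smallest fold change. That is the trade-off I do not know how to resolve.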
Any questions and suggestions would be much appreciated.