Hello everyone!

I'm doing microarray analysis with data GSE10474 about sepsis. i got gene expression matrix, but when mapping the probe to gene symbol i got struggle in some probe is represent multiple gene like :207665_at - ADAM21 /// ADAM21P1

205013_s_at - ADORA2A /// SPECC1L

208597_at - CNTF /// ZFP91 /// ZFP91-CNTF

How I deal with this problems ? Why one probe represent for many genes ?

Can I choose the first gene as gene symbol for corresponding probe ? So if one probe represent multiple gene, it possible to do feature selection for my machine learning model.

Sorry because my major is telecommunication and I do not have experience in bio.

More Duc-Long Vu's questions See All
Similar questions and discussions