I have done a cDNA microarray for cancer cells transduced with adeno5 vector holding my gene of interest and the control group was infected as well with adeno5 control ( empty vector) after 24h, over 14000 genes had fold changes after data analysis.
I know that my protein is signaling many things but did not expect a huge number of 14000 genes will be modified, How can I discarded the genes related to the adeno vector or the genes related to protein production and packaging, in order to minimize the genes I will analyze?
With the big data I have, I did different bioinformatics analysis e.g. Gsea and GO analysis, and still have a lot of data and pathways related to protein production or mRNA translation but I’m interested to look for genes related to cell death pathways and migration pathways (anticancer pathways in general).
From all the changed and enriched pathways, how can I know the ones related to my protein of interest and discard the ones that relate to cellular processes or considered as responses to the vector transduction.