I would like to use STRING for functional analysis of proteome data, as it appears to be quite powerful. Unfortunately I'm working with an organism that isn't in STRING's database, nor is there anything particularly closely related on the species list. STRING requires all proteins to be from a single species on their list. Is there a sound method to use my data in STRING, perhaps using gene orthologs from another species? For example, could I BLAST my differentially expressed protein sequences against the genome of a single model organism (e.g. Arabidopsis) and use the top hits as the gene IDs? I've tried using simple gene names as the input, but typically only about half are present in any single model organism I've tried. I'm aware that there are hazards in using orthologs, but I'm mostly interested in higher-order patterns and GO term enrichment.

More Clinton A Oakley's questions See All
Similar questions and discussions