I have some deduced sequences for a specific protein from diseased as well as apparently healthy animals, I just want to detect active sites and see differences and their effect if any.
COACH is currently ranked as the best method in the ongoing CAMEO ligand binding experiment for assessing how well function prediction programs are doing.
Note: COACH uses as a structure as an input. The standard algorithim is to run the sequence through ITASSER first to model the structure and then use COACH to determine the binding site.
http://zhanglab.ccmb.med.umich.edu/I-TASSER/
ITASSER by default will run another function prediction program COFACTOR.
Most of the time is spent in modelling the structure. Although structure based detection is more accurate then sequence based (see the link above) if you have hundreds of sequences it may not be practical. However, if the sequences represent SNPs or other minor variants of a parent sequences it should be possible to speed this up considerably.
2. Can you send me the job ID by message or email ( [email protected]) so I can take a look at your result? I have direct access to the servers and code.