I have a selection of genes that have no corresponding BLAST or PDB entry. I want to find out:
a) What does the corresponding protein structure look like?
b) What is its likely function? What does it bind to?
At the moment I am only interested in (a), what does it look like?
I am using a combination of PSIPRED and I-TASSER. I can cross-validate the predicted secondary structure, and I-TASSER returns predicted structures based on templates. Unfortunately, for some of the protein sequences the template scores (C score in I-TASSER) are extremely low.
For each five returned I-TASSER sequences, I have put them into a molecular dynamics structure using the Amber force field. I was then going to perform some simulations to see how stable the returned tertiary structures are.
I would be interested to hear people's opinions on an approach to predict the tertiary structure of protein sequences with very poor template matches.
Thanks