I am trying to find out existence of protein domains in a set of sequences. I am using BLASTX for the task. I have made a BLASTX of my sequences with the ProDom sequences. I used an e-value cutoff of 1e-3. What should be the identity percentage and the coverage percentage of the domain so that I can say that a protein domain or it's signature is present in a particular region?

Ex:

Identity percentage is 80% (80% of the amino acids in the aligned regions are identical)

Coverage 60% (The aligned length is 60% of the total length of the domain being compared to)

What other precautions and observations should I make while making such comparison?

More Ravi Kanth Reddy Sathi's questions See All
Similar questions and discussions