I want to find out the overall protein sequence similarity between two strains of the same bacterial species.
Let's say strain 1 has 4500 proteins and strain 2 has 4300. My strategy was to run an all-against-all BLASTp between the two strains and, for each query protein from strain 1, select the best hit from strain 2. I then sorted the best hits by their percent identity values. However, the results range from no hits at all up to 100% identity. I was expecting a range of 80 to 100%, since both strains belong to the same species.
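For reference, the best-hit selection step described above could be sketched roughly as follows. This is only a minimal sketch and not necessarily the exact procedure used: it assumes the BLASTp results were saved in the standard 12-column tabular format (`-outfmt 6`), it picks the best hit by highest bit score (an assumption, since "best hit" could also mean lowest e-value), and the function names `best_hits` and `summarize` are hypothetical.

```python
import csv
from typing import Dict, Iterable, List, Tuple

# Column positions in BLAST tabular output (-outfmt 6, default 12 columns):
# qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
QUERY, SUBJECT, PIDENT, BITSCORE = 0, 1, 2, 11

Hit = Tuple[str, float, float]  # (subject_id, percent_identity, bitscore)


def best_hits(blast_lines: Iterable[str]) -> Dict[str, Hit]:
    """Keep, for each query, the hit with the highest bit score."""
    best: Dict[str, Hit] = {}
    for row in csv.reader(blast_lines, delimiter="\t"):
        # Skip blank lines and comment lines (e.g. from -outfmt 7).
        if not row or row[0].startswith("#"):
            continue
        query, subject = row[QUERY], row[SUBJECT]
        pident, bitscore = float(row[PIDENT]), float(row[BITSCORE])
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, pident, bitscore)
    return best


def summarize(
    best: Dict[str, Hit], all_queries: Iterable[str]
) -> Tuple[List[Tuple[str, Hit]], List[str]]:
    """Return best hits sorted by percent identity (descending)
    and the list of query IDs that had no hit at all."""
    ranked = sorted(best.items(), key=lambda kv: kv[1][1], reverse=True)
    no_hits = sorted(q for q in all_queries if q not in best)
    return ranked, no_hits
```

Here `blast_lines` can be an open file handle on the BLAST output, and `all_queries` would be the protein IDs of strain 1 (for example, collected from its FASTA headers), so that queries without any hit are reported explicitly rather than silently dropped.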
Could you please advise me on how to improve my current strategy, or suggest computational tools for performing this kind of comparison?