I just have do an multiple sequence alignment (about 15000 sequences) between different strains of a bacteria for determine its conserved regions, but near of 1% of sequences have 80 or 85% of identity and it generate problems for the determination of conserved regions, that sequences could be the same gene or could be a wrong of the sequencing uploaded to the database?