"Restrict from EMBOSS Suit version 6.3.1 with the following parameters: snucleotide1, sitelen = 4, rformat = table, enzymes = enzymes.txt. From the 4379 enzymes present in REBASE, we selected the 650 restriction enzymes that were commercially available, since this assay is meant to be used in any laboratory. From the 650 enzymes, 152 digest all P. salmonis sequences and only 65 recognized conserved restriction sites in the complete set of sequences, generating the same/similar restriction pattern (same number of bands and similar sizes)"[https://www.frontiersin.org/articles/10.3389/fmicb.2016.00643/full]
I have two sequences corresponding to the 16s rRNA gene for two strains of a certain species. I simulated the restriction digestion of these sequences using a restriction enzyme.
How do i figure out if a recognition sequence is conserved or not, if the coordinates of the cut are not the same in the sequences?
What i want to say is that they could be different for two reasons: A-not conserved or B-conserved but there was some base insertion/deletion that lead to this position mismatch.
I guess i would need some tolerance, how do i figure this tolerance value and how do i apply it?
(the sequences are flanked by the same primers motifs.)
Emboss's restrict: