I would like to search for a 15nt string in which the last 2nt are absolutely matched against the genome of interest, yet I can tolerate 1-2 mismatches in the intial 12nt and any nucleotide in position 13. Later I would like to complicate things by adding a further string of 8nt, preceding the initial string which, although there is no requirement for a match in these nt, I would like to see where any matches do exist?

Initially I searched for a string "XXXXXXXXXXXXNXX" in which the X's represent my sequence and the N, any nucleotide. However the results seemed to ignore the last 3 nucleotides where, upon further analysis of one match, I saw there was a perfect match across all 15nt. Even if this had worked perfectly however, it would not have taken into account my preferences for these last 2nt to be absolutely conserved and only a 1-2nt mismatch in the first 12nt.

I have been using NCBI BLAST.

Any pointers would be greatly appreciated.

Similar questions and discussions