Hi,

I am looking for reasonable linear and affine gap penalties for local pairwise alignments of DNA and amino acid sequences. I am using the NUC and BLOSUM62 substitution matrices from NCBI, respectively.

The problem is, that a low penalty leads to very long alignments with many gaps (especially for DNA), that extend into non-homologous regions due to spurious matches.

In contrast, high penalties can lead to premature termination of alignments.

Can you give me some literature hints on how to find/calculate reasonable gap penalties? What are optimal penalties for the settings described above?

More Patrick Kunzmann's questions See All
Similar questions and discussions