Hi,
I am looking for reasonable linear and affine gap penalties for local pairwise alignments of DNA and amino acid sequences. I am using the NUC and BLOSUM62 substitution matrices from NCBI, respectively.
The problem is, that a low penalty leads to very long alignments with many gaps (especially for DNA), that extend into non-homologous regions due to spurious matches.
In contrast, high penalties can lead to premature termination of alignments.
Can you give me some literature hints on how to find/calculate reasonable gap penalties? What are optimal penalties for the settings described above?