07 August 2025 3 10K Report

Hello, I am looking for options on how to handle the following issue. I am aligning hundreds of amino acid (AA) sequences for HIV-1 Gag. I am interested in the protease cleavage sites in Gag and which positions P12-P1/P1'-P12' are conserved. The problem is, many of the Gag AA sequences have deletions in a cleavage site, typically 2 amino acids. If I remove the 2 amino acid gaps from the aligned Gag sequences, the cleavage site sequences will get shifted to the left or right, either way that will result in a partial misalignment of that cleavage site with the consensus sequence. For example: a cleavage site is from P12-P1/P1'-P12', if P10 and P9 are missing in the Gag sequence and I remove the alignment gaps, P12 and P11 will get shifted to the right into the P10 and P9 positions. HIV-1 produces a lot of defective viruses, so I guess that is what I am seeing. Up until now I have just been deleting the Gag sequences with insertion/deletion, however, that results in the loss of many Gag sequences and I would like to avoid that if possible. How do you handle an issue like this? Thanks in advance.

More Gary S. Laco's questions See All
Similar questions and discussions