I am trying to make a really big phylogenetic tree for which I am using mega and muscle to make my 16S rRNA database and align it. Previously, I have added sequences greater than 1000 bp, aligned them and made a phylogenetic tree that turned out fine. After this, I started to collect more sequence data (based on sequence identity in NCBI BLAST) from a metagenome database called JGI/IMG. I have retrieved over 400+ sequences from this (ranging from 200-1000+ in length). Similarly, I have also downloaded 30 sequences from the SRA database.

The problem is that the sequences downloaded from these databases had a completely distorted alignment when aligned with my longer sequences , and had several massive gaps. I tried reverse complimenting the downloaded sequences and then some of them did start to align but most of them still don't. Any idea as to why these sequences don't align with my database (despite being genus or species-level matches)? How can I solve this issue?

Similar questions and discussions