I want to generate a custom database of translated proteins involved in a particular process and then blast them against my predicted protein sequences from the metagenome . I have downloaded all the protein sequences (more than 20000 entries) from NCBI identical protein database (https://www.ncbi.nlm.nih.gov/ipg/?term=). Does anyone know how to filter out the partial sequences from these downloaded entries and keep full sequences for further analysis?

More Shailesh Nair's questions See All
Similar questions and discussions