I want to build a custom database of protein sequences involved in a particular process and then BLAST my predicted protein sequences from a metagenome against it. I have downloaded all the matching protein sequences (more than 20,000 entries) from the NCBI Identical Protein Groups database (https://www.ncbi.nlm.nih.gov/ipg/?term=). Does anyone know how to filter out the partial sequences from these downloaded entries and keep only the full-length sequences for further analysis?
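
To make the question concrete, here is a minimal sketch of the kind of filter I have in mind. It assumes the sequences are in FASTA format, that incomplete entries carry the word "partial" in the description line (as NCBI protein titles often do), and that a full-length protein starts with methionine (M). Both heuristics are assumptions and may miss or wrongly drop some entries. The function names and the example records (accessions, organism, sequences) are made up for illustration.

```python
import re
from typing import Iterable, Iterator, Tuple

# Assumption: partial entries are flagged by the word "partial" in the header.
PARTIAL_RE = re.compile(r"\bpartial\b", re.IGNORECASE)


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def looks_partial(header: str, sequence: str) -> bool:
    """Heuristic: flagged as partial in the header, or not starting with M."""
    if PARTIAL_RE.search(header):
        return True
    # Assumption: a complete protein begins with methionine.
    return not sequence.upper().startswith("M")


def keep_full_length(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield only the records that do not look partial."""
    for header, sequence in read_fasta(lines):
        if not looks_partial(header, sequence):
            yield header, sequence


# Made-up example records, not real database entries.
example = [
    ">SEQ_0001 example protein [Example organism]",
    "MAMRQ",
    "CAIYG",
    ">SEQ_0002 example protein, partial [Example organism]",
    "MGKGGIGK",
    ">SEQ_0003 hypothetical protein [Example organism]",
    "KSTTT",
]

kept = list(keep_full_length(example))
for header, sequence in kept:
    print(f">{header}\n{sequence}")
```

On a real file, the same function could be fed an open file handle, e.g. `keep_full_length(open("proteins.fasta"))`, where the file name is a placeholder. Is there a more reliable way than this, for example using the completeness information in the NCBI records themselves?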