I'm identifying plant sequences (rbcL gene) obtained from insect gut contents by running BLAST against the NCBI GenBank database. Most of the sequences have 95-96% identity to their best match, and a few have 80-85%. I realize that many plant sequences (as with animals and microorganisms) are not deposited in GenBank, so a 99-100% match is not always possible. Still, I'd like to assign the obtained sequences to species or genus, or at least to family. The sequences I use are of good quality, so mismatches caused by sequencing errors should be rare.
I'm wondering whether there are any commonly accepted guidelines I can follow to justify identifying a sequence to species, genus, or family level. Are there any published reviews or research articles on this? I haven't found any so far.
Thank you very much. I would appreciate any suggestions.