I have started working on EST data processing of cotton. As a first step, we have screened all the deposited EST sequences by 'VecScreen' to detect vector sequence contamination. But in all the sequences (nearly 60, 000), no significant match was observed. Should we proceed to next step or should we try some other tools like seqclean?
Further, as an extra cautious step is it advisable to trim 20-30 nt from both ends of EST sequences even when these are declared non significant matches to vector database by VecScreen.