Hello,

I have nearly 300M barcode sequences, length of which is about 16 nt. For each of them I want to find out the best similar target from the whitelist database containing about 700 thousand sequences with comparable lengths.

To do that, I Initially tried bowtie or bwa, which failed to work due to the short lengths of references. Then, I tried blastn by spliting barcode sequences and run batch blast tasks parallelly. However, it's still time consuming.

Does anyone have better ideas to accelerate that process?

Thank you very much!

More Leavy Zhang's questions See All
Similar questions and discussions