Hello,
I have nearly 300M barcode sequences, length of which is about 16 nt. For each of them I want to find out the best similar target from the whitelist database containing about 700 thousand sequences with comparable lengths.
To do that, I Initially tried bowtie or bwa, which failed to work due to the short lengths of references. Then, I tried blastn by spliting barcode sequences and run batch blast tasks parallelly. However, it's still time consuming.
Does anyone have better ideas to accelerate that process?
Thank you very much!