I am working with Sequence Read Archive (SRA) datasets. I have one FASTA file containing the SRA identifier and the sequence (atcagtt...), and one BLAST output file containing the sequence identifier, the GI number, and the rest of the details. Both files are in CSV format.
I want to combine the two files using the sequence identifier they have in common. The end result should be a single file containing all the sequence information as well as the sequence itself.
Attached is a link to a possible solution; however, the sort function only gives me the sequence (as in aaccgttc...), without any identifiers.
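To make the goal concrete, the kind of join I have in mind could be sketched roughly like this in Python. This is only a minimal sketch under assumptions that may not match the real files: the sequence identifier is the first column in both CSV files, the sequence is the second column of the sequence file, and neither file has a header row. The function names and file names are made up for illustration.

```python
import csv


def load_sequences(seq_csv):
    """Build a dict mapping sequence identifier -> sequence.

    Assumption: column 0 is the identifier, column 1 is the sequence.
    """
    with open(seq_csv, newline="") as handle:
        return {row[0]: row[1] for row in csv.reader(handle) if len(row) >= 2}


def merge_blast_with_sequences(seq_csv, blast_csv, out_csv):
    """Append the matching sequence to each BLAST row and write the result.

    Returns the number of rows written. BLAST rows whose identifier has no
    matching sequence are kept, with an empty sequence field.
    """
    sequences = load_sequences(seq_csv)
    written = 0
    with open(blast_csv, newline="") as src, open(out_csv, "w", newline="") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            if not row:
                continue  # skip blank lines
            # Assumption: column 0 of the BLAST file is the sequence identifier.
            writer.writerow(row + [sequences.get(row[0], "")])
            written += 1
    return written


if __name__ == "__main__":
    # Hypothetical file names; replace with the real paths.
    merge_blast_with_sequences("sequences.csv", "blast_output.csv", "merged.csv")
```

Each output row would then be the original BLAST row (identifier, GI number, remaining details) followed by the sequence, so the identifiers are preserved alongside the sequence.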