Hi,
I am trying to work out the connectivity between subpopulations of a parrotfish species along the coast of western Africa by comparing neutral SNPs in the COI region between the subpopulations and correlating the differences with the coastal currents.
I have Sanger-sequenced around a hundred samples bi-directionally and aligned each forward/reverse pair to get one consensus sequence per sample. However, I am now stuck, because most analysis programs and packages expect the newer VCF format rather than the FASTA format my results are in.
Is there a way to convert the FASTA sequences to VCF?
There are several sequences of the same gene in GenBank that I could use as a reference if needed.
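To make clear what I mean by converting, here is a rough Python sketch of the kind of output I imagine. It assumes the consensus sequences and one reference are already in a single aligned FASTA file (all the same length, gaps written as `-`). The function names `read_fasta` and `alignment_to_vcf` and the `COI` contig label are just my own placeholders, not from any existing tool. It only records single-base substitutions, treats the data as haploid, and reports gaps, `N` and ambiguity codes as missing, so I am not sure it is a proper way to do this.

```python
def read_fasta(handle):
    """Read an aligned FASTA file handle into {name: sequence}, keeping file order."""
    parts, name = {}, None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # use the first word of the header as the ID
            parts[name] = []
        elif name is not None:
            parts[name].append(line.upper())
    return {n: "".join(chunks) for n, chunks in parts.items()}


def alignment_to_vcf(seqs, ref_name, chrom="COI"):
    """Build minimal VCF text (SNPs only, haploid GT) from aligned sequences."""
    ref = seqs[ref_name]
    samples = [n for n in seqs if n != ref_name]
    if any(len(seqs[n]) != len(ref) for n in samples):
        raise ValueError("sequences must be aligned to the same length")

    lines = [
        "##fileformat=VCFv4.2",
        '##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">',
        "\t".join(["#CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER",
                   "INFO", "FORMAT"] + samples),
    ]

    pos = 0  # 1-based position in the ungapped reference
    for col, ref_base in enumerate(ref):
        if ref_base == "-":
            continue  # insertion relative to the reference: ignored in this sketch
        pos += 1
        if ref_base not in "ACGT":
            continue  # ambiguous reference base: skip the site

        # Collect alternate alleles in order of first appearance.
        alts = []
        for n in samples:
            base = seqs[n][col]
            if base in "ACGT" and base != ref_base and base not in alts:
                alts.append(base)
        if not alts:
            continue  # invariant site

        # Haploid genotypes: 0 = reference, 1.. = ALT index, "." = missing.
        gts = []
        for n in samples:
            base = seqs[n][col]
            if base == ref_base:
                gts.append("0")
            elif base in alts:
                gts.append(str(alts.index(base) + 1))
            else:
                gts.append(".")

        lines.append("\t".join(
            [chrom, str(pos), ".", ref_base, ",".join(alts), ".", ".", ".", "GT"] + gts
        ))
    return "\n".join(lines) + "\n"
```

I would call it roughly as `alignment_to_vcf(read_fasta(open("aligned.fasta")), "reference_id")`, where the file name and the reference ID are placeholders for my own data.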
Thankful for any help or input!