I'm trying to find a program that will do batch sequence alignment.

Basically what I'm doing is, I have a reference sequence obtained from NCBI and I have sequencing files (.ab1) from DNA samples that I've sequenced. I'm wanting to align the sample DNA sequences to the reference sequence to see how well the samples match to the reference.

i.e Say I had a reference sequence for Exon 1 of Gene X (+ and - 200bp) and that I have amplified that exon from 6 genomic DNA samples by PCR, which, after sequencing, leaves me with 12 .ab1 files (forward and reverse for each sample). I essentially want to be able to align the sequences from the .ab1 files to the reference sequence such that I get an output file that has the sequences aligned in the order I want.

So far the best thing I've found seems to be ClustalX which allows me to align multiple sequences, although I have to manually extract the sequences from the .ab1 files, Rev-Com the reverse sequences in "A plasmid Editor", paste each sequence in FASTA format to the same text document and then open that text document in ClustalX. I would LOVE not to have to do all of that for each set of sequences considering I'll likely have to align well over 40 samples with 13 "exons" each - and that's just one population....

The main problem with ClustalX is that when I align the sequences, it throws the order of my sequences out. Instead of being in logical order:

Genome 1 Exon 2 F

Genome 1 Exon 2 R

Genome 2 Exon 2 F

... etc

They end up scattered such that it's a pain to try and get any logical information from the data. -.-

More David Shields's questions See All
Similar questions and discussions