I am having unaligned multiple sequence FASTA file (don't want to align) with ambiguous characters: RYWSMKHBVDNS
I want to remove/delete those characters...
1) I tried with mEMBOSS
Readme: http://emboss.sourceforge.net/apps/release/6.6/emboss/apps/trimseq.html#input.1
command:
trimseq -sequence seq.fasta -window 1 -percent 100.0 -strict -star -left -right -osformat fasta -auto
But was not able to remove those ambigous characters...
2) There are few Web-servers:
http://www.bioinformatics.org/sms2/filter_dna.html
http://www.cellbiol.com/scripts/cleaner/dna_protein_sequence_cleaner.php
They do perform the job, but they replace/remove even headers of those sequences, making the output useless...
Kindly suggest any Web-server or software/tool that could remove ambiguous bases...