26 November 2020 3 6K Report

Hey,

Let's say I have one gene, TP53 for example.

What I want to do is-

1.Find the gene's orthologs.

2. Align them all.

I downloaded the orthologs from ENSEMBL, they are stores in a single dictionary.

I saw that MSA can be performed using Clustal Omega, however, I am not sure how to get my input data to be in one of the right formats.

So my questions are:

1. What is the most convenient way to get my data in an acceptable format (like fasta), without doing it manually and getting all the orthologs at once?

2. Generally, is there a better way to do want I want to do? Practically, I will need to perform 1,400,000 alignments (~300 sequences each time).

Thank you very much!

More Tal Gutman's questions See All
Similar questions and discussions