Which type of tree is needed for a codeml (PAML) analysis?

Neel Prabh

Hi. Considering that codeml depends on internal nodes to calculate the dN/dS ratio for the different branches of the tree, I would suggest that you first use either a maximum likelihood or Bayesian approach to generate an individual tree for each cluster and then pass each tree to codeml. This way you are free to choose different models and approaches when building the tree for each cluster.
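As a rough illustration of passing one tree per cluster to codeml, here is a minimal Python sketch that writes a control file for each cluster. The file-naming scheme (`<cluster>.codon.phy`, `<cluster>.nwk`) and the function name are hypothetical, and the option values are only one plausible site-model setup; please check them against the PAML manual for your version before running anything.

```python
from pathlib import Path


def write_codeml_ctl(cluster, workdir="."):
    """Write a codeml control file for one ortholog cluster; return its path.

    File names are a hypothetical convention, not something codeml requires.
    """
    settings = {
        "seqfile": f"{cluster}.codon.phy",   # codon alignment for this cluster
        "treefile": f"{cluster}.nwk",        # gene tree built for this cluster
        "outfile": f"{cluster}.codeml.out",
        "noisy": 3,
        "verbose": 0,
        "runmode": 0,        # 0: use the user-supplied tree
        "seqtype": 1,        # 1: codon sequences
        "CodonFreq": 2,      # 2: F3x4 codon frequencies
        "model": 0,          # 0: one omega across branches
        "NSsites": "0 1 2",  # site models M0, M1a and M2a
        "icode": 0,          # 0: universal genetic code
        "fix_kappa": 0,
        "kappa": 2,
        "fix_omega": 0,
        "omega": 1,
        "cleandata": 0,
    }
    lines = [f"{key:>10} = {value}" for key, value in settings.items()]
    ctl_path = Path(workdir) / f"{cluster}.ctl"
    ctl_path.write_text("\n".join(lines) + "\n")
    return ctl_path
```

You would then call `write_codeml_ctl("cluster1")` for each cluster and run codeml on the resulting file.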

Matteo Brilli

You can build the phylogenetic trees in whichever way you prefer. As far as I remember, the branch lengths are not used but recalculated by codeml. It is mandatory to align the proteins first and then align the coding sequences following the protein alignment, as with RevTrans (http://www.cbs.dtu.dk/services/RevTrans/) or alternatives such as MACSE (http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0022594). Only in this way can you be sure that the alignment algorithm does not introduce frameshifts that would create artifacts in the ensuing selection analysis.

Also, as codeml basically counts and models mutations, it is important that the sequences do not differ too much; if they do, many of the sites may have undergone multiple mutations that you cannot observe, and the estimate of selection will be affected. You can control for this to some extent by looking at the branch lengths of the input tree: they are usually in substitutions per site, and you should have much less than one for every sequence in the multiple alignment. Clearly the tree has to be built on nucleotide sequences. If there are highly variable regions, you can remove them from the alignment, but you need to be very careful in doing so: you have to reason in coding-sequence terms, so your unit of editing is three nucleotides, following the reading frame.
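To make the "protein alignment first, then coding sequences" idea concrete, here is a minimal Python sketch of threading unaligned CDS onto an existing protein alignment, so that every gap is exactly three nucleotides wide and the frame is preserved. This is not the RevTrans or MACSE implementation; the function name and the dictionary inputs are assumptions for illustration, and the stop-codon check assumes the standard genetic code.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def backtranslate_alignment(protein_aln, cds):
    """Thread unaligned coding sequences onto a protein alignment.

    protein_aln: dict name -> aligned protein string ('-' marks a gap)
    cds:         dict name -> unaligned coding sequence for that protein
    Returns a dict name -> codon alignment, with gaps inserted as '---'.
    """
    codon_aln = {}
    for name, aligned_protein in protein_aln.items():
        seq = cds[name].upper()
        n_residues = sum(1 for aa in aligned_protein if aa != "-")
        # Drop a trailing stop codon that has no residue in the protein.
        if len(seq) == 3 * (n_residues + 1) and seq[-3:] in STOP_CODONS:
            seq = seq[:-3]
        if len(seq) != 3 * n_residues:
            raise ValueError(f"{name}: CDS length does not match protein")
        codons = []
        pos = 0
        for aa in aligned_protein:
            if aa == "-":
                codons.append("---")  # one amino-acid gap = three nucleotides
            else:
                codons.append(seq[pos:pos + 3])
                pos += 3
        codon_aln[name] = "".join(codons)
    return codon_aln


# Toy example with made-up sequences
protein_aln = {"sp1": "MK-L", "sp2": "MKAL"}
cds = {"sp1": "ATGAAACTG", "sp2": "ATGAAAGCTCTGTAA"}
codon_aln = backtranslate_alignment(protein_aln, cds)
```

Because gaps are only ever inserted as whole codons, any later trimming can also be done column by column in units of three nucleotides.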

It is also important to compare the likelihood of the model with selection to that of the model without selection. If they do not differ much, then even if you observe a few sites under positive selection, you cannot reject the hypothesis of no selection.
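This comparison is usually done as a likelihood ratio test on the log-likelihoods (lnL) reported for the two nested models. Below is a minimal Python sketch using only the standard library; it supports 1 or 2 degrees of freedom, where the chi-square tail has a simple closed form. The function name and the lnL values in the example are made up for illustration, and the appropriate degrees of freedom depend on which pair of models you compare.

```python
import math


def lrt_pvalue(lnl_null, lnl_alt, df):
    """Likelihood ratio test between nested models.

    Returns (statistic, p-value), with statistic = 2 * (lnL_alt - lnL_null)
    compared against a chi-square distribution with df degrees of freedom.
    Only df = 1 and df = 2 are handled in this sketch.
    """
    stat = max(2.0 * (lnl_alt - lnl_null), 0.0)  # guard against tiny negatives
    if df == 1:
        p_value = math.erfc(math.sqrt(stat / 2.0))  # chi-square tail, 1 df
    elif df == 2:
        p_value = math.exp(-stat / 2.0)             # chi-square tail, 2 df
    else:
        raise ValueError("this sketch only supports df = 1 or df = 2")
    return stat, p_value


# Hypothetical lnL values for a no-selection model and a selection model
stat, p_value = lrt_pvalue(lnl_null=-2500.0, lnl_alt=-2493.0, df=2)
print(f"2*deltaLnL = {stat:.2f}, p = {p_value:.4g}")
```

If the p-value is above your significance threshold, the model with selection does not fit significantly better, and sites flagged under it should not be taken as evidence of positive selection.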

Matt Lambert

Ideally, you should use the species tree and several gene trees generated using different nucleotide substitution models. If the null hypothesis is rejected with each tree, then you have strong evidence for positive selection.