Compare the core-genome mutations of lineages and identify genes associated with lineage-specific adaptation: it this a valid approach?

Frederico Schmitt Kremer @Frederico-Schmitt-Kremer

31 January 2018 1 1K Report

I want to analyze mutations in the core-genome of Xanthomonas oryzae, a bacteria species that comprises two pathovars: pv. oryzae and pv. oryzicola. I already have the core-genome identified (those genes conserved in at least 95% of the strains in each pathovar), and now i want to identify those genes that are less conserved. To do it i am thinking in calculate some conservation score for each gene based on the multiple sequence alignment by measuring the probability of each aminoacid occur in each position in both pathovars and then extract the mean shannon entropy. The Shannon entropy would be used to as the main measure of the conservation of the gene, and would be calculated for each pathovar-gene and then a mean . Thus, with a score for each gene, i would calculate the z-score and select those with outlier z-score and perform a GO enrichment analysis on them. The idea is to identify genes that are differentially variable or differentially conserved in the pathovars and the species.

Can anyone give me some advice or say if this method is appropriate?

Brian Thomas Foley

It sounds like you are saying you have sequenced the complete genomes of a few hundred isolates of each of the two pathovars. That is a lot of data. It is possible to measure diversity of each gene within pv. oryzae and within pv. oryzicola and compare the two levels of diversity. Entropy measures the diversity without taking phylogenetics or evolution of each pathovar into account, or you could measure rate of evolution of each gene which would take the phylogeny into account.

Is there no horizontal transfer of genes between these two pathovars? If not, why are the names as pathovars of the same species rather than being named as two different species? Genes move in between Escherichia, Salmonella, Shigella and other enterobacteria all the time, so it seems like you should check for horizontal transfer of genes as one mechanism of different apparent rates of evolution.

It may also be worth looking at smaller or larger regions of the genomes than genes. Some genes have hypervariable and conserved regions so that the average rate of evolution over the gene does not reflect how the gene is evolving.

It is also possible that for your purposes, you do not care what mechanism is causing diversity (horizontal transfer, hypermutation, strong selection etc) and you only care about how much diversity there is. In that case there is not much point in taking the trouble to look at evolution.

Badges
Science topic

More Frederico Schmitt Kremer's questions See All

Z-axis alignment in a protein for z-axis constraints in Amber software?

Good morning, I am running some Steered molecular dynamics in Amber 22 and want to set some restraints in the z-axis so as to imply certain distances from my ligand to its substrate. My goal is...

27 May 2024 7,082 1 View

Analytical frequency calculation error in ORCA 5.0.3, does anyone know how to change the calculation algorithm?

Good afternoon, We have recently started running ORCA calculation in a new machine, however we are consistently getting memory errors in analytical frequency calculations. Our jobs com-prise a...

27 March 2024 8,609 2 View

Could injecting samples (1ul) that potentially contain up to 3.75% of phosphoric acid be detrimental to an LC/MS system?

Attached is a ThermoScientific Application Note [20738] describing an SPE method for extracting montelukast from plasma for LC/MS/MS analysis. After reviewing it, I notice that the final elution...

07 February 2023 9,651 1 View

Different reparation of Chevreuls salt possible?

Dear experts, For a certain reason I want to make Chevreuls salt from a 'start solution mix' as concentrated as possible. Literature describes mixing a solution of CuSO4 x 5H2O with a solution...

27 December 2022 1,117 0 View

How to draw the CR100 scale while maintaining the correct number proportions of the original proposal?

List of papers 1. On Perceived Exertion and its Measurement 2. A comparison of AME and CR100 for scaling perceived exertion 3. A comparison between three rating scales for perceived exertion and...

04 June 2022 6,713 0 View

Model is unidentified and needs additional constraints?

Hi all, I have computed the model in the screenshot and I get the feedback from AMOS that it is unidentified and I need to add 5 additional constraints. Where do I add the constrainst and how to...

07 May 2022 4,371 12 View

Anyone knows references of disasters assessment, or resilience, using socioecological system?

Searching for references that uses socioecological system for disaster impact assessment, supply chain networks and resilience assessment.

24 January 2022 7,641 13 View

Question is Can I run MD in VMD/Namd2 and do analysis on Chimera? How can I import MD data from VMD into Chimera?

Hello, everyone..I got the same issue as I begun a MD it finished and then got to "Not responding mode", but I understand and still waiting for it to finish. I got VMD/Namd2 for MD, yet I do...

15 October 2021 6,858 4 View

Anyone knows a literature review about materiality for sustainability?

In the past decade most of the sustainability reports are using materiality matrix or frameworks as a tool for prioritizing issues that you be worked by the company. In that way i would like to...

09 September 2021 360 2 View

Do you know relevant papers for my Strategic Literature Review covering employee well-being, leadership and remote work conditions?

I am conducting a strategic literature review to answer the following research question: What do we know about the impact of leadership on the promotion of employee well-being in remote work...

05 September 2021 6,307 6 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

I can't see the ssDNA band after performing asymmetric PCR. Is there any way to do this?

After performing symmetric PCR, PCR purification was performed. Afterwards, asymmetric PCR was performed using the PCR purification product as a template, but no ssDNA band was confirmed in the...

08 August 2024 1,668 3 View

Does crude extraction using NaOH and Tris work well with Fungi?

I'm trying to find a DNA extraction method for fungi that does not require equipment and heating. Is there anyone who can suggest an alternative option? Thank you

08 August 2024 4,733 2 View

What are the key methods and indicators used in assessing the biodiversity of river ecosystems, and how do these methods account for variations ?

Biodiversity assessment of river ecosystems is crucial for understanding the health and stability of these environments. This question aims to explore the various techniques employed to evaluate...

07 August 2024 4,290 3 View

Are air moisture harvesting technologies effective in combating desertification?

Air moisture harvesting Air water collection devices

06 August 2024 5,473 2 View

Why after performing site directed mutagenesis ,I don't see any colony after transformation?

I want to introduce a point mutation (change in one nucleotide) into my gene of interest (DNA binding domain) I have designed primers as recommended on the Data sheet of the kit : -Both primers...

05 August 2024 9,059 3 View

Why did the authors extrapolate a phenotype that they experimentally proved in one bacterial strain across the whole genus of the organism?

I aim to be as skeptical as possible regarding whether a pair of orthologous genes results in the same phenotype in their different but related bacterial organisms under similar environmental...

05 August 2024 6,787 4 View

Who of all the Global Scientific community will help me Prof. Dr. Yoshida make way for TPEOM, MEC ~EMC to return the atmospheric gases to the norma ?

TEP presentation caption (The Environmental Project) Re: Why should Washington’s DC, or any country government point of location think of as nowadays of as to being 'tomorrow as to come! if it...

03 August 2024 2,484 1 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

How is the bacterial genome's high protein count verified as genuine despite 800+ contigs and good metrics (98.55%completeness, 0.68% contamination)?

Given that the bacterial genome has over 800 contigs, but its quality metrics are good, with a completeness of 98.55% and a contamination of 0.68% as assessed by CheckM, what specific validation...

01 August 2024 1,514 1 View