Dear Colleagues,

We want to use microsatellite data to analyze the historical dynamics of mammalian populations and, in particular, to test coalescent Bayesian skyline methods. We used BEAST2 v2.7.6. For the test analyses described below, we used a sample of 50 individuals genotyped at 12 microsatellite loci. In all cases, a strict clock with the default rate of 1 was used, and default values were kept for all parameters not mentioned below.

The manual for the BEASTvntr module (https://github.com/arjun-1/BEASTvntr) uses bacteria as its example. In prokaryotes, all loci reside on a single DNA molecule and are therefore analyzed as a single partition. In eukaryotes, however, microsatellite loci can lie on different chromosomes and are not linked, so it seems logical that each locus should be analyzed as a separate partition.

I am aware of only a few publications in which the BEASTvntr module was used for eukaryotes, and the corresponding methodology was not described in detail. Two publications said nothing about their data design: Kjartanson et al., 2023 (DOI: 10.3390/d15030385; the subject was a fish, Acipenser) and Prewer et al., 2020 (DOI: 10.1093/biolinnean/blz175; the subject was a mammal, Ovibos). Escudero et al., 2023 (DOI: 10.1093/aob/mcad087; the subject was a plant, Carex) noted that "clock, tree and site models were linked for the 33 SSRs following recommendations in the BEASTvntr manual", and Rugna et al., 2018 (DOI: 10.1371/journal.pntd.0006595; the subject was a protist, Leishmania) reported that "the diploid data were entered as two distinct partitions". From this it can be inferred that in both cases all loci were combined into a single partition, albeit taking diploidy into account in the second case.

We tried this approach to reconstruct a Bayesian skyline plot: we combined all 12 loci into a single matrix (two-column format) and imported it into BEAUti as a single partition. We used the Sainudiin or Sainudiin Computed Frequencies (SCF) model with a Gamma Category Count (GCC) of 6 or 4. In this case we had no problems with computation time and obtained ESS values >> 200 after 35 million iterations. The resulting plot looked plausible (see the top of the attached figure) and was consistent with what we actually expected for this population based on paleoclimatic data. But since the very approach of jointly analyzing alleles of different loci seems incorrect, I cannot be sure that this plot reflects the real situation.

Dr. Santiago Sanchez-Ramirez suggested combining multilocus data into a single matrix in single-column format (one column per locus, two rows per individual) and importing such a matrix into BEAUti with the multiple-partitions option (https://groups.google.com/g/beast-users/c/8qwtobmIWqo). We tried this option to reconstruct a Coalescent Extended Bayesian Skyline Plot (EBSP) and obtained a very strange plot that hardly reflects the real situation (middle part of the figure). This plot (except for the numbers on the axes) remained unchanged regardless of whether we used linked or unlinked site and/or clock models for the different partitions, and regardless of whether we used the Sainudiin or SCF model and which GCC value we chose.
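For readers trying to reproduce this restructuring, the conversion from our original layout (one row per individual, two allele columns per locus) to the suggested layout (one column per locus, two rows per individual) can be sketched as follows. This is a minimal illustration, not part of BEASTvntr; the function name and the simple dict-based input are hypothetical choices for the example.

```python
def two_column_to_single_column(genotypes, loci):
    """Convert diploid microsatellite repeat counts from a
    two-column-per-locus layout (one row per individual, allele
    pairs side by side) to a single-column-per-locus layout
    (two rows per individual, one column per locus).

    genotypes: dict mapping individual ID -> list of
               (allele1, allele2) tuples, one tuple per locus.
    loci:      list of locus names, same order as the tuples.
    Returns a list of (row_label, [allele per locus]) rows.
    """
    rows = []
    for ind, pairs in genotypes.items():
        if len(pairs) != len(loci):
            raise ValueError(f"{ind}: expected {len(loci)} loci")
        # First haplotype row: first allele of every locus.
        rows.append((f"{ind}_a", [a for a, _ in pairs]))
        # Second haplotype row: second allele of every locus.
        rows.append((f"{ind}_b", [b for _, b in pairs]))
    return rows


# Two individuals, two loci, repeat counts as integers.
rows = two_column_to_single_column(
    {"ind1": [(10, 12), (7, 7)], "ind2": [(11, 11), (8, 9)]},
    loci=["Locus1", "Locus2"],
)
for label, alleles in rows:
    print(label, *alleles)
```

Each pair of `_a`/`_b` rows represents the two haplotypes of one individual, so a diploid sample of 50 individuals becomes a matrix of 100 rows by 12 columns.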

Finally, we organized a separate two-column matrix for each locus and imported them, one by one, into BEAUti, each as a single partition. In the end, our data were again represented as 12 partitions. In this case, with linked or unlinked site models (Sainudiin, GCC 6) and/or clocks, we got the plot shown at the bottom of the figure. I cannot say anything about its plausibility.

With both variants of the 12-partition data representation we had serious problems with computation time and ESS values (about 5-9 after 10 million iterations or fewer, and about 15-25 after 20-40 million). In the case of 12 separate matrices, the runs were regularly interrupted with the error message "java.lang.RuntimeException: Could not find zero eigenvalue" (although they could be resumed using the "append log to existing files" option).
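To make the ESS problem concrete: ESS shrinks as autocorrelation in the chain grows, so a sticky chain can need orders of magnitude more iterations than an independent one. The sketch below approximates ESS from a trace with the usual initial-positive-sequence autocorrelation estimator; this is a simplified stand-in for what Tracer/BEAST report (they use a related but not identical estimator), included only to illustrate the effect.

```python
import numpy as np


def ess(trace):
    """Approximate effective sample size of an MCMC trace:
    n / (1 + 2 * sum of autocorrelations), summing lags until the
    autocorrelation first drops to zero or below."""
    x = np.asarray(trace, dtype=float)
    n = len(x)
    x = x - x.mean()
    # Autocovariance of the zero-padded series via FFT.
    f = np.fft.rfft(x, 2 * n)
    acov = np.fft.irfft(f * np.conjugate(f))[:n] / n
    rho = acov / acov[0]
    s = 0.0
    for k in range(1, n):
        if rho[k] <= 0:
            break
        s += rho[k]
    return n / (1.0 + 2.0 * s)


rng = np.random.default_rng(0)
n = 20000
# A highly autocorrelated AR(1) chain vs the same values shuffled
# (shuffling destroys the autocorrelation, mimicking an ideal sampler).
noise = rng.normal(size=n)
ar = np.empty(n)
ar[0] = 0.0
for i in range(1, n):
    ar[i] = 0.99 * ar[i - 1] + noise[i]
shuffled = rng.permutation(ar)
print("sticky chain ESS:", round(ess(ar)))
print("shuffled ESS:    ", round(ess(shuffled)))
```

With 20,000 samples the sticky chain yields an ESS of only a few hundred at most, which matches the pattern we saw: tens of millions of iterations but single- or double-digit ESS values.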

Can anyone comment on the above and advise on the best way to organize and analyze such data? Perhaps one or another of these approaches requires changing some additional parameters that we left at their default values?

Many thanks in advance!
