Is there a model to estimate genetic distance that uses both point mutations and indels?

More Konstantin K. Avilov's questions See All

Formation of a target organizational and functional model for data management in space activities?

Formation of a target organizational and functional model for data management in space activities

10 July 2024 4,498 0 View

What connection can there be between politics and collective intelligence?

I am interested in reflecting on the political dimension of collective intelligence.

13 April 2024 2,811 18 View

How can the concentration of colloidal particles in an aqueous solution be determined?

How can the concentration of colloidal particles in an aqueous solution be determined? Is it possible to apply the photo-colorimetric method for this?

11 October 2023 1,563 21 View

Where can I find data about viscosity dependence on concentration in methanol–poly(vinyl acetate) solution?

Where can I find data about viscosity dependence on concentration in methanol–poly(vinyl acetate) solution? What value of saturation concentration and diffusion coefficient for the solved polymer...

11 April 2023 3,414 2 View

Is there the state support for medical insurance of radioisotope diagnostics in your country?

Radioisotope diagnostics of many diseases is currently available in many countries. But this is often an expensive procedure. Does your country provide state support for medical insurance for...

01 April 2023 3,895 7 View

I'm looking for a good review article/book chapter about parrallel methods based on simulated annealing. Any recommendations?

I am currently studying the application of simulated annealing techniques to optimization problems. In particular, I am interested in applying the optimization method to the study of Ising models....

21 November 2022 2,663 1 View

Amplification in regenerative amplifier by cw pumping?

Hello. In real moment I'm designing the CPA system for my Ti:Sa laser. I have only cw diode module for pumping (10 W). How can I evaluate amplification opportunity in 'regen'. I want to gain from...

18 September 2022 5,450 1 View

Why is there increase of electrode atomic forces when applying bias voltage in TranSiesta?

Dear colleagues, I am trying to calculate I-V characteristics for typical (Left electrode) - (Device) – (Right electrode) system. Besides the well-known problem of difficult electronic...

27 March 2022 8,532 11 View

How to block an IR beam in vacuum?

Hallo, I've a HHG setup, but the IR filter is broken. This means that the entire IR intensity hits the CCD camera and would destroy it. Because a replacement filter needs a few weeks to arrive,...

27 February 2022 5,470 1 View

How to determine the fraction of DNA from lymphocytes in the blood?

I'm thinking about how to detect in circulating free DNA fraction that comes from white blood cells. Now my idea is to use T cell receptor excision circle (TREC) and kappa-deleting recombination...

01 June 2021 5,918 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

David Enard Popular answer

Hi Konstantin,

I am not aware of a method using indels in addition to substitutions. However, I would recommend against using indels for your purpose because aligners do a very poor job at inferring them properly. Aligners that do better such as Nick Goldman's PRANK rely on a previously known phylogeny to correctly infer indels. So if you already know the tree topology and are only interested in estimating genetic distances at this point, this could be useful.

But maybe this is possible for you to edit your alignment manually?

I recommend that you look at the following link, people who have actually worked on the issue give interesting insights:

http://evol.mcmaster.ca/~brian/evoldir/Answers/GeneticDistance.with.indels.answers

David Enard

Konstantin K. Avilov

Thank you for your answer, David...

1) Aligners: yes, this is a problem too. I already use PRANK (with, obviously, its default approach of inferring the "previously known phylogeny" from a point mutation metrics). But PRANK performs not so good: it attributes clearly identical insertions of significant length (6-9+ nucleotides) to independent insertion events.

So the alignment that I currently use is PRANK with pretty much manual correction (which may bring the result pretty close to what Muscle of ClustalW produce).

1.5) No, I do not know the tree/topology beforehand. Futhermore, since I deal with bacterial genes, there is a high chance that evolution is not tree-like and there are lots of horizontal gene transfer.

2) The link: yes, I have already googled that text and read it. It is pretty much about the same thing as my question here: indel coding and the problem of arbitrariness of indel vs. point mutation weighing.

========================

By the way, my question may be transformed into another one:

Is there a model that mechanistically describes the processes of deletions and insertions (with some explanation where the code in insertions comes from)?

Brian Thomas Foley

There are too many different mechanisms for insertions and deletions to give one answer that describes them. Many regions of genes can have inverted repeats which can form stem-loops in the DNA prone to deletion. RNA viruses are even more susceptible because their genome exists as RNA with extensive secondary structure. Repetitive DNA such as GAAGAAGAAGAA is very susceptible to "stuttering" which causes variable numbers of the tandem repeat. In HIV envelope gene we often observe inserts that are identical to short regions of sequence near the insert, as if the insert was copied from nearby, but this is probably do to the mechanism of replication of retroviruses (two genomes packages, with template switching during the reverse transcription) and less likely to be found in other organisms.