16s species tree for 53 cyanobacterial species giving low bootstrap values (NJ/ML/MP) in MEGA/phylip. took full 16s sequences. considered all models.?

More Parva Sharma's questions See All

Which type of compound does lamda max of 218 indicate in a uv-vis spectrum of a partially purified compound through column and TLC?

A crude extract of fungal culture using EtOH was subjected to column and TLC and partially purified compound was obtained. UV vis spectrum of the compound/s has max absorbance at 218nm. The...

11 August 2024 9,801 2 View

How does grain and grain boundary affect the ceramic when studying its dielectric properties?

I am not able to get good literature and the physics behind how first these grains and grain boundaries arises out of no where when we make a pellet to study its dielectric properties and then how...

07 August 2024 5,177 3 View

Reason for discontinuities in my Band structure?

Hey All! I am wondering what might be wrong with my band structure. I did the calculations using VASP and plotted the results using Origin. Although I have tried changing various input...

25 July 2024 2,920 11 View

If my gene of interest has high GC content can it be problematic in sequencing? What kind of error is expected with GC rich gene sequences??

Gene sequencing related trouble shooting

25 July 2024 4,149 2 View

How to dispose off lipids waste?

I am looking for a lipid waste disposal method, keeping in mind the environmental, health and safety aspect of lipid waste. Could someone please provide guidelines or the lowest acceptable...

25 July 2024 637 5 View

What publications should I target as a psychology masters student in the UK?

I am writing a paper as a part of my course. I am new in London and was wondering that what publications should look upto?

21 July 2024 3,538 1 View

How effective has the United Nations been in addressing the conflict and its consequences?

After World War, we have seen the formation of the United Nations. The sovereign body for peacekeeping but with regards to the Russia-Ukraine war it is visible that the UN seems to be ineffective...

18 July 2024 4,674 4 View

What are the main obstacles to achieving a ceasefire or peace agreement between Russia and Ukraine?

It has been a quite long time since the Russia and Ukraine war has been going on and if we see the context of Geopolitics and International Relations we don't see a mediation between the two...

18 July 2024 6,737 4 View

How to decide whether the refinement is correct or not, based on Rwp and Rexp factors by Fullprof?

One of the papers I read by Toby, where (title of the paper was "R factors in Rietveld analysis: How good is good enough?"), he tells us that to get good chi square value, you must have low Rwp,...

17 July 2024 9,668 4 View

How to draw photon-magnon coupling color plot in origin?

Please help me in plotting the photon-magnon coupling plot as shown below

14 July 2024 1,632 1 View

Bootstrapping in SEM Amos ?

Can a researcher use small sample size of 40 while using SEM with bootstrapping of 1000 and is it possible to get published

28 July 2024 4,402 2 View

How should I analyze this mediating model?

I was instructed to analyze with amos, but the hypothesis was H1. Opportunistic behavior, relationship benefits affect immersion through trust, H2. Relationship in which trust influences...

09 June 2024 3,788 1 View

Error in divBasic from diveRsity when bootstrapping for Fis: Error in x[, 1] : incorrect number of dimensions?

Hi, I want to include the global Fis index in my divBasic calculation from the package diveRsity. To do so I set the bootstrap = 100. However, I have this error in return: Error in x[, 1] :...

05 June 2024 6,670 0 View

If dataset don't meet the required assumptions of Inctraclass correlation coefficient, would calculating CI with bootstrapping be useful?

I want to study agreement between 2 quantitative measurements, but I noticed they do not meet the required assumptions of normality and equal variance for ICC. I read about some non-parametric...

28 May 2024 5,650 0 View

My MEGA 11 software closes on its own when I am running a phylogenetic analysis in the 1000 bootstrp?

I have ran the analysis a hundred times now. Still after like 200-300 replicates my software stops analysing or just closes on its own.

26 May 2024 6,125 4 View

What is the different of topology view and ordinary view in MEGA X's phylogeny?

I have been running my phylogeny tree in MEGA X. My phylogeny tree doesn't seem to have any major differences between species in plain view, but the branches are becoming more clearly divided and...

16 May 2024 1,478 1 View

Why doesn't Average standard deviation of split frequencies change in phylogenetic analysis?

I've run it three times to try different parameters, but it stops changing after running for a short while. Please help me understand how to fix this. Is there a problem with the sequence or the...

15 May 2024 5,536 1 View

How to deal with zero/negative growth rates in tree growth models?

Tree growth is an important aspect of forestry and forest ecology. Typically a growth rate is calculated as the difference between two stem diameter measurements over a given time interval: (dbh2...

15 May 2024 8,739 2 View

What criteria define a tree species as monodominant within a given ecosystem?

I am looking for criteria to define a species as monodominant, based on its DBH and presence locations in the field.

13 May 2024 7,716 3 View

I have DNA sequences of ITS1 genes both forward and Reverse sequences. How do i build combined phylogenetic tree from these?

None

10 May 2024 5,322 2 View

Xabier Vázquez-Campos

First, try to use a structure-aware program for the alignment of your 16S rRNA gene sequences. See the previous answer or align against Silva db.

Second, search for the appropriate substitution model with something like ModelFinder.

Consider using IQ-TREE for a more thorough model search.

Prashant Singh

As already mentioned, it all depends on the data set that you are using. Cyanobacterial 16S rRNA gene sequences have a lot of problems in their own. So you need to be a bit careful (which unfortunately in itself is a bit tricky!). Also check things like

1. Are there wrongly oriented sequences in your dataset?

2. Did you do the model test using the lowest BIC scores?

3. Does the Tree topology change with all the algorithms (like ML, NJ, MP etc.)?

Parva Sharma

Thank you Artur Burzynski, Xabier Vázquez Campos, Gabrielle Zammit and Prashant Singh for your valuable suggestions.

Regarding Prashant Singh Sir queries----

With orientation you mean 5' - 3'? in this case I have checked this one.

I havn't done the BIC score. can you suggest some tools for this?

not much

What care should I take for cyanobacgterial 16s rRNA gene sequences.

Thank You

If you have used MEGA, there is model selection there. You can simply use the model with the lowest BIC Score.

It will be great to know that what group do your sequences belong too. Is it the heterocytous or the non-heterocytous ones? Frankly speaking, one of the biggest problems I faced was getting correctly identified sequences. Try to take published cyanobacteria reference sequences from published works from journals like IJSEM, Fottea, Phytotaxa etc. It may help!

Jovana M. Jasso-Martínez

Hi Parva

1) As Xabier recommend, I'm agree with the use of an appropiate mode to align your sequences.

2) I think that for an partitioned model selection a good option is use JModelTest or better, Partition Finder.

3) Have you used and phylogenetic method to analyze your dataset? May be that an Maximum Likelihood or Bayesian approximation can increase the bootstrap (BS) values but, do not forget that is important be sure that the sequences aligment and the evolutionary model have been selected in the best way. If these both things are right, even if you perform an analysis in MEGA, the BS values may increase.

4) I really don't know nothing about cyanobacteria but, if the sampling is not the best, this may be an important reason that you obtain low BP values

Luck!

Thank You Jovana M. Jasso-Martinez. I am trying all possible combinations.

Prashant Singh Sir

I am using species that belong to different orders including both heterocytous and the non-heterocytous (file attached). I took the sequences from NCBI RefSeq database (only for those which has been fully sequenced and are in Genome database)

Hi Parva,

Thanks for the list!

Yes with this list there will be few problems. I will detail the heterocytous ones while someone working more with the non-heterocytous ones can pitch in too

1. Anabaena variabilis ATCC 29413 is a bit problematic. I have seen that it usually distorts the entire phylogeny when put in with diverse taxa. In your case, this can be true.

2. Many of the NIES strains do not actually cluster within the identified groups.

3. Maybe remove the Nostoc azollae 0708 sequence

4. Nostoc piscinale CENA21 does not fall into the Nostoc sensu stricto clade.

5. Nostocales cyanobacterium HT-58-2 can be removed from your phylogeny

6. Calothrix sp PCC 7507 and Rivularia sp. PCC 7116 may fall into the same clade which could be actually the Rivularia node (maybe better sampling could separate them but the Calothrix strain phylogenetically should be distant from the actual Clade).

So, the thing is that the taxon sampling for the heterocytous forms looks a bit problematic here. Also your dataset is very much diverse and maybe contributing to the lower bootstraps or maybe single lines or long branches.

I would be extremely careful with any name coming from NCBI and with any name given to Cyanobacteria in general, as they are known to not have a very good concordance between phylogeny and nomenclature.

I'd check in the GTDB website the proposed updated nomenclature (they use phylogenomics).

In addition, you can search those sequences in Silva. You will get them perfectly aligned to the Silva reference and you can get the classification to genus level with something way more reliable than NCBI for these matters.

Richard Allen White III

There needs to be a better species concept and definition of what a species is in cyanobacteria based on molecular means and evolution.

Petr Dvorak

Parva Sharma 16S has generally low bootstrap (or any other support) in deeper nodes. You'll have to add many more genes to significantly improve the node support. If all nodes have