What is the general cut off used in CD-HIT when using a data set of closely related sequences for phyognetic analyses?

03 April 2016 1 5K Report

my dataset consists of highly similar sequences of proteins. i only wanted to remove duplicate sequeces if any in the data set. what threshold shall I set in CD HIT?

Sansrity Sinha

thank you so much. I wanted to ensure that no duplicate sequences are there in the dataset.

Badges
Science method

More Sansrity Sinha's questions See All

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

Why might I be observing hysteresis in my stress-strain curves when using the Mohr-Coulomb model, despite not applying dynamic loading?

While working on caisson foundation, I applied static vertical load

25 July 2024 9,357 1 View

For co-culture HUVEC and muscle cell and use angiogenic and arteriogenic growth factor, what are the markers represent stable blood vessel formation?

We have seen in the literature that Ang-1 can be considered as one of the markers. We need information about few more markers.

01 June 2024 9,650 1 View

Notch 3 can be considered as marker for stable blood vessel formation?

01 June 2024 9,788 0 View

I need to know the markers for stable blood vessel formation?

Both angiogenesis and arteriogenesis are needed to make stable blood vessel formation. Ang-1 is one of the markers for stable blood vessel formation. Can you please suggest names of few more...

01 June 2024 1,912 1 View

Research opportunity in stabilizing overburden dump slopes in opencast mining?

Globally what are the Research opportunities in stabilizing overburden dump slopes in opencast mining?

16 May 2024 5,334 1 View

How the weight estimation of Fixed Wing UAV in conceptual design varies from other aircrafts ?

Conceptual Designing of aircraft provides a major role in designing of aircraft. Let's consider the weight estimation in it. For electric aircrafts its fuel variation is " zero ". Hence W|f is...

18 March 2024 946 3 View

Chitosan JCPDS card number in X"pert HighScore?

I conducted X-ray diffraction (XRD) investigation on my chitosan sample and need assistance interpreting its diffractogram. If possible, could someone share the chitosan JCPDS card number with me?...

01 March 2024 2,080 4 View

What is the difference between PTFE and PPL lined hydrothermal autoclave reactor?

PTFE: Polytetrefluoroethylene or teflon and PPL: Polypropiolactone

14 February 2024 6,223 0 View

Can you help in running MG in STATA?

I am able to run PMG and dfe but not mg in Stata. they are showing an error and saying __ec is listed as a predictor. But I had dropped these variables including est variables before running MG....

12 January 2024 5,646 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View