I just want to keep the longest contig from all similar results.
Please give me some suggestion.
You can do that by using CD-HIT-EST Clustering tool, you can get it from here. Just play around with the parameters.
CD-HIT-EST: https://github.com/weizhongli/cdhit/releases
https://github.com/weizhongli/cdhit/releases
Follow the Vivek Todur suggestion. You will get the longest representative sequence based on similarity threshold you set.
thanks, I will do it as you suggested.
ZQ
I am on the lookout for the Enhanced Yellow Fluorescent Protein (Aequorea victoria) DNA sequence. Does anyone know where I can find it? Thank you in advance
03 March 2021 3,568 1 View
Gel electrophoresis, RNA degradation, RNA extraction from fresh tissue
02 March 2021 5,433 5 View
Hi, I am trying to construct a multi-layer fibril structure from a single layer in PyMol by translating the layer along the fibril axis. For now, I am able to use the Translate command in PyMol...
02 March 2021 4,569 4 View
I am wanting to calculate the average trend in maximum annual NDVI in Iceland from 2010-2020 using MODIS MYD13Q1 V6. How would I do this? I have currently inserted the NDVI bands from the MODIS...
02 March 2021 752 2 View
During a computational analysis, I found that for a protein of a plant, two different subcellular localizations are seen using CELLO and WoLF PSORT. For example, in CELLO, it identified the...
01 March 2021 6,729 2 View
I am looking at the ATP1A2 (Sodium/Potassium ATPase alpha subunit 2) in two human neuronal cell lines. Expression levels of this protein seems to be almost equal when detected by one antibody....
01 March 2021 3,607 3 View
I have used the i-Tasser several times, however it has been unavailable for several days. I tried the swiss-model, but the output was not very pleasant due to the model used. Is there any other...
28 February 2021 4,521 3 View
I transfected my LNCaP-WT cells with 3 shRNA plus their NTC two weeks ago and split two puromycin selected cell plates on Friday last week(Feb 26). I checked for GFP in the cells, and they all...
28 February 2021 4,949 3 View
Hi everybody In the ped format for genotype, alleles of any SNP are represented by two columns (one for each allele, separated by a space). Is a column sufficient for the haplotype to...
27 February 2021 1,965 1 View
I have two groups of brain samples, control and treated for example. It was total RNA nova seq sequencing. I tried all the available pipeline like: star+rsem+deseq2, Hista+stringtie+cuffdiff,...
27 February 2021 356 6 View