Trinity versus Abyss-trans

More Magdy S Alabady's questions See All

RNA-Seq data normalization?

Raw RNA-seq data are discrete data but normalized RNA-seq data (RPM or RPKM or FPKM) are not discrete, i.e. continuous data. Shouldn't this change in the data's nature change our understanding of...

04 May 2013 7,784 12 View

RNA-seq data

Why RNA-seq data are always modeled as negative binomial distribution? What are the parameters or the assumptions that make RNA-seq data fit the negative binomial distribution?

03 April 2013 8,251 3 View

DSN normalization of full-length cDNA

We measured the expression of two genes before and after DSN normalization of the full-length cDNA library. These two genes are Ubi4, which is highly abundant, and Pr1.1, which is not....

01 February 2013 5,824 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Can you connect an HPLC to a Mass Spec only at a certain time point?

Can anyone explain this method? Especially the last statement where it says only at 1.5 to 2.5mins was the MS/MS connected to the UPLC. How is that possible, is it a feature in this specific...

11 August 2024 8,141 3 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

GC-MS retention index prediticon?

Hello experts, Does anyone know any free software about retention index prediction ?

08 August 2024 7,403 2 View

Separation of organic acids-HPLC?

Hello What should be done to separate and identify organic acids in HPC when their RetTime is the same?Like oxalic acid with Propanoic Acid.or acids that have a very close RetTime.

07 August 2024 8,782 3 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

Leandro Costa do Nascimento Popular answer

Hi Magdy, there is anothe article in BMC Bioinformatics comparing some methods to de novo transcriptome assembly: http://www.biomedcentral.com/1471-2105/12/S14/S2

For me Trinity is the better option. I used it to assemble transcriptome data of two different datasets: one of a fungus and another one of a plant.

Aureliano Bombarely

There is an article in BMC Genomics where the authors compare Trinity and ABysSS-Trans for wheat Illumina reads assembly (http://www.biomedcentral.com/1471-2164/13/392).

Magdy S Alabady

Thanks Auerliano

Leandro Costa do Nascimento

Tony John Reynolds

Don't know about Trinity by have tried out Abyss for a short time, it does the job. What I do like, I dont work for the company, is CLC Genome Workbench. It does all the assembly etc in a fraction of the time but alas its not not freeware. You do have the ability to customize your workflows and design your own species specific plugins using their own SDK.

Daniel Garcia de la serrana

Hi Magdy, I agree with the previous comments to read the BMC papers recommended. About Isoforms and repeated sequences it depends of your species. I personally work with species with several whole genome duplications events in their evolutionary history, so I'm interested in paralogues. For paralogues search non of them is very good and find paralogues is not easy. Also, depending of your species evolution history, you can have several duplications and repeated sequences that in general are difficult to resolve with de novo assemblies.

But,if it helps you, for me Trinity is a bit better than ABySS-Trans.

Axel Künstner

Hi Magdy,

several papers has been already mentioned. I would like to another one the compares de novo transcriptome assemblies in general:

http://onlinelibrary.wiley.com/doi/10.1111/mec.12014/full

The authors came to the conclusion the Abyss-Trans performed not as good as Trinity or SOAPdenovo-trans and excluded it from further analyses. It might be worth to take a look on the paper. The study also suggests a mapping approach to identify gene models using a related species (divergence < 15%).

From my own experiences, Trinity works better than Abyss-Trans but this might be biased by the data sets I used.

Good luck,

axel

Thanks everyone! i have used Abyss-trans to assemble two different transcriptomes, one is for a plant without a sequenced genome and one for an insect with a fully sequenced genome. In insect case, by comparing my assembly to the published assembly of another insect belong to the same genus, I found that abyss-trans assembly is really good. Same thing with the plant transcriptome. What I found interesting about Abyss-trans is that it merges the assemblies from different k-mers. I found this particularly good as I am convinced that there is no single magic k-mer (see the attachment).

Also, Abyss-trans performed well in detecting the isoforms as compared to published trascriptomes from closely related species in both cases.

I haven't use Trinity at all but I am going to. Will do the comparison on my species and will update you guys in case there is anything that isn't published already in those papers. Cheers everyone!

Olivier Armant

Dear all,

Each assembler will have some pro and cons, and results will depends and the tuning, as you stated Magdy. I would like also to point to velvet/Oases which is also famous. Why not make the assembly with the different assembler and then pool the results with CAP3? I saw several studies doing that.

Jonathan David Moore

I would second Olivier's idea. All models are wrong, and different models are wrong in different ways, so if you can make a consensus of models you can reasonably expect it to be better than any model individually.

Ilona Urbarova

I have tried Trinity for my Ion Torrent data, I think I get very good assemblies, but have not figured out how to compare them to others (how to find out how many contigs are in agreement), because what I can see with Trinity that it reports different splice variants or how one can call it. There are more contigs from out from the same region kind of - or from similar reads.

I also would like to know how you did this graph you attached, I would also like to make some more sense out of my assemblies from Trinity, so that I can see more than number of contigs and calculate the N50 and mean length.

Thanks, Ilona

Dear Olivier,

I was thinking about doing something like that, but can one actually introduce quite much bias in the assemblies then if the assembler kinda choose to the contigs some bases - because sometimes it can be 50:50 for a base and then one would introduce more and more mistakes by fusing different assemblies? I guess then it quite much depends on the coverage...

Cheers,

Ilona