Any suggested tools for RNA-Seq data analysis?

Shabhonam Caim Popular answer

Hi Hamid,

I use the following steps to analyse the RNASeq data:

1) repeat identification using a tool such as repeatmasker, only when you have a genomic assembly.

2) RNA-Seq mapping and assembly using tophat and cufflinks respectively.

3) After repeat identification align the protein data using exonerate protein2genome

4) And finally gene build with Augustus .

You can also use R bioconductors to determine the expression anallysis (DESeq), If you use cufflinks to assemble your data then you can also use cuffdiff to analyse differential expression and later you can use Cummerbund to visualize the differential expression data.

I hope this makes sense.

Goodluck

Shab

Wenbin Mei

Trinity is a good software and has some downstream package associate with it as well.

http://trinityrnaseq.sourceforge.net

Shabhonam Caim

Hi Hamid,

I use the following steps to analyse the RNASeq data:

1) repeat identification using a tool such as repeatmasker, only when you have a genomic assembly.

2) RNA-Seq mapping and assembly using tophat and cufflinks respectively.

3) After repeat identification align the protein data using exonerate protein2genome

4) And finally gene build with Augustus .

I hope this makes sense.

Goodluck

Shab

Manoj Tyagi

You could try STAR aligner, it is suppose to be much faster and seems users are happy with its sensitivity.

https://code.google.com/p/rna-star/

I would also recommend the approach suggested by Shab, use Tophat v2.0.8 using Bowtie v2.1. then use DEXSeq , EdgeR from bioconductors.

Cufflinks / cuffdiff are not the best option these days.

Gustavo Gilson Lacerda Costa

It is not an easy task.

I agree with Lesley, TSSi is a good tool that can help you. Before TSSi you will need to map you reads against zebrafish genome and best is to use a splice-aware aligner such as STAR. I would exclude reads mapped to CDS regions for simplicity after that.

For the remaining reads, you will have to count how many reads start at each chromosome position. Samtools will help you with that. This will be necessary to input to TSSi.

Probably you will note that the TSS is not precisely defined. Often there is a region from where the transcription could start. TSSi will try to guess it based on the distribution of read counts beggining in each chromosome position.

Depending on the library construction protocol used, biases could be introduced. You will note that for some RNA-SEQ libraries the coverage of the transcript is very uneven, even when there is no alternative splicing. This could introduce a bias to undersample the 5' end of transcripts and your job will be much more difficult.

Finally, I would suggest also to train a gene finder, such like AUgustus, using bona fide transcripts with complete 5'UTR and 3'UTR annotations (both tss and tts). After training, run Augustus using a hints file diving bonus to UTRPART hints (that you gathered from samtools). You could also include bonus for TSSi predictions. The advantage is that Augustus will learn the 5'UTR sequence pattern and the pattern surrounding real TSS sites . This would allow it to predict other TSS not supported by your RNA-seq dataset. .

How can I correct the variation in my common ELISA controls between several plates?

How can one statistically prove that a set of measurements are accurate?

Any suggested work flows for lncRNA detection/prediction using RNA-Seq data?

Any suggested tools for 3D RNA structure prediction?

How to learn more about SPSS and its Application?

Is there a problem with my RNA pellet?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

Which Scopus Journal provides the most affordable fees?

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Strugglling with m6A dot blot any suugesstion ?