What should I consider while calculating the FPKM/TPM - sum of exon length or total gene length?

More Sourav Nayak's questions See All

Do you think can be any Uranium bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about Uranium ore deposits in world.

11 August 2024 6,720 0 View

Do you think can be any diamond bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about diamond ore deposits in world.

11 August 2024 2,167 1 View

What is the difference between mathematical R^4 space and physical 4D unit space?

We assume that the difference is huge and that it is not possible to compare the two spaces. The R^4 mathematical space considers time as an external controller and the space itself is immobile in...

10 August 2024 6,678 14 View

If Banks do not provide credit facility, what are the options available for FPOs and impact on producer’s income?

10 August 2024 8,198 5 View

Controlling for pupil light reflex when analyzing pupil size time course?

I used eye tracking to examine how participants from two different populations (A and B) react to an image. Participants in population A exhibit larger pupil sizes over time, but they also have...

10 August 2024 3,229 0 View

What are a “Farmers Producer Organization” (FPO) and its essential features?

10 August 2024 477 5 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

Do interactions between biosphere, carbon cycle, & water cycle impact global warming & interaction between atmosphere & hydrosphere?

How do interactions between the biosphere, the carbon cycle, and the water cycle impact global warming and interaction between the atmosphere and the hydrosphere?

09 August 2024 3,291 2 View

How to get moment output in Abaqus Standart?

I have input a moment load in module load Abaqus, i put my moment load on the node surface (using reference point). I have define moment in history output and make a set for moment too. But the...

08 August 2024 4,831 4 View

How is energy cycled through the Earth's climate system and how do matter cycle and energy flow through the rock cycle?

08 August 2024 8,162 0 View

Is there a problem with my RNA pellet?

Hello, I am currently having problems with RNA extraction. I am using mouse liver (C57BL6J), and I have extracted RNA from mouse liver before. Before this experiment, my final RNA pellets were...

11 August 2024 7,082 3 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

How to confirm the site-directed mutagenesis result without performing NGS?

I'm cloning a fragment of 3200 nts into plasmid. The cloning was successful, however, 02 amino acids were mutated. Now I want to fix these 02 aa by site-directed mutagenesis technique using...

08 August 2024 4,645 2 View

RNA Extraction Using Hot Borate Method No Longer Working?

I've been performing RNA extraction on cotton petiole tissue for a few months now using the method described in the following paper, a derivative of the typical hot borate method...

08 August 2024 9,882 2 View

Does Anyone have expertise in in vitro transcription and RNA pull down assay?

I am currently working on LncRNA; to know the lncRNA-protein interactions I want to do RNA pull down assay, so I need to design primers with T7 promoter. I need assistance in this regard.

07 August 2024 6,622 1 View

E.coli contamination in human RNA seq data ?

Recently, we observed that 99% of the sequences in our RNA-seq data corresponded to the E. coli genome. Despite multiple DNAse treatments after RNA extraction and ribosomal depletion, we were...

06 August 2024 807 3 View

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

Do you have good tips for seaweed tissue preservation in the field for post RNA extraction?

I will be with my students collecting seaweed samples in a marine farm and later we will process this tissue for RNA isolation and further sequencing. Does anyone have tips on how to collect the...

04 August 2024 501 2 View

Does anyone have issues using Prepman Ultra reagent for MicroSeq ID bacterial, fungal and yeast sample preparation?

I have been attempting to extract DNA from Bacterial, Fungal and Yeast banked samples (>1e7 cells) using Prepman Ultra reagent and I seem to be struggling to obtain a sequence. Although the...

01 August 2024 2,079 0 View

Fabrice Chatonnet

Dear Sourav, I am a bit surprised by your question, since RNA-seq is a relatively well covered area in bioinformatics and several tools are quite collectively accepted depending on what output you want to get. If you're only interested in expression in terms of whole genes, I would consider using your bam file as an input for counting scripts like featureCounts (subread.sourceforge.net/) or HTseq-counts ( https://htseq.readthedocs.io/ ). That'll give you raw counts data (integers) that can be analyzed for differential expression between conditions through R (https://www.r-project.org/) packages like DESeq2 (https://bioconductor.org/packages/release/bioc/html/DESeq2.html).

If you're more interested in transcripts detection and RPKM / FPKM values, then you need to consider transcripts sizes (i.e. sum of exons sizes) but there are already some pipelines dealing with that, particularly to detect which isoforms are the more likely to be expressed, the more popular being Tuxedo (bowtie / tophat / cufflinks): http://cole-trapnell-lab.github.io/cufflinks/.

I advise you to also have a look on basic introductions to RNA-seq analysis here: bioinformatics.ucdavis.edu/docs/.../Th_MB_RNASeq_Intro.pdf or here: https://www.rna-seqblog.com/introduction-to-rna-sequencing-and-analysis/

Good luck with your analyses, hope that'll help!

Gautier Richard

Dear Sourav, I totally agree with Fabrice. Currently one of the best way to analyse RNA-seq data is to use STAR for the mapping then featureCounts for counting the mapped reads per gene. You have the option to count the reads per exon or per gene with featureCounts and it usually gives the same result plus or minus few reads. The easiest to have a clean output is to count per gene.

For TPM/FPKM, it seems that TPM has a better reputation than FPKM/RPKM these days (i.e. some reviewers can specifically ask for TPM).

One of the easy solution would be to use already made and validated RNA-seq pipelines such as the one available here (from fastq files to DE, MAplots and so on):

https://github.com/maxplanck-ie/snakepipes

The code is transparent so if you have a doubt about how this or that step is made, you can check it and even bring your own modifications to the pipeline.

As an alternative, to identify DEG, there's this very nice and comprehensive pipeline under R using Limma and EdgeR (starts from gene counts):

https://f1000research.com/articles/5-1408/v2

Sourav Nayak

Dear Fabrice and Gautier,

Thanks a lot for your answers. Actually the organisms we are working on is a bit unexplored through RNA-Seq so many things we are performing manually. Probably that is why we are not using any pipeline but designing our own because we have a couple of check points. It is true that we have tried couple of tools for counting and found results differs. But I believe I need to explore literature more as you suggested. Thanks for the links again.

Ts. Jaeyres Jani

In RNA-seq provide the level to understanding the RPKM / FPKM /TPM is the way to normally of the reading bias during the mapping but need to understand the read not really represent the gene expression. It is depend on what tools are you use for stats the gene count. So carefully if you normalize not do double normalize via the DEseq2 or EDGR tools because in both tools already had the script to calculate the size factor and normalizes.

Changfeng Chen

Gautier Richard Thank you! Your answer really helps!