RNA-seq data - FAQS.TIPS

More Magdy S Alabady's questions See All

RNA-Seq data normalization?

Raw RNA-seq data are discrete data but normalized RNA-seq data (RPM or RPKM or FPKM) are not discrete, i.e. continuous data. Shouldn't this change in the data's nature change our understanding of...

04 May 2013 7,784 12 View

DSN normalization of full-length cDNA

We measured the expression of two genes before and after DSN normalization of the full-length cDNA library. These two genes are Ubi4, which is highly abundant, and Pr1.1, which is not....

01 February 2013 5,824 4 View

Trinity versus Abyss-trans

Has anyone compared Trinity with Abyss-trans for de novo assembly of the same data sets? I'd like to see the stats of both assemblies as well as any other assessment measures. Which algorithms...

01 February 2013 1,802 12 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

Weak DAPI staining after immunohistochemistry - how to improve?

After immunohistochemistry of previously fixed in PFA and EtOH and then frozen 20 μm sections of zebrafish brain, DAPI staining is very weak (right) compared to the same sections stained without...

05 August 2024 9,637 2 View

Why did the authors extrapolate a phenotype that they experimentally proved in one bacterial strain across the whole genus of the organism?

I aim to be as skeptical as possible regarding whether a pair of orthologous genes results in the same phenotype in their different but related bacterial organisms under similar environmental...

05 August 2024 6,787 4 View

Why my colony PCR results of my recombinant bacterial not showing any results?

I am performing ligation of the plasmid and a target gene. The steps I have taken are: 1. Double digestion of the plasmid and target gene 2. Ligation of the plasmid with the target gene 3....

05 August 2024 2,570 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended?

In the case of a wound l recurrence after radical breast cancer and sentinel lymph node biopsy. Are the sentinel lymph node procedure recommended? If no axillary lymph node dissection was not...

05 August 2024 8,056 1 View

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity?

Regarding a model for simulating battery charge and discharge, what do you consider to be high fidelity? What is the acceptable percentage of error (regardless of the metric)? Could you suggest...

03 August 2024 5,358 0 View

Magdy S Alabady Popular answer

Thanks Robert and Diego! the paper and discussion were very helpful.

The bottom lines:

1) In the DE experiments, distribution models are used to fit gene expression across replicates, not within a sample.

2) The discrete RNAseq data can be modeled as Poisson distribution if the variance/mean ration = 1 (e.g. RNAseq from technical replicates). TSPM and GLM algorithms use Poisson distribution. They assess the variance to mean ratio in different ways. Then they use quasi-Poisson distribution if the variance to mean > 1 and Poisson distribution if the variance to mean = 1

3) The discrete RNA-seq data from biological replicates fit better in NB distribution, which assumes the data are overdispersed (variance > mean). The algorithms “EdgR”, “DESeq”, and “baySeq” use negative binomial (NB) distribution to model RNA-seq data. The difference between these algorithms is mainly in their ways of measuring or assessing the overdispersion.

4) A recent paper by Wanger et al., used a combination of two distributions. Exponential distribution for non-expressed genes (inactive genes if the count is ≤ 2 TPM) and Negative Binomial distribution for expressed genes (active genes if the count > 2 TPM).

“Wagner GP, Kin K, Lynch VJ. (2013) A model based criterion for gene expression calls using RNA-seq data. Theory Biosci”

Diego Diez

I think this entry in biostar about a very similar question answers it nicely:

http://www.biostars.org/p/6028/#6030

Magdy S Alabady