What will be the minimum sequencing read depth for RNA seq experiment for differntial gene expressions in weedy plants?

More Shahid Farooq's questions See All

Is the research on counselling of patients is cross sectional or clinical trail in psychiatry?

Since in Psychiatry , treatment is done through counselling and medicines are prescribed according to it, my question is what would be the most appropriate study design in the subject of psychiatry.

14 July 2024 8,240 2 View

Which other method other than Gas Chromatography can be used in the content determination of Alcohol in Pharmaceutical syrup?

Other than GC any other method might help.

26 June 2024 1,076 3 View

Can anyone send me the detailed protocol of lentiviral titer determination and its invitro transduction into Hela cell line or MSCs?

I have searched a lot of information from the published article but couldn't find valuable. As I have prepared the Lentiviral vector and stored it in -80 degree!

21 June 2024 3,484 1 View

I plot a graph between absolute value of log j (x-axis) and overpotential (y-axis) in origin by tafel extrapolation? is there need for any adjustment?

after tafel extrapolation, is there need to do more adjustment?

01 June 2024 2,780 0 View

My Cdl results for HER in basic media (1M KOH) are not satisfactory. Is there any possible solution to get correct one?

I take Cdl data from CV (between potential window -0.6 to -0.7).

31 May 2024 7,058 1 View

How to transduce lentiviral vector into ThP1 macrophages and Mesenchymal stem cells?

Is there any authentic reference from where I can get appropriate information regarding this and anyone who has experienced any one of these cells' transduction?

27 May 2024 6,554 0 View

Novel and future lubricants and additives for hybrid electric vehicles?

nano additives and lubricants for hybrid electric vehicles

24 May 2024 8,363 3 View

Hey, is there anyone who used irradiations for high entropy alloy at room temperature ?

hey, is there anyone who used irradiations for high entropy alloy at room temperature ?

28 April 2024 1,824 1 View

Is there anyone who used iiradiation exp at room temp for high entropy alloy?

is there anyone who used iiradiation exp at room temp for high entropy alloy? can you please give me some suggestion

28 April 2024 2,003 0 View

How do anesthesiologists approach emergency {trauma} surgery when a patient has a full stomach and is also experiencing coughing?

Due to the motor vehicle accident or any other accident if a patient has already cough and cold and has full stomach then, How do anesthesiologists approach this type of emergency surgery ?

24 April 2024 9,897 3 View

Is there a problem with my RNA pellet?

Hello, I am currently having problems with RNA extraction. I am using mouse liver (C57BL6J), and I have extracted RNA from mouse liver before. Before this experiment, my final RNA pellets were...

11 August 2024 7,082 3 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

RNA Extraction Using Hot Borate Method No Longer Working?

I've been performing RNA extraction on cotton petiole tissue for a few months now using the method described in the following paper, a derivative of the typical hot borate method...

08 August 2024 9,882 2 View

Does Anyone have expertise in in vitro transcription and RNA pull down assay?

I am currently working on LncRNA; to know the lncRNA-protein interactions I want to do RNA pull down assay, so I need to design primers with T7 promoter. I need assistance in this regard.

07 August 2024 6,622 1 View

E.coli contamination in human RNA seq data ?

Recently, we observed that 99% of the sequences in our RNA-seq data corresponded to the E. coli genome. Despite multiple DNAse treatments after RNA extraction and ribosomal depletion, we were...

06 August 2024 807 3 View

How do soil microflora interact with plant roots and influence plant nutrition, health, and productivity?

06 August 2024 9,618 3 View

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

Are there instances where molecules with larger molecular weights exhibit greater mobility than those with smaller molecular weights?

Hi, I know that low molecular weight (MW) molecules generally tend to have higher mobility, while high molecular weight molecules tend to have lower mobility. However, in my experimental...

06 August 2024 1,495 2 View

For an in-vitro drug release study, what molecular weight cut-off (MWCO) dialysis bag is required for a 117 kDa protein?

kindly reply me. Thanking you in advance.

05 August 2024 7,727 4 View

Do you have good tips for seaweed tissue preservation in the field for post RNA extraction?

I will be with my students collecting seaweed samples in a marine farm and later we will process this tissue for RNA isolation and further sequencing. Does anyone have tips on how to collect the...

04 August 2024 501 2 View

Shahid Farooq

Thank you very much :)

Manvendra Singh

Actually that depends, on genome size, degree of annotations and genome build.

e.g.

for Humans if 10M reads are mapped then its good for differential gene expression analysis. Since genome is well annotated and assemble for humans so you would get most of reads mapped to genome.

In your case if genome is not constructed well, then you would expect lesser reads mapping to genome or transcriptome. so you would need deeper sequencing to get >10X coverage of genome of your interest.

Reference genome is present for weed species on which I will work. I am planning to go with 25 M reads per sample. Will it be ok or I must decrease the depth to somewhat 20, 15 or 10? Waiting for your answers.

I would not suggest to go below 25M reads

25M is adequate or you will suggest more deeper sequencimg Manvendra Singh?

Kristoffer Vitting-Seerup

I cannot add anything to read depth since I agree with the answers above.

But even more importantly (down to a certain threshold) than read depth, is the number of replicates since they are basis on which you are going to do your statistics. Here I would always recommend 3-4 since that is where most tools approach the saturation level.

Christian Cole

Did you read Michael's suggesteds paper? From reading it you should be clear that depth doesn't matter as much as replication.

For a given lane of HiSeq data you get about ~150M paired-end reads. You could get 15 samples with 10M reads or 1 sample with 150M reads. Getting one sample with 150M will *not* give you better data than 5 replicates of 3 three samples each 10M reads. Always go for replicates over depth.

For our Arabidopsis studies we always do a minimum of 30M reads per replicate. Stranded RNAseq is very important.

Kasturi Pawar

Hi, I want to ask as to how to modify the script in R so that it runs the data at sequencing depth 30M instead of 2.5M. The current data reads at 2.5M. Thanks.

If you already have some script in R, and want to modify for specific reasons then we could possibly help.

Regarding your question, Its the same computational pipeline that works for either 2.5M or 30M reads.

hth

@Kasturi what R script? Can you post it somewhere?

The processing and analysis should be the same regardless of sequencing depth.

Hello Christine and Manvendra: Thank you. I am very new to R and just learning it. I have a data which is at 2.5M and would like to convert so it reads at 30M. Dont know how to do that. Here is the script:

# Read the data in "count_matrix_sub_2.5M.txt"

# and run a DESeq2 experiment comparing the transcriptome

# of untreated MCF-7 tumor cells to those treated with estrogen.

# There are 7 replicates for each of the conditions (untreated, treated)

### RUN next two lines once

#source("http://bioconductor.org/biocLite.R")

#biocLite("DESeq2")

#install.packages('stringi')

#library('DESeq2')

## try http:// if https:// URLs are not supported

#source("https://bioconductor.org/biocLite.R")

setwd("C:/Users/kasturi.pawar/Documents/RNA SEQ")

countTable

Hello Christian and Manvendra: I just tried changing 2.5 to 30M wherever it appears in the script, but not sure if that's the correct way of doing it. Also, wanted to know the plots generated are different (histogram, MA plot) in both scripts. What is the difference in plots when I use 30M instead of 2.5M - I know it gives better results but want to know exactly how? sorry, I may be asking basic/simple questions but I really want to understand the difference it makes when I change the read depth. I also want to know if I need to change the "condition" from a character to numeric by doing this: #f

Here are those plots for the previous post.

Thanks for the script and plots.

You're overcomplicating things; your makeDataSet() function is unnecessary. Most importantly specifying the number of reads like you're doing is wrong unless you have a very good reason to do so. Especially as you're setting each sample to exactly the same number of reads, this is never the case in RNA-seq. DESeq2 expects the read depth for sample to be different in order to normalise them appropriately. Just let it do it's calculations for you.

I recommend you follow the DESeq2 manual, esp section 1.3.3 Count Matrix Input. You should only need to do something like this:

countData = read.table("count_matrix_sub.txt", header=TRUE, sep="\t", row.names=1)

conditions = c(rep("untreated",numRep), rep("treated",numRep) )

colData = data.frame(condition = conditions)

# I'm assuming the samples are in the same order in the file as they are here.

# not sure how much that matters to DESeq

rownames(colData)

Hi Kasturi, I agree with Christian, I would also have given similar suggestion. I upvoted the answer

Shweta Jha

Dear Manvendra, As u said sequencing read depth for RNAseq depends on genome size. Plz let me know if what is the rule of thumb for that? If I have 2500 MB genome size and want to perform de-novo assembly (seq by illumina Hi-seq, paired 100 bp read length), then what should be minimum amount of sequencing data for differential gene expression analysis?

@shweta I presume you mean genome assembly? Is the genome diploid, haploid or multiploid? The higher the ploidy of a genome the more complex assembly will be.

Genome assembly is highly dependent on the length of repetitive elements in the genome. The more there are and the longer they are the harder the assembly will be. For that reason raw sequence read length is amongst the single most important requirement for a good assembly. 100bp PE illumina sequencing will guarantee you get a highly fragmented and incomplete genome, especially one the size of 2.5GB.

For any assembly, I would recommend going with PacBio or Nanopore sequencing which gives reads in the 10,000-100,000bp range. Ignore the "problems" of read errors, they are far simpler to fix than an incomplete/fragmented genome.