How to organize RNA-Seq FASTQ data?

Rana Jaber Tarish Al-Baghdadi Popular answer

Hi,

You need to download your files first. Second, align your reads to the reference genome (I used the Tuxedo suite and edgeR). Third, calculate gene expression and identify the differentially expressed genes (DGE).

What I have done is:

1. Raw RNA-seq reads were mapped to the Mus (mouse) genome using Bowtie and TopHat. Bowtie stores the reference genome sequence in an FM index, a structure that allows the sequence to be searched rapidly, and uses it to align reads at a rate of tens of millions of reads per CPU hour. Bowtie aligns short reads without large gaps, so on its own it cannot align reads that span introns. TopHat is a splice-aware aligner used to find transcript splice sites, and it uses Bowtie as its core alignment engine. TopHat breaks reads that cannot be aligned in one piece into smaller pieces called segments and aligns those to the genome. When segments of a read align to the genome between 100 bp and several hundred kilobases from one another, TopHat infers that the read spans a splice junction and estimates where the splice sites are. Polymorphisms can be identified from the mismatches, insertions, and deletions in the alignments. Aligned reads can also be used to quantify gene and transcript expression, since the number of reads from a transcript is proportional to its abundance.

2. The files produced by Bowtie and TopHat were then run through Cuffdiff. Cuffdiff calculates gene expression and tests the statistical significance of observed changes in expression between two or more samples. It assumes that the number of reads from a transcript is proportional to its abundance, and it accepts multiple replicates per condition. The Cuffdiff output files contain the change in gene expression level (log2 fold change), P values (raw and corrected for multiple testing), gene names, and gene locations in the genome.

3. The Cuffdiff output files were then run through CummeRbund. CummeRbund loads the Cuffdiff data into the R statistical environment to cluster and plot the expression data.
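As a rough illustration of steps 1 and 2, here is a minimal Python sketch that only builds the TopHat and Cuffdiff command lines, without running them. The function name `tuxedo_commands`, the sample layout, the index name `mm10`, and all file names are hypothetical. The sketch assumes paired-end reads, an existing Bowtie index, and a GTF annotation, and relies on TopHat writing `accepted_hits.bam` into each output directory.

```python
from pathlib import PurePosixPath


def tuxedo_commands(samples, index, gtf, out_dir="results", threads=4):
    """Build TopHat and Cuffdiff command lines (as argument lists) without running them.

    samples maps a condition label to a list of (read1, read2) FASTQ paths,
    one tuple per replicate. All paths here are illustrative assumptions.
    """
    out = PurePosixPath(out_dir)
    commands = []
    bams_per_condition = {}
    for condition, replicates in samples.items():
        bams = []
        for i, (read1, read2) in enumerate(replicates, start=1):
            sample_dir = out / "tophat" / f"{condition}_rep{i}"
            # tophat [options] <bowtie index basename> <reads_1> <reads_2>
            commands.append(
                ["tophat", "-p", str(threads), "-o", str(sample_dir), index, read1, read2]
            )
            bams.append(str(sample_dir / "accepted_hits.bam"))
        bams_per_condition[condition] = bams

    # cuffdiff takes one comma-separated list of replicate BAMs per condition
    cuffdiff = [
        "cuffdiff", "-p", str(threads), "-o", str(out / "cuffdiff"),
        "-L", ",".join(bams_per_condition), gtf,
    ]
    cuffdiff += [",".join(bams) for bams in bams_per_condition.values()]
    commands.append(cuffdiff)
    return commands


if __name__ == "__main__":
    example = {
        "ctrl": [("ctrl1_1.fq", "ctrl1_2.fq")],
        "treat": [("treat1_1.fq", "treat1_2.fq")],
    }
    for cmd in tuxedo_commands(example, "mm10", "genes.gtf"):
        print(" ".join(cmd))
```

Each returned list could be passed to `subprocess.run` once the tools are installed. Printing the commands first makes it easy to check that every replicate ends up under the right condition label.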

It is a long way, but you will do great in the end!

Good luck!

Boas Pucker

There are two main approaches to using RNA-Seq data. Depending on your research question, you could do

(1) a transcriptome assembly (e.g. via Trinity) or

(2) a read mapping against a reference genome sequence (e.g. via STAR, HISAT2).

However, this is just a very brief description of the two most frequently applied approaches. Working with tools for NGS data takes some time, and interpreting the results requires some knowledge of the methods. If you are not familiar with the aforementioned tools, you should seek help from a bioinformatics core facility.
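For the read-mapping route (2), a first practical step is usually to group the FASTQ files into read pairs per sample. Below is a minimal Python sketch that does this and builds one HISAT2 command per sample. It assumes a `<sample>_R1.fastq.gz` / `<sample>_R2.fastq.gz` naming convention, which may not match your files, and the function names, the index name `genome_index`, and the output names are hypothetical.

```python
import re
from collections import defaultdict

# Assumed naming convention: <sample>_R1.fastq[.gz] and <sample>_R2.fastq[.gz]
PAIR_PATTERN = re.compile(r"^(?P<sample>.+)_R(?P<mate>[12])\.fastq(\.gz)?$")


def pair_fastq(filenames):
    """Group FASTQ file names into {sample: (read1, read2)}.

    Files that do not match the pattern, and samples missing a mate, are skipped.
    """
    mates_by_sample = defaultdict(dict)
    for name in filenames:
        match = PAIR_PATTERN.match(name)
        if match:
            mates_by_sample[match.group("sample")][match.group("mate")] = name
    return {
        sample: (mates["1"], mates["2"])
        for sample, mates in sorted(mates_by_sample.items())
        if len(mates) == 2
    }


def hisat2_commands(filenames, index, threads=4):
    """Build one paired-end HISAT2 command (as an argument list) per complete sample."""
    return [
        ["hisat2", "-p", str(threads), "-x", index,
         "-1", read1, "-2", read2, "-S", f"{sample}.sam"]
        for sample, (read1, read2) in pair_fastq(filenames).items()
    ]


if __name__ == "__main__":
    files = ["liver_R1.fastq.gz", "liver_R2.fastq.gz", "brain_R1.fastq.gz", "notes.txt"]
    for cmd in hisat2_commands(files, "genome_index"):
        print(" ".join(cmd))
```

Samples with a missing mate are silently dropped here. In a real project it would be safer to report them, since an unpaired file often points to an incomplete download.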

Rana Jaber Tarish Al-Baghdadi