How do I get TopHat to make my Bowtie2 index files?

Terezinha Souza

You have to create the index yourself using the "bowtie2-build" command, with the genome FASTA file as its argument. This step is done only once, and the location of the index files is then passed as an argument to the tophat command.

Here's more info on bowtie2-build: http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml#the-bowtie2-build-indexer
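Since the index only needs to be built once, it can help to check whether it already exists before rebuilding. The following is a minimal sketch, not part of Bowtie2 or TopHat: the function name bt2_index_exists and the basename "genome" are hypothetical, and it assumes a standard "small" index, for which bowtie2-build writes six files ending in .1.bt2, .2.bt2, .3.bt2, .4.bt2, .rev.1.bt2 and .rev.2.bt2 (very large genomes get .bt2l files instead, which this check does not cover).

```shell
# Hypothetical helper: succeed only if all six small-index files
# for the given basename are present.
bt2_index_exists() {
    bt2_base="$1"
    for bt2_suffix in 1 2 3 4 rev.1 rev.2; do
        # Each file is named <basename>.<suffix>.bt2
        [ -f "${bt2_base}.${bt2_suffix}.bt2" ] || return 1
    done
    return 0
}

# Assumed usage (file names are placeholders): build only when missing.
# bt2_index_exists genome || bowtie2-build genome.fa genome
```

The basename given to the check is the same one that is later passed to tophat, so a failed check usually means either that the index was never built or that the basename or directory is wrong.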

Adam P Cribbs

Tophat is a very old method for mapping RNA-seq reads to a reference genome. It was superseded by tophat2, then hisat, and then hisat2 some time ago. Therefore, I would recommend hisat2: https://ccb.jhu.edu/software/hisat2/index.shtml

However, even hisat2 is now looking very old in comparison to more modern pseudoaligners such as kallisto (https://pachterlab.github.io/kallisto/about) or sailfish (https://sailfish.readthedocs.io/en/master/sailfish.html). There are many advantages to taking this approach for RNA-seq; look at the documentation and paper for each (for example, it is more accurate and much faster).

Gen Lin

Hi Benjamin,

As Terezinha Souza pointed out, you need to create the bowtie2 index for the genome first. You do not need to export $BOWTIE2_INDEXES. The transcriptome index you tried to create with your command requires the genome bowtie2 index. Likewise, for the alignment you still need the genome bowtie2 index.

First, start by removing the transcriptome data folder:

rm -rf transcriptome_data

Then:

bowtie2-build Tcas.fa Tcas

# this will create Tcas*bt2 in the current directory

# now create the transcriptome index

tophat2 -G Tcas.gff --transcriptome-index=transcriptome_data/Tcas Tcas

# now run tophat2 on the reads in test.fastq

tophat2 --transcriptome-index=transcriptome_data/Tcas Tcas test.fastq

Benjamin Goldman-Huertas

Thank you to Terezinha Souza and Gen Lin for the thorough replies. I did figure this out eventually, but I hope this shows up in a web or ResearchGate search for everyone. The manual is a bit vague and misleading in this section, though I sympathize that it is difficult to delineate every single step involved. I don't doubt others will have a similar difficulty.

Adam P Cribbs, I am looking into Hisat2, but I am a little wary of Kallisto because of the biased log-fold change estimate B it spits out. The authors say it is accurate for sign (less/more expressed), but not as reliable for magnitude of change, though maybe that has changed?

Erum Yasmeen

Gen Lin, I am a beginner in RNA-seq data analysis. To run tophat2 on my reads, do I need to use the output files of Trimmomatic? Trimmomatic generates four files:

output_forward_paired.fq.gz
output_forward_unpaired.fq.gz
output_reverse_paired.fq.gz
output_reverse_unpaired.fq.gz

Which of these reads should I use?

Can you please guide me? Thank you in advance.