Discovery of RNA viruses from metagenome and metatranscriptome datasets? a good protocol?

Guillermo Domínguez Huerta @Guillermo-Dominguez-Huerta

23 July 2018 1 4K Report

Hi all,

this one is a (I guess) tricky question...

RNA virus discovery from metagenome/metatranscriptome dataset (overall from environmental samples) is particularly difficult because of their VERY DIVERGENT genome sequences, with poor relationship with what is available in reference sequence databases.

Can you recommend a "typical" protocol for this?

I found 2 "versions" by now:

**#FIRST PROTOCOL#**

- Assemble reads with Trinity or metaSPAdes.

- Do tBLASTn with the generated contigs/scaffolds against a database made of RNA virus proteins (ssRNA and dsRNA viruses). Use an e-value cutoff of

Darren J Obbard

I would start by translating the RNA to obtain putative protein sequences - just concatenate all of the translations to make a protein pseudo-sequence that captures the (potential) protein content of your RNA. This will be faster to search with, and is usually more sensitive than blastx or tblastn.

I would strongly recommend using the virus proteins from nr rather than refseq, as refseq only has (near)complete genomes, and divergent (especially) segmented viruses will gain a lot by using all rather than refseq proteins. In my experience this will find viruses with short regions of ~30%+ identitiy to known viruses.

An HMM will be more sensitive - and is probably the best option - but for transcriptome-scale data might prove too slow. I would recommend diamond (method blastp) as being much much faster, and only a little less sensitive than blastp itself (default options).

Having identified possible viruses in this way, *then* blast (or diamond) against refseq (as DNA and protein) to exclude those that are much closer to cellular organisms than viruses, even if they hit viruses initially

Be aware that many DNA virus 'hits' will turn out to be transposable elements from the host. This reflects the presence of many closely-related transposable elements in very large DNA viruses that are not well sampled from their eukaryotic hosts

Badges
Science topic

More Guillermo Domínguez Huerta's questions See All

Can I use Polyjet after its expiration date?

I have a Polyjet that has passed its labeled expiration date for 1 year and I'm going to re-start transfection experiments. It is really needed to change it? Transfection efficiency may be lower?

23 July 2024 8,059 1 View

¿Where to buy the Aspergillus oryzae NSAR1 strain?

Greetings: I am looking to buy/obtain the auxotrophic strain Aspergillus oryzae NSAR1, but I just have found one culture collection from Japan but haven't received any answer. Could you provide...

11 July 2024 6,847 0 View

What does effector:target ratio really means in CAR-T cell therapy?

Regarding CAR-T cell therapy, during co-cultures with different effector:target ratios, when we refer to "effector", does it involve the total T cells? For example, does a ratio of 5:1 mean 5...

28 May 2024 2,757 4 View

How can I calculate the effective dielectric constant of a thin film and substrate?

I am trying to figure out a way of calculating the dielectric constant of a thin film of Yb2O3 on ITO. Assuming that the thickness of the Yb2O3 is known and that it's

05 May 2024 5,989 1 View

What's the proper way to clean lactophenol blue from microscope slides?

Hi, I've been cleaning lactophenol blue from microscope slides using ethanol (70% or 96%) but I don't know if it's okay to mix them with lactophenol blue. I already looked for information in SDS,...

22 April 2024 1,251 1 View

¿Piensas que una persona que ha maltratado psicológicamente y físicamente puede rehabilitarse 100%?

ESTÁN TODAS LAS PERSONAS PREPARADAS PARA UNA REHABILITACIÓN 100% EFICAZ

04 March 2024 3,344 3 View

What's the optimal observation count per category for Machine Learning?

Hello everyone, I'm seeking some advice or references related to the optimal number of observations needed per category within a categorical variable for machine learning projects. I've come...

29 February 2024 9,289 2 View

Some expert using "GRADE" to the assessment of certainty in evidence of systematic reviews?

We are preparing an umbrella review and we need someone who wants to collaborate and perform the assessment of certainty in evidence of systematic reviews using GRADE on the systematic reviews...

20 February 2024 9,244 0 View

What am I doing with AI in campus?

With the rising application in AI (ChatGPT, Dall-e, Tome, etc) in education, for better or for worse, what actions are you taking to improve your research and your teaching using AI...

12 February 2024 3,893 2 View

Has anyone have taken confocal images of apoptotic cells stained with Apotracker Tetra?

Hi! I was wondering if anyone has taken confocal images of apoptotic cells stained with Apotracker Tetra, from Biolegend. There is information using Apotracker Green, but I haven´t found a...

06 February 2024 8,343 0 View

How can I prepare virus for a TEM or SEM imaging?

I have virus (viral hemorrhagic septicemia virus) in suspension and the experiment will not involve cells. What level of TCID50 is preferred?

11 August 2024 3,115 1 View

Is there a problem with my RNA pellet?

Hello, I am currently having problems with RNA extraction. I am using mouse liver (C57BL6J), and I have extracted RNA from mouse liver before. Before this experiment, my final RNA pellets were...

11 August 2024 7,082 3 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

RNA Extraction Using Hot Borate Method No Longer Working?

I've been performing RNA extraction on cotton petiole tissue for a few months now using the method described in the following paper, a derivative of the typical hot borate method...

08 August 2024 9,882 2 View

Does Anyone have expertise in in vitro transcription and RNA pull down assay?

I am currently working on LncRNA; to know the lncRNA-protein interactions I want to do RNA pull down assay, so I need to design primers with T7 promoter. I need assistance in this regard.

07 August 2024 6,622 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

E.coli contamination in human RNA seq data ?

Recently, we observed that 99% of the sequences in our RNA-seq data corresponded to the E. coli genome. Despite multiple DNAse treatments after RNA extraction and ribosomal depletion, we were...

06 August 2024 807 3 View

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

If we are using snowball sampling technique, how do we justify the true representativeness of the sample statistically? is there any statistical test?

Are there any statistical methods to justify your sampling technique using SPSS or AMOS?

05 August 2024 9,153 4 View

Do you have good tips for seaweed tissue preservation in the field for post RNA extraction?

I will be with my students collecting seaweed samples in a marine farm and later we will process this tissue for RNA isolation and further sequencing. Does anyone have tips on how to collect the...

04 August 2024 501 2 View