Found this 276,467 sequences from virus along HumanReferecneGenome. What would be next step?

More Ernesto Rios Willars's questions See All

Alguien de habla hispana trabajando en computational algorithm Nurse Rostering Problem?

Busco algun colega que pueda apoyar con la revisión de un manuscrito

08 September 2018 3,460 2 View

Information about alzheimer peptide ?

searching for a data base to find sequences and information by country or sector

03 April 2018 7,852 1 View

What is a patogen bacteria found in diabetic foot?

searching for bacteria found in diabetic foot known as treatment target and why

02 March 2018 6,735 6 View

Is there any relation between diabetes and bacteria?

I am searching for bacterial genome found playing any role in diabetes phases

02 March 2018 1,036 7 View

Is there any relation between virus and gut microbiota?

Searching for a topic where we can explore diabetes by bioinformatics

02 March 2018 1,276 4 View

Is there any connection between endogenous retroviruses and diabetes?

can anyone provide information in this area. We are trying to explore this topic in terms of bioinformatics

01 February 2018 2,700 4 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Why does my protein refolded to beta sheet during thermal denaturation analysis?

Hi! So i attempted to understand a novel protein behavior towards heat application by analyzing its secondary structure change. I subjected the protein to a thermal denaturation analysis using...

06 August 2024 1,989 3 View

Matej Lexa

You could look at the COVERAGE of the human reference genome by n-mers from the virus and vice versa. In doing so you could allow small holes or other imperfections in otherwise perfectly covered/matching regions to vary the sensitivity of the mapping.

Visualizing this data on the human genome reference, you could see which parts of the genome are 0, 10%, 20%,...,90% or 100% covered by some virus sequence.

On the virus sequence, you could see whether different parts of the virus are equally repetitive in the reference, or taking into account the imperfections, which regions of the virus can only be mapped to the same coverage with 0%,10%,...80%, 90% mutations allowed for.

Ernesto Rios Willars

Matej that sounds good as next step, i am thinking ways to do it with an adaptation of this algorithm. Actually it goes searching all posible n-mers sequence validating String.contains() metod.

Its hard and slow for an intel core i7 but at least it runs parallel proccess.

By the way, it keeps finding new sequences, now i have a set of 194,538 sequences from 100mers to 200mers.

however, its known that this retrovirus is endogenous in human genome, but its a validation for this tool. I will follow your recomendation related to the coverage, but first i wonder wich virus-vs-genome would be apropiate for exploration.