Genome sequence data need to be converted into a numerical dataset before a machine learning algorithm can be applied to them.
Alternatively, you may want to convert it to binary; for that, use
https://github.com/abhijeetsingh1704/DNA-to-BINARY
If you still want a numerical encoding, change the nucleotide codes in the script and it will convert DNA to numbers instead.
Why not simply use sed in Linux? For example:
sed -i 's/A/1/g' genome.fasta
sed -i 's/U/2/g' genome.fasta
and so on for the remaining nucleotides (for DNA sequences, use T rather than U).
Fabiano Sillo
What about the FASTA headers? With your suggestion, they would be changed as well.
Abhijeet Singh
By excluding the header lines from the replacement, of course.
For example, to replace "A" with "1" on sequence lines only:
sed -i -e '/^[^>]/s/A/1/g' genome.fasta
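The same idea can be applied to all four nucleotides in one pass. The following is only a minimal sketch, assuming an uppercase DNA alphabet (A, C, G, T) and an arbitrary mapping to 1, 2, 3, 4; the file names example.fasta and example.num.fasta are hypothetical. It uses sed's y (transliterate) command and writes to a new file instead of editing in place with -i.

```shell
# Create a small hypothetical FASTA file for illustration.
printf '>seq1 sample A\nACGT\nTTGA\n' > example.fasta

# On every line that does NOT start with ">", map A,C,G,T to 1,2,3,4.
# Header lines are left untouched; the mapping itself is an assumption.
sed -e '/^>/!y/ACGT/1234/' example.fasta > example.num.fasta

# Show the result.
cat example.num.fasta
```

Lowercase bases and ambiguity codes such as N are not covered by this mapping and would pass through unchanged, so extend the two character lists in the y command if your data contain them.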