A Data set of siRNA sequence:

CUAAUAUGUUAAUUGAUUU

AAUAUGUUAAUUGAUUUAU

GAUUUAUACAAUUCCUUUC

CAAUUCCUUUCAAUUUUAU

CAGACCAAAAUUAAAUAAG

AGACCAAAAUUAAAUAAGA

ACCAAAAUUAAAUAAGAAA

CAAAAUUAAAUAAGAAAGU

UAAGAAAGUUACAUAAGAU

AAGUUACAUAAGAUUCCAU

ACAUAAGAUUCCAUUUGAG

AAGAUUCCAUUUGAGCAUA

CCAUUUGAGCAUACAUAAG

CAUUUGAGCAUACAUAAGG

AUAAGGCCAUGAUACUUUA

GCCAUGAUACUUUAAUGUG

UUAAUGUGAACCACCAUUU

UGUGAACCACCAUUUCUUG

GAACCACCAUUUCUUGGAA

AUUUCUUGGAAGAAAGAAG

UGGAAGAAAGAAGACAUCC

GGAAGAAAGAAGACAUCCA

GAAAGAAGACAUCCAAAUG

AAGACAUCCAAAUGUCCGA

CAUCCAAAUGUCCGAUUCA

UUCCUGGCCAGUCAUCCAG

CCUGGCCAGUCAUCCAGUA

AGUCAUCCAGUAGACUCUC

AGACUCUCUCCACUCUUCA

How can it be converted to a vector of 4, 16, 64, 256, etc. multidimensions using frequencies of its mono, di, tri, tetra nucleotide sub sequences respectively?

More Ranjan Sarmah's questions See All
Similar questions and discussions