A Data set of siRNA sequence:
CUAAUAUGUUAAUUGAUUU
AAUAUGUUAAUUGAUUUAU
GAUUUAUACAAUUCCUUUC
CAAUUCCUUUCAAUUUUAU
CAGACCAAAAUUAAAUAAG
AGACCAAAAUUAAAUAAGA
ACCAAAAUUAAAUAAGAAA
CAAAAUUAAAUAAGAAAGU
UAAGAAAGUUACAUAAGAU
AAGUUACAUAAGAUUCCAU
ACAUAAGAUUCCAUUUGAG
AAGAUUCCAUUUGAGCAUA
CCAUUUGAGCAUACAUAAG
CAUUUGAGCAUACAUAAGG
AUAAGGCCAUGAUACUUUA
GCCAUGAUACUUUAAUGUG
UUAAUGUGAACCACCAUUU
UGUGAACCACCAUUUCUUG
GAACCACCAUUUCUUGGAA
AUUUCUUGGAAGAAAGAAG
UGGAAGAAAGAAGACAUCC
GGAAGAAAGAAGACAUCCA
GAAAGAAGACAUCCAAAUG
AAGACAUCCAAAUGUCCGA
CAUCCAAAUGUCCGAUUCA
UUCCUGGCCAGUCAUCCAG
CCUGGCCAGUCAUCCAGUA
AGUCAUCCAGUAGACUCUC
AGACUCUCUCCACUCUUCA
How can it be converted to a vector of 4, 16, 64, 256, etc. multidimensions using frequencies of its mono, di, tri, tetra nucleotide sub sequences respectively?