I have to do text classification with weka (SVM (SMO) algorithm)). Actually, it's a word sense disambiguation project. the files I prepared, contain some sentences (Persian language) and an English word in front of each sentence which shows its class. The files are arff. My question: for using SMO in weka for classifying these sentences, should I give weka these sentences as input or should I change the words to vector and give these vectors to weka? if I have to use the second one, according to what factor should I change words to vectors? I've enclosed a sample file.