I am working on a project involving vocabulary frequency lists and segmentation for Burmese. I am interested in knowing if what you do is related to what I am doing.
I was doing thesis about text classification on Myanmar news.So,basically ,my pre -processing step involve segmentation and collecting stop words for Burmese words.
Can you tell me the product/results of your text classification? How will the results/product be used? What factors/criteria do you use to classify texts?
Stop words mean words ending in glottal stops?
Here, we have been doing segmentation of Burmese words to make word frequency lists in Burmese.