Every textbook describes rule-based approaches (two-level morphology) or lexicon-based approaches to the morphological analysis of words. I could also imagine the task being done by a Hidden Markov Model (HMM) that takes a sequence of morphemes as input and outputs a lemma, word class, and morphological features (such as plural, 3rd person, future, etc.). Does anyone know of a paper that describes such an approach? I guess that if such a paper exists, it might be quite old.

To be clear: I am not interested in POS tagging, only in morphological analysis. I am mostly interested in inflectional/fusional Indo-European languages, but pointers to approaches for Finnish, Turkish, Japanese, or other languages would of course be helpful as well!
