A typical IE/NLP pipeline involves sentence segmentation, tokenization, POS tagging, chunking, entity detection and relation extraction tasks. Compare and contrast the functionalities provided following frameworks/toolkit for implementing such a pipeline using the literature:
(a) NLTK, (b) Stanford CoreNLP, (c) Apache OpenNLP (d) SpaCy and (e) GATE
Consider one such task (e.g. POS tagging, chunking or Named entity recognition) and evaluate the performance of two of the above tool kits on a relevant CoNLL data set.
(CoNLL--‐2000 for chunking, CoNLL--‐2002 for NER)