My motivation is to somehow (blindly) learn (negative) patterns with plain text corpora.

For a bag of words {this, is, a, book}, once a corpus tells us there is no usage of "book is this a" for sure, and so on so forth, then hopefully by negation one may find some hidden rules to promote the bag-of-words model to something similar to LDA.

http://en.wikipedia.org/wiki/Anastrophe

http://en.wikipedia.org/wiki/Latent_Dirichlet_allocation

Similar questions and discussions