Since hand-labeled data are difficult to obtain, weak supervision and distant supervision methods are often presented as the best alternative.

https://hazyresearch.github.io/snorkel/blog/weak_supervision.html

However, I am wondering whether we can get high-precision results with such techniques, since the training corpus itself is usually pretty noisy.

Thank you in advance!
