Since hand-labeled data are difficult to obtain, weak supervision and distant supervision methods are often presented as the best alternative.
https://hazyresearch.github.io/snorkel/blog/weak_supervision.html
However, I am wondering whether we can get high-precision results with such techniques, since the training corpus itself is usually pretty noisy.
Thank you in advance!