I am looking for algorithm(s) to extract data and its value from unstructured documents.
We were given list of tokens against which the information from the documents to be extracted.
Going forward the ML should learn from itself to extract correct token and its values (possibly using RNN)
Preferably example using python would be useful.
Thanks