I have certain concepts that I want to match in longer sentences. I have about 50 concepts like this:

chief accepted policy,  

chief rejected policy,

proposal was deferred,  

...  

proposal accepted consensus,  

no consensus proposal rejected,

I am trying to draw up a diagram of how this organisation makes decisions regarding proposals. A proposal comes in every week. I have text files of all their email messages in which they state these actions. I break the messages into paragraphs by identifying blank lines, and then use Stanford NLP to break the paragraphs into sentences. Unfortunately, their email messages contain long sentences.
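To make the preprocessing concrete, here is a minimal sketch of the two splitting steps. It is not my actual pipeline: the regex sentence splitter is only a naive stand-in for the Stanford NLP sentence splitter, the function names `split_paragraphs` and `split_sentences` are hypothetical, and the sample message is made up for illustration.

```python
import re

def split_paragraphs(text):
    # Paragraphs are separated by one or more blank (or whitespace-only) lines.
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]

def split_sentences(paragraph):
    # Naive stand-in for a real sentence splitter:
    # break after '.', '!' or '?' when followed by whitespace.
    flat = " ".join(paragraph.split())  # undo hard line wraps inside the paragraph
    return [s for s in re.split(r"(?<=[.!?])\s+", flat) if s]

# Made-up sample message, for illustration only.
message = (
    "Hi all,\nthe revised proposal was accepted by chief at seminar. "
    "Minutes will follow.\n\n"
    "Chief has decided that this proposal will not be accepted."
)

sentences = [s for p in split_paragraphs(message) for s in split_sentences(p)]
for s in sentences:
    print(s)
```

A real splitter handles abbreviations and missing spaces after full stops, which this regex does not.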

I have tried sentence similarity using ws4j (WordNet), text classification (Weka), and string matching to match these concepts to sentences (it would be difficult to paste the code here), but these do not give the desired result. I am going to look at word order similarity, but I don't think it will help. Bag of words and phrase matching won't help either.

Here is why it's not giving results: mainly, the sentences contain so many other things. For example, I tried string matching with the words “chief”, “accepted” and “proposal”, looking for instances where the chief actually accepted the proposal.

Meaningful capture:

"the revised proposal was accepted by chief at seminar."

Inaccurate capture: the same words also appear in the following sentences, which do not mean the same thing:

"Chief, i 'm particularly interested in hearing your opinion about the following three issues : should this proposal be accepted at all . ."

"Chief said, i will retract the proposal, because is too late for me.is the proposal accepted ? ."

"hi chief, should this proposal be accepted at all . ."

Opposite meaning: sometimes the sentences also carry the opposite meaning, for example:

"Chief has decided that this proposal will not be accepted"
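The problem can be reproduced with a small sketch. This is not my actual code, only a simplified illustration under the assumption that matching means "all concept words occur somewhere in the sentence"; the function name `keyword_match` is hypothetical. Every example sentence above matches, including the questions and the negated one.

```python
import re

def keyword_match(sentence, keywords):
    # True when every keyword occurs as a token in the sentence,
    # regardless of order, negation, or who is speaking.
    tokens = set(re.findall(r"[a-z']+", sentence.lower()))
    return all(k in tokens for k in keywords)

concept = ["chief", "accepted", "proposal"]

examples = {
    "meaningful": "the revised proposal was accepted by chief at seminar.",
    "question 1": "Chief, i 'm particularly interested in hearing your opinion "
                  "about the following three issues : should this proposal be "
                  "accepted at all . .",
    "question 2": "Chief said, i will retract the proposal, because is too late "
                  "for me.is the proposal accepted ? .",
    "question 3": "hi chief, should this proposal be accepted at all . .",
    "opposite": "Chief has decided that this proposal will not be accepted",
}

results = {label: keyword_match(text, concept) for label, text in examples.items()}
for label, matched in results.items():
    print(label, matched)
```

Only the first sentence is a correct capture, yet the matcher cannot tell it apart from the other four.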

If anyone has any ideas, please advise on how to go about doing this.

I want to extend this study to more organisations after this one. They may use other methods of dealing with proposals. Therefore, I believe string matching, bag of words, and regex matching might not work.
