Hi,
I need to write a tool that would by given keywords search in database of articles and recommend to users the most likely article containing proper information. I was thinking to use following search heyristics:
1. if keywords are in text in close proximity (near each other) it is more probable that article is on topic
2. if I can find article on some topic lets say from 2008 and on the same topic from 2012 (so newer) with many negations in text I could assume that old research was wrong and I should proritise newer article
3. I should allow queries in which user could define if he is looking exactly for some amount of keyword in one text or only one or more of them. Or for example that some keywords must be found and some does not
Are my assuptions correct ? Do you have any better ideas to return more accurate results ?