Could you please suggest some good articles (original research or reviews) that compare text similarity measures such as cosine similarity, BM25, and language modeling in terms of performance (accuracy, ranking, etc.)?
Text similarity analysis is a hot topic in data mining. I suggest looking at articles published in top conferences and journals such as KDD, PAKDD, ICDM, and TKDE.
Thanks a lot for your valuable hints. Yes, Sanad, I have already studied the paper you suggested. It covers a good range of similarity measures and their characteristics, and it's really good. However, rather than descriptions of their intrinsic attributes, I am looking for experimental tests/comparisons (in terms of performance) between them. Any clue?