I need to find the percent similarity between a given query and documents. Using various algorithms (Cosine Similarity, BM25, Naive Bayes) I could rank the documents and also compute numeric scores. However I need to find the percent similarity between the query and document for a given score. E.g.
If a document has highest score of (lets say) 32.0 for given query. Then how to convert the number 32.0 to percentage similarity.
As I don't know the highest value of similarity I can't compute the percentage. I tried using z-scores, but in that case also the scale is changed but we can't comment regarding the percent similarity, as a document with z-score of 0.5 or -0.5 or -2.0 could be the best match for the given query.
Is there any way to handle this situation? Could we use Cosine Similarity with BM25 to restrict the similarity scores in the range [-1,1]?
Thanks