I have calculated pairwise similarity scores for 302 textual stories. How can I turn them into more representative similarity scores? My concern is that the similarity matrix only gives the similarity of each story to every other story, one column per story, rather than a single value per story.

What if I want to find the overall similarity measures for each story in my corpus?

Let's suppose we have 3 stories:

A is 0.53 similar to B

A is 0.73 similar to C

I want to find the overall similarity of each of A, B and C, like below:

A = 0.69

B = 0.53

C = 0.44

That way, I would be able to use the scores in SPSS or an ML model to find important correlations or make predictions.

Note: The data is fictitious.
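
To make the idea concrete, here is a minimal sketch of one possible way to get a single score per story: take the mean of each story's similarities to all the other stories, skipping the self-similarity on the diagonal. This is only an assumption about what "overall similarity" could mean, not an established method. The function name `overall_similarity` is hypothetical, and the B-C value of 0.35 is made up purely to complete the matrix, since the example above only gives A-B and A-C.

```python
# Sketch: one "overall similarity" score per story, taken as the mean of its
# similarities to every other story (the diagonal self-similarity is skipped).

def overall_similarity(names, sim):
    """Return {story: mean similarity to all other stories}."""
    n = len(names)
    scores = {}
    for i, name in enumerate(names):
        # Collect row i without the diagonal entry sim[i][i].
        others = [sim[i][j] for j in range(n) if j != i]
        scores[name] = sum(others) / len(others)
    return scores


names = ["A", "B", "C"]
# Symmetric similarity matrix. A-B and A-C come from the example above;
# B-C = 0.35 is a made-up value used only to complete the matrix.
sim = [
    [1.00, 0.53, 0.73],
    [0.53, 1.00, 0.35],
    [0.73, 0.35, 1.00],
]

scores = overall_similarity(names, sim)
for name, score in scores.items():
    print(f"{name} = {score:.2f}")
```

With 302 stories the same function would return 302 values, one per story, which could then be exported as a single column for SPSS or used as a feature in an ML model. Whether the mean is the right summary (rather than, say, the median or the maximum) depends on what the score is meant to capture.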
