I have calculated pairwise similarity scores for 302 textual stories. How can I get a more representative similarity score? My concern is that the similarity matrix only tells me, in each column, how similar one story is to each of the other stories.
What if I want a single overall similarity measure for each story in my corpus?
Let's suppose we have 3 stories:
A is 0.53 similar to B
A is 0.73 similar to C
I want to find the overall similarity of each of A, B and C, like below:
A = 0.69
B = 0.53
C = 0.44
In this way, I would have one score per story that I could use as a variable in SPSS or in a machine learning model to find important correlations or make predictions.
Note: The data is fictitious.
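
To make the goal more concrete, here is a minimal sketch of one possible reading of "overall similarity": the mean of a story's similarities to all the other stories, leaving out its similarity to itself. This definition is only my assumption, not an established measure. The function name `overall_similarity` and the B to C value of 0.35 are also made up for illustration, since I only listed the A to B and A to C scores above.

```python
import numpy as np


def overall_similarity(sim):
    """Return one score per item: its mean similarity to all other items.

    Assumes `sim` is a square pairwise similarity matrix. The diagonal
    (self-similarity) is excluded from the mean.
    """
    sim = np.asarray(sim, dtype=float)
    if sim.ndim != 2 or sim.shape[0] != sim.shape[1] or sim.shape[0] < 2:
        raise ValueError("expected a square matrix with at least 2 rows")
    n = sim.shape[0]
    # Row sum minus the diagonal entry, averaged over the n - 1 other items.
    off_diagonal_sum = sim.sum(axis=1) - np.diag(sim)
    return off_diagonal_sum / (n - 1)


labels = ["A", "B", "C"]
sim = np.array([
    [1.00, 0.53, 0.73],  # A vs A, B, C
    [0.53, 1.00, 0.35],  # B vs C (0.35) is a hypothetical value
    [0.73, 0.35, 1.00],
])

scores = overall_similarity(sim)
for label, score in zip(labels, scores):
    print(f"{label} = {score:.2f}")
```

With the made-up B to C value this gives A = 0.63, B = 0.44 and C = 0.54, which does not match the fictitious numbers above; it only shows the shape of the result I am after. For the real corpus the same function would take the 302 x 302 matrix and return 302 scores, one per story.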