01 January 2018 3 7K Report

In semantic similarity a benchmark data set is used to measure the effectiveness of semantic similarity measures. SemEval-2017 and earlier workshops uses the Pearson correlation to judge the effectiveness of proposed semantic similarity methods. One major problem for Pearson correlation is the outliers that influence the slope of the regression line, and consequently on the value of the correlation coefficient. Moreover, The Pearson correlation does not have the ability to show wither a similarity method is good for non-similar, average-similar or similar text fragments.

Is it time to find an alternative way?

if yes, what do you suggest?

More Issa Atoum's questions See All
Similar questions and discussions