I have analysed a number of online articles and have their emotion analysis scores along with their sentiment (pos/neg/neutral) and the sentiment value. The fields are: Anger, Disgust, Fear, Sadness and Joy. What I would like to know if it is possible to somehow combine the values of the fields to represent them as one value. I also have comments related to those articles and have their sentiment and emotion scores as well in similar fields.

This would permit me to find a threshold so that I can use it to grade the article and the comments according to that single value. For example, an article might be: Anger=0.100637, Disgust=0.327951, Fear=0.243857, Joy=0.043951 and Sadness=0.364933.

Clearly in this example, the sadness value is the highest followed by disgust, but would it be right to ignore the lower score fields and classify that article as "sadness" related when "disgust" is that close? Would the "sadness" value be representative of that article? And what if another has 0.148988, 0.14043, 0.070271, 0.609123 and 0.103031? Equal parts "anger" and "disgust" but with 60% "Joy"? 

My first thought was to have some sort of mean but that would not be accurate at all as the difference in the different scores will certainly be lost.

Can someone please help me a little with this problem? Can all five values somehow be represented as one? Thank you.

More Silvio Abela's questions See All
Similar questions and discussions