If I have several methods to aggregate users ratings towards items (average, median, Bayesian estimate, Dirichlet reputation, .. etc) How can we evaluate the accuracy of items reputations? Now I use the rating prediction method with MAE measure, is there any other method?