In evaluating recommender system with binary rating data, which other evaluation metrics can one use aside F-1 (Precision and Recall) measure for better accuracy?

Similar questions and discussions