As a computer science student, I don't know much about statistical testing. Recently, however, many papers have reported statistical validation of their results. In machine learning-based prediction of effector proteins, how do you apply statistical tests to validate the results?