Hi ,
I know there are studies discussed that Online evaluations for systems are biased with several criteria and some of these criteria are the education level of the users and their familiarity and Intelligence on the interaction with the system. These issues cause the limitation for comparing between various algorithms online. Please, I am wondering if you know some papers mentioned these issues?
Thanks
Osman