How can we determine the best reward function (RF) from a list of candidate RFs?
Suppose I have two reward functions. Which one will give the best performance?
1st RF
$\text{Loss} = \sum_{i=0}^{N} (A_i - B_i)$
2nd RF
average loss
$\text{Loss} = \frac{1}{N} \sum_{i=0}^{N} (A_i - B_i)$
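
To make the two definitions concrete, here is a minimal Python sketch. It assumes A and B are equal-length lists of numbers and that N is the number of terms in the sum. The function names and the example values are hypothetical, chosen only for illustration.

```python
def sum_loss(a, b):
    """1st RF: sum of the element-wise differences A_i - B_i."""
    return sum(x - y for x, y in zip(a, b))


def average_loss(a, b):
    """2nd RF: the same sum divided by the number of terms (assumed to be N)."""
    return sum_loss(a, b) / len(a)


# Hypothetical example values, not taken from any real experiment.
A = [3.0, 5.0, 2.0, 8.0]
B = [1.0, 4.0, 2.0, 5.0]

total = sum_loss(A, B)
mean = average_loss(A, B)
print("sum loss:", total)
print("average loss:", mean)
```

Under these assumptions the two values differ only by the constant factor 1/N, so for a fixed N they rank any pair of candidates in the same order.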
So, from the equations, how can we tell which RF is best?