I want to know, how we can estimate about best reward function (RF) from the list of RFs?

let suppose, I have two reward functions, which one can give the best performance.

1st RF

Loss = sum i=0 to N (Ai-Bi)

2nd RF

average loss

Loss = 1/N [sum i=0 to N (Ai-Bi)]

So from equations, how we can know about best RF?

More Mm Muhammad Muhammad's questions See All
Similar questions and discussions