This question explores what happens when rewards are not carefully aligned with the goals we actually want the AI to achieve, which can lead to unexpected or undesirable behaviours.
