What is the state of the art in integrating RO or SO with DRL? Do you think this combination will become a powerful technique in the future? Some more detailed questions follow.
What are the main challenges in this topic?
What are the typical applications for this method?
Can you recommend any inspiring references (original or classic) that could be shared to promote the development of this topic?