Optimal control problems for sequential decision making under uncertainty typically seek control laws offline, assuming the underlying dynamic models are available. When those models are unavailable or only partially known, adaptive control approaches are employed online instead. Online RL methods can therefore be viewed as a form of adaptive optimal control, in the sense that (sub)optimal control laws are obtained online from real-time measurements without a model. Stability analysis of optimal and adaptive controllers is crucial in safety-critical and potentially hazardous applications. Informally, stability requires containment: for bounded initial conditions, the system state remains bounded for all future times. When a learned policy is interconnected with a nonlinear dynamical system, it has been shown that regulating the input-output gradients of the policy yields robust stability guarantees, certified by solving a semidefinite programming (SDP) feasibility problem. This approach certifies a large set of stabilizing controllers by exploiting problem-specific structure.
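As a concrete illustration of such a gradient-regulation certificate, the sketch below bounds the input-output gain (Lipschitz constant) of a one-hidden-layer ReLU policy via an SDP, in the spirit of LipSDP (Fazlyab et al., 2019). This is a minimal, assumption-laden example rather than the certification method of the cited survey: the policy dimensions, the random weights W0 and W1, and the cvxpy formulation are all illustrative.

```python
# Minimal sketch: certify an upper bound on the Lipschitz constant of a
# one-hidden-layer ReLU policy u = W1 * relu(W0 * x) by solving an SDP.
# Weights and dimensions are hypothetical placeholders.
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)
n_in, n_hid = 4, 16                     # illustrative state / hidden sizes
n_out = 2                               # illustrative control dimension
W0 = rng.standard_normal((n_hid, n_in)) / np.sqrt(n_in)
W1 = rng.standard_normal((n_out, n_hid)) / np.sqrt(n_hid)

# Decision variables: rho = L^2 and a diagonal multiplier T >= 0 arising
# from the incremental quadratic constraint on ReLU (slope in [0, 1]).
rho = cp.Variable(nonneg=True)
t = cp.Variable(n_hid, nonneg=True)
T = cp.diag(t)

# LipSDP-style condition (activation slope bounds alpha=0, beta=1):
# M <= 0 certifies ||f(x) - f(y)|| <= sqrt(rho) * ||x - y|| for all x, y.
M = cp.bmat([
    [-rho * np.eye(n_in), W0.T @ T],
    [T @ W0, W1.T @ W1 - 2 * T],
])

prob = cp.Problem(cp.Minimize(rho), [M << 0])
prob.solve(solver=cp.SCS)
print("certified Lipschitz bound:", np.sqrt(rho.value))
print("naive product-of-norms bound:",
      np.linalg.norm(W0, 2) * np.linalg.norm(W1, 2))
```

The diagonal multiplier T encodes the incremental quadratic constraint satisfied by the slope-restricted activation, so feasibility of M ⪯ 0 certifies that the policy's input-output gain is at most √ρ; such SDP bounds are typically much tighter than the naive product of layer norms, which is what allows a larger set of controllers to be certified as stabilizing.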
For a detailed treatment of stability guarantees for reinforcement-learning-based controllers, see Busoniu, L., de Bruin, T., Tolić, D., Kober, J., & Palunko, I. (2018). Reinforcement learning for control: Performance, stability, and deep approximators. Annual Reviews in Control. https://doi.org/10.1016/j.arcontrol.2018.09.005.