I can't find many real-world applications of reinforcement learning. I also wonder whether there are clear criteria under which reinforcement learning outperforms optimal control in terms of performance, stability, cost, etc.
See the following articles to explore the applications of reinforcement learning.
1) Kiumarsi, B., Vamvoudakis, K. G., Modares, H., & Lewis, F. L. (2017). Optimal and autonomous control using reinforcement learning: A survey. IEEE Transactions on Neural Networks and Learning Systems, 29(6), 2042-2062.
2) Polydoros, A. S., & Nalpantidis, L. (2017). Survey of model-based reinforcement learning: Applications on robotics. Journal of Intelligent & Robotic Systems, 86(2), 153-173.
3) Mahmud, M., Kaiser, M. S., Hussain, A., & Vassanelli, S. (2018). Applications of deep learning and reinforcement learning to biological data. IEEE Transactions on Neural Networks and Learning Systems, 29(6), 2063-2079.
4) Nian, R., Liu, J., & Huang, B. (2020). A review on reinforcement learning: Introduction and applications in industrial process control. Computers & Chemical Engineering, 106886.
5) Lei, L., Tan, Y., Zheng, K., Liu, S., Zhang, K., & Shen, X. (2020). Deep reinforcement learning for autonomous internet of things: Model, applications and challenges. IEEE Communications Surveys & Tutorials, 22(3), 1722-1760.
6) ...
Also read the book below for a better comparison.
Lewis, F. L., Vrabie, D., & Syrmos, V. L. (2012). Optimal control. John Wiley & Sons.
For tasks that are complex and difficult to formulate with the classic tools of optimal control, we have to turn to automated agents in a very generic programming framework; a simpler way to solve these problems is reinforcement learning (RL). We can conclude that this technique is a complement to optimal control.
Classical methods for the control of dynamical systems require complete and exact knowledge of the system dynamics. However, most real-world dynamical systems are uncertain, and exact models of them are not available. Adaptive control theory provides tools for designing stabilizing controllers that can adapt online to modeling uncertainty, and it has been applied for years in process control, industry, aerospace systems, vehicle systems, and elsewhere. However, classical adaptive control methods are generally far from optimal. Optimal control theory, on the other hand, is a branch of mathematics developed to find the optimal way to control a dynamical system. Reinforcement learning is closely tied, theoretically, to both adaptive control and optimal control: one can see RL methods as a direct approach to the adaptive optimal control of dynamic systems. See the following paper for more details:
R. S. Sutton, A. G. Barto and R. J. Williams, "Reinforcement learning is direct adaptive optimal control," IEEE Control Systems Magazine, vol. 12, no. 2, pp. 19-22, 1992.
This one provides an overview of the reinforcement learning and optimal adaptive control literature and its application to robotics:
Khan, S. G., Herrmann, G., Lewis, F. L., Pipe, T., & Melhuish, C. (2012). Reinforcement learning and optimal adaptive control: An overview and implementation examples. Annual Reviews in Control, 36(1), 42–59.
Well said by the researchers above. The most important point to keep in mind is whether the system dynamics, i.e. an exact model of the system, are unknown. Some examples of RL in this setting can be seen in:
Article: Online optimal and adaptive integral tracking control for va...
Reinforcement learning and optimal control theory rest on the same principles. In many respects they are the same thing, although with some differences.
In both you compute an optimal control (called a policy in the RL literature) for a dynamic system, based on a given objective function (called a reward in the RL literature). In this respect they are the same and share many tools and techniques.
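To make this correspondence concrete, here is a minimal sketch in generic notation (the symbols below are standard textbook notation, not taken from any of the references above):

```latex
% Optimal control: minimize a cost along a trajectory of known dynamics
\min_{u_0, u_1, \dots} \; \sum_{t=0}^{\infty} c(x_t, u_t)
\quad \text{s.t.} \quad x_{t+1} = f(x_t, u_t)

% Reinforcement learning: maximize expected discounted reward,
% with transitions only observed by sampling (no known f)
\max_{\pi} \; \mathbb{E}\!\left[ \sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t) \right],
\quad a_t \sim \pi(\cdot \mid s_t)
```

With r = -c and deterministic dynamics, the two problems coincide; what differs is whether f is available in closed form or only through sampled interaction.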
The main difference between them is that optimal control theory is used when you have a mathematical model of your dynamic system, whereas RL is mostly used when you do not.
To give an example, for controlling rigid-body robots we have practically accurate dynamics models. So we usually use (approximate) nonlinear optimal control theory to derive optimal controls for them; a sketch of this model-based workflow follows.
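As a minimal illustration of the model-based side (a linear-quadratic toy rather than the nonlinear robot case, and the double-integrator matrices are purely illustrative): when A, B, Q, R are known, the optimal feedback gain follows directly from the discrete algebraic Riccati equation.

```python
# Model-based optimal control: discrete-time LQR with a known model.
import numpy as np
from scipy.linalg import solve_discrete_are

A = np.array([[1.0, 0.1],
              [0.0, 1.0]])   # known dynamics: x' = A x + B u (double integrator)
B = np.array([[0.0],
              [0.1]])
Q = np.eye(2)                # state cost
R = np.eye(1)                # control cost

# Solve the discrete algebraic Riccati equation for the value matrix P,
# then form the optimal state-feedback gain K (control law u = -K x).
P = solve_discrete_are(A, B, Q, R)
K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
print("optimal gain K:", K)
```

No data or interaction is needed here: the whole solution is computed offline from the model.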
But for playing a game like chess or controlling a soft robot, we do not have a set of mathematical relations that accurately describes the system's evolution, or the models are too complex. In such cases we use model-free optimal control, i.e. RL, to compute the optimal policy that maximizes the reward; a minimal sketch of that workflow follows.
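Here is the model-free counterpart: tabular Q-learning on a toy chain environment whose dynamics the agent can only sample, never write down. The environment, reward, and hyperparameters are all illustrative choices, not from any reference above.

```python
# Model-free RL: tabular Q-learning on a toy 5-state chain.
import numpy as np

n_states, n_actions = 5, 2   # actions: 0 = move left, 1 = move right

def step(s, a):
    """Sampled transition: we only observe (next state, reward), not a model."""
    s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
    reward = 1.0 if s_next == n_states - 1 else 0.0
    return s_next, reward

rng = np.random.default_rng(0)
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.1, 0.9, 0.1  # step size, discount, exploration rate

for episode in range(500):
    s = 0
    for t in range(50):
        # epsilon-greedy action selection
        a = int(rng.integers(n_actions)) if rng.random() < eps else int(np.argmax(Q[s]))
        s_next, r = step(s, a)
        # Q-learning update: bootstrap from the greedy value of the next state
        Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
        s = s_next

print("greedy policy:", np.argmax(Q, axis=1))  # should prefer moving right
```

The agent recovers the optimal policy purely from sampled transitions; contrast this with the LQR sketch above, where the gain was computed in one shot from the known model.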
The takeaway is:
If you have a (simple enough) mathematical model of the dynamic system you are dealing with, you might use optimal control theory.
But if you do not have an exact dynamic model, or if it is too complex, then you might use RL.
That being said, RL can be used in either case, but it offers no advantage when you do have an accurate dynamic model.
Farshid Asadi Thanks for your answer. I just want to add that model-based RL also exists (e.g., world models), which makes it harder to differentiate RL from optimal control, and also to decide which one to use.