Reinforcement learning neural network control basis function?

More Wei Xin's questions See All

How to make overlapping p-h diagram in EES?

I want to draw ph diagram using Engineering equation solver EES for refrigerant R134a and R1234fy and some other. I don't know how to draw the multiple ph diagram on same dome or overlapping to...

30 July 2024 647 7 View

What binder would be the best choice to modify glassy carbon electrode with a 2D Material like Graphene or MXene?

I am trying to drop cast 2D Materials on Glassy Carbon Electrode for Cyclic voltametry and EIS purpose. What would be best binders to use?

23 July 2024 5,776 2 View

How to display grain boundaries in RVE model?

Many literatures have shown that the RVE model shows the grain boundaries between different grains. How can this be achieved using DAMASK+paraview?

21 July 2024 9,224 1 View

Abnormal data in in-vitro drug release study?

I'm doing in-vitro drug release study using dialysis bag method and collecting samples from the external medium at several time points, subjecting to UV-Vis spectrophotometry. But my absorbance...

14 July 2024 6,829 2 View

Are my cells contaminated with mycoplasma?

I suspect my cells are contaminated with mycoplasma. I fixed the cells with 4% PFA and stained them with DAPI. Below is the image I obtained. I don't observe the typical small, rounded DAPI foci...

11 July 2024 7,786 3 View

Black dots like contamination in my cell cultures. What are they?

There are black dots that seem like contamination in my corneal endothelial cell cultures. Some of the cells appear unhealthy and are enlarged. The black dots are more prevalent around these...

11 July 2024 4,829 0 View

What is the most simple language and character in the world?

Dear colleagues, As well known, we can deliver messages in different languages and characters, including Chinese, English, Latin, binary and decimal codes (if with a converter) and many other...

08 July 2024 5,422 13 View

Trouble shooting on mediation with survival regression and linear regression?

I did linear regression of X (independent variable) to M (Mediator) then I used survival regression to fit X to Y (dependent variable) With these questions: a. HOW to correctly do a mediation...

17 June 2024 1,478 6 View

Potato handbook crop of the future--PDF?

Whoever has an electronic version of this book, sell it to me，《potato handbook crop of the future》Thanks

15 June 2024 4,404 0 View

Why is the blue and white spot screen negative for all white spots?

The protein was expressed using the insect cell system, and the recombinant plasmid was successfully constructed as verified by sequencing, and the recombinant plasmid was transformed into DH10...

13 June 2024 2,590 4 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

Need help with my research project on open source SIEM and machine learning?

Hello everyone, I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from...

04 August 2024 3,196 2 View

Swimming/space travel depends on the proprioceptive muscle spindles?

When the entire neocortex is ablated in rodents, although they are still able to swim, all the limbs move continuously and asynchronously (Vanderwolf 2006; Vanderwolf et al. 1978). Normal animals...

03 August 2024 835 3 View

What are the limitations and challenges of using machine learning for predicting concrete compressive strength in practical applications?

Machine learning (ML) has shown great potential in predicting the compressive strength of concrete, an important property for structural engineering. However, its practical application comes with...

03 August 2024 2,546 2 View

Some new emerging problems on application of RL for scheduling in IoT networks?

I have seen plenty of existing works on applied Reinforcement Learning (RL) policies for optimized scheduling in IoT networks including Q-learning, DQNs, and the newer ones including PPO for...

01 August 2024 8,754 2 View

How to Compress Information Neurally?

Samuel Morse, the inventor of the Morse Code, understood that certain letters in the English language occurred more frequently than others (Gallistel and King 2010). To deal with this, Morse used...

01 August 2024 4,456 2 View

Shafagat Mahmudova

Dear Wei Xin ,

In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data in a more informative form that facilitates and improves subsequent steps. As a “good” set of basis functions result in better solutions and defining such functions becomes a challenge with increasing problem complexity, it is beneficial to be able to generate them automatically. In this paper, we propose a new approach based on Bellman residual for constructing basis functions using cascadecorrelation learning architecture. We show how this approach can be applied to Least Squares Policy Iteration algorithm in order to obtain a better approximation of the value function, and consequently improve the performance of the resulting policies. We also present the effectiveness of the method empirically on some benchmark problems.

https://hal.inria.fr/hal-00826054/document

Regards,

Shafagat

Mohamed-Mourad Lafifi

Hi,

It is important to know that the approximation of the functions is done either by polynomials, Fourier series (linear combinations)... or by neural networks (non linear function) using the gradient and in particular NNs by the back propagation technique. Why the even order is to make appear during the development of the gradient or the retro propagation the two terms related to the parameters/sub functions.

Also please take look at the links.

https://tel.archives-ouvertes.fr/tel-00003985/document

Article Optimal Reinforcement Learning-Based Control Algorithm for a...

Article Reinforcement learning-based optimised control for a class o...

Best regards

Abbas Thajeel Rhaif Alsahlanee

It can make the control algorithm significantly simple to compare with the existing optimal control methods to achieve the optimized control.

Wei Xin

Dear Dr. Shafagat Mahmudova

Thank you for your brilliant remarks. I'm very interested in your new approach based on Bellman residual for constructing basis functions！Thanks again for your reply！

Dear Dr. Mohamed-Mourad Lafifi Thank you very much for your answers and your references.

Best wishs!

Hi Dr. Abbas Thajeel Rhaif Alsahlanee ,Thank you very much for your answers!

Omar Laith

https://libraryapp.uomustansiriyah.edu.iq/ebooks.php?a=view&recid=2767