Hello,
How to minimize a function as an objective function with decision variables, in DQN (deep Q-learning network)?
please let me know if you have any references or suggestions about that.
Thanks
Dear Hamed Hosseiny
Have a look with these articles:
1. https://towardsdatascience.com/double-deep-q-networks-905dd8325412
2. https://www.eventideib.polimi.it/en/events/reducing-q-function-estimation-error-in-dqn-algorithm/
3. https://markelsanz14.medium.com/introduction-to-reinforcement-learning-part-3-q-learning-with-neural-networks-algorithm-dqn-1e22ee928ecd
4. https://www.analyticsvidhya.com/blog/2019/04/introduction-deep-q-learning-python/
5. https://pytorch.org/tutorials/intermediate/reinforcement_q_learning.html
Kind Regards
Qamar Ul Islam
I found these tutorials useful.
https://www.mlq.ai/deep-reinforcement-learning-q-learning/
https://towardsdatascience.com/tutorial-double-deep-q-learning-with-dueling-network-architectures-4c1b3fb7f756
Dear Hamed Hosseiny,
I suggest you to see links on topic.
https://www.eventideib.polimi.it/events/reducing-q-function-estimation-error-in-dqn-algorithm/
https://openreview.net/forum?id=dN_iVr6iNuU
https://arxiv.org/pdf/1611.01929.pdf
Preprint Reducing Overestimation Bias by Increasing Representation Di...
Article Constrained Deep Q-Learning Gradually Approaching Ordinary Q-Learning
Best regards
How do the thermal conductivity, specific heat capacity, plastic strain, critical compressive displacement, and thermal expansion of both hot-rolled steel and concrete change after cooling based...
15 February 2021 1,385 5 View
In lattice modeling of RC structures, what is the mathematical concept behind choosing the cross-sectional areas for the simulation of, for instance, beams? Besides, seemingly, simulating the...
26 January 2021 2,638 2 View
Hi dears If I want to do supervised multi-classes classification of specific domain such as history, using one of the deep learning techniques. Which is the best word embeddings method should I...
22 January 2021 1,290 5 View
In the museums, the sensitive organic materials need to be displayed under 50 lux or less and aways from any UV or IR. LEDs still have a little amount of UV. What is the better option? Is it...
20 January 2021 5,965 3 View
Both tensile and compressive strengths of concrete increase with the decrease in temperature. With this in mind and considering that fracture energy is in direct correlation with the so-called...
18 January 2021 8,778 4 View
Hello, Is it useful in texts classification to use the keywords expansion method for "long-text examples" in the training process for deep learning classifiers? Will it increase the efficiency of...
16 January 2021 9,291 3 View
Given the degradation of mechanical properties in concrete under elevated temperature namely, tensile strength, modulus of elasticity, compressive strength, etc., what is the justification behind...
15 January 2021 3,651 4 View
Hello, I read some papers they used Pre-processing steps with text that will classify based on Sentiment Analysis. My question is, can I use text Pre-processing techniques in the sentiment...
11 January 2021 5,932 7 View
Hello, If I increased the number of keywords (Features) of examples in training set pertaining to a particular class regarding to Sentiment Analysis Classification , will increase the efficiency...
11 January 2021 5,299 4 View
Hello, I've read many review papers about which is the best classifier of Deep Learning in "text classification", some researchers prove that LSTM is the best, Some say CNN is the best, and some...
10 January 2021 909 12 View
What Characteristics makes CNN work better?
03 March 2021 1,458 4 View
i would to know some of the research gaps in the artificial intelligence field in most african countries.
03 March 2021 6,145 3 View
I have selected brain tumor images ...but now found that already lots of research done n this topic.
03 March 2021 5,774 3 View
dear community, my model is based feature extraction from non stationary signals using discrete Wavelet Transform and then using statistical features then machine learning classifiers in order to...
03 March 2021 6,994 5 View
I feel that the practice in teacher education in my country is below the expected performance level due to very poor management system. Hope I will learn something from your experiences.
02 March 2021 1,516 4 View
NFL theorem is valid for algorithms training in fixed training set. However, the general characteristic of algorithms in expanded or open dataset has not been proved yet. Could you show your...
01 March 2021 1,189 3 View
L1 and L2 regularization
28 February 2021 4,187 3 View
The following code (see 1st 2 images attached) is used to produce PID controller values that are designed to control the system (G). The code finds the PID controller values (noted as k) by using...
28 February 2021 6,560 14 View
For instance, the model is trained with A,B,C,D,E features to predict F target. Then, I want to make prediction on a new data, but with only A,C,D,E is known without doing missing value imputation...
28 February 2021 1,556 6 View
I am trying to classify and analyze the results of an SDS-PAGE based array for bacterial detection using machine learning, but I have trouble finding the best way to represent the results with...
27 February 2021 9,176 3 View