In reinforcement learning, If the range of observation is ±inf, how to normalize the observation?

More Xiaoyi Hu's questions See All

What is the principle/mechanism behind aging of carbon (graphite) containing refractory mix for isostatically pressed refractories?

The resin bonded carbon containing refractories are aged before use. How time of aging is determined? And mechanism behind aging.

23 July 2024 3,205 0 View

When I am trying to distill triethyl amine while drying it over calcium hydride. How do I design the setup to let hydrogen escape without losing NEt3?

I am new to research. The boiling point of triethyl amine (NEt3) is 89 degrees centigrade. I am skeptical that when I will let the hydrogen escape which is getting generated in situ, I will also...

21 July 2024 7,284 4 View

Why my gel electrophoresis have shadow bands? Please see the attached picture for the gel electrophoresis ?

Sometimes I see the shadow like bands and its not true band. I want to know that what's the reason for it. I am using 2% gel for running genotyping samples I have uploaded the gel picture in both...

19 July 2024 148 6 View

What is the future scope of acoustic emission?

17 July 2024 1,510 1 View

Is the protecting group boc of the amino group stable at 37°C?

I have a small molecule reagent with a boc-protected amino group. Now the reaction needs to be reacted at 37°C for 30 h. Is this protection group stable?

12 July 2024 3,745 2 View

Why can't I detect the plasmon resonance angle with water?

I am trying to measure the plasmon resonance angle of gold film and pure water using the Kretschmann configuration and a 633nm laser. Without flowing water over the gold, I can detect a clear...

10 July 2024 4,719 3 View

How can we generate topology file for water and helium system?

Hello! I am facing a problem, I tried using pdb2gmx to generate topology file but it shows error HEA residue not found then I tried copying the forcefield file and edited accordingly but again...

24 June 2024 1,242 0 View

How to eliminate imaginary phonon frequency in phonon dispersion of Coved Graphene Nanoribbons?

Dear all, I want to calculate the phonon dispersion of 4CNR-1-0 (structure is in Fig. 2 (b) in this article Designing coved graphene nanoribbons with charge carrier mob... ) and then calculate...

17 June 2024 6,911 2 View

Mutagenesis question, 9 mismatch pcr cycle?

Dear all, I am currently trying to perform mutagenesis to change 9 mismatch. Is it possible to do it in one round of PCR? if so, are there any protocol I can learn from?

16 June 2024 875 2 View

Best Tools for Wrist PPG Signal Analysis: Your Recommendations?

I'm conducting research on photoplethysmography (PPG) signals obtained from smartwatches worn on the wrist. The main goal is to analyze the PPG waveform and extract key fiducial points and...

13 June 2024 3,286 0 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

What are examples of AI for good projects a teacher can assign to students?

So I am organizing an AI seminar. What are possible AI projects in the AI for good spirit? something the students can do and have an impact?

08 August 2024 9,437 4 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How to design human-centered classroom in the age of A.I.?

08 August 2024 347 5 View

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

05 August 2024 8,836 2 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

What's the role of IT & AI in Telecommunication Industry?

05 August 2024 8,264 3 View

Can usage of AI tools like chat GPT in research work is recommendable ?

AI tools like ChatGPT can enhance research work significantly when used responsibly and in conjunction with thorough human oversight.

05 August 2024 1,842 3 View

Stam Nicolis

They can’t be infinite, because that’s ambiguous. So they should be mapped to finite values.

Oliver Wallscheid

If you receive an infinite observation, I would guess that something went wrong within the learning process or on the simulation model part (environment), e.g., a bug or the learning process diverges. In this respect, I would recommend to go troubleshooting here and first investigate the reason why you get an inf return value.

Joachim Pimiskern

Where does +-inf come from? Maybe the creator of the data source misused the IEEE representation to express a special situation. So you could double or triple your input neurons, set the original value to 0 and indicate that the value was + or minus infinity by setting the additional input neurons to 1.

Regards,

Joachim

Frank Cheng

The values of observation come from the environment. Please check your environment model or limit such value at the observation input of agent.

Syed Muhammad Talha Zaidi

please check your network input values, whryer they are normalized or not. Normalizing input values can sometimes help to limit the output response. Actions cant be inf.

Xiaoyi Hu

This is my fault. I asked a wrong question. I mean "if the range of observations and actions are ±inf, how can I normalize them?", rather than the values of observations and actions are ±inf. @Syed Muhammad Talha Zaidi @Frank Cheng @Joachim Pimiskern @Oliver Wallscheid @Stam Nicolis

As an engineer I would doubt that there are no technical/natural limits regarding your observations/actions in your application. Any real-world system has its natural boundaries which can be used as a baseline for normalization. Maybe you can share some more details on your considered application for a mutual discussion on possible state/action limitations.

Raoul Raftopoulos

In any reinforcement learning environment, you most likely know the number of actions that the agent can take. It cannot be infinite. First of all, understand if the agent will choose each action(s) among a Discrete or Continuos set of value (check the gym and spaces library, Box, Discrete and MultiDiscrete action spaces).

On the other hand, I can imagine normalizing observations can be a little trickier.

One thing you could try to do is to normalize the observation based on the maximum observed value for each feature. That's what worked for me.

Chuck A Arize

https://towardsdatascience.com/ultimate-guide-for-ai-game-creation-part-2-training-e252108dfbd1