Regarding AI based on Deep Reinforcement Learning, how do we decide what rewards or punishments to give a machine?

05 March 2024

This question delves into the challenge of designing reward systems that accurately reflect the desired outcomes. It's important because the way we set up these rewards can significantly influence what the machine learns to do.
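To make the concern concrete, here is a minimal sketch of how two reward designs for the same task can lead to opposite learned behaviour. Everything in it is an illustrative assumption rather than something taken from a particular system: the five-cell corridor, the discount factor, the reward values, and the helper names `step`, `greedy_policy`, and `reaches_goal` are all hypothetical. It uses plain value iteration on a tiny tabular problem instead of a deep network, because the effect of the reward signal is the same in kind and easier to inspect.

```python
# Toy corridor: cells 0..4, the agent starts in cell 2, cell 4 is the goal.
# All numbers below are assumptions chosen for illustration only.
GOAL = 4            # terminal state at the right end of the corridor
GAMMA = 0.95        # discount factor
ACTIONS = (-1, +1)  # move left, move right


def step(state, action):
    """Deterministic move; walls clamp the agent inside the corridor."""
    return min(max(state + action, 0), GOAL)


def greedy_policy(step_reward, goal_reward, sweeps=500):
    """Run value iteration, then return the greedy action per non-terminal state."""
    values = [0.0] * (GOAL + 1)  # values[GOAL] stays 0: the episode ends there

    def q(state, action):
        nxt = step(state, action)
        reward = goal_reward if nxt == GOAL else step_reward
        return reward + GAMMA * values[nxt]

    for _ in range(sweeps):
        for s in range(GOAL):
            values[s] = max(q(s, a) for a in ACTIONS)
    return {s: max(ACTIONS, key=lambda a: q(s, a)) for s in range(GOAL)}


def reaches_goal(policy, start=2, max_steps=50):
    """Follow the policy from `start` and report whether it ever hits the goal."""
    state = start
    for _ in range(max_steps):
        if state == GOAL:
            return True
        state = step(state, policy[state])
    return state == GOAL


# Design A: each step costs 1, so finishing quickly is the best strategy.
print(reaches_goal(greedy_policy(step_reward=-1.0, goal_reward=10.0)))  # True
# Design B: each step pays 1, so wandering forever beats the one-off goal bonus.
print(reaches_goal(greedy_policy(step_reward=+1.0, goal_reward=10.0)))  # False
```

Under design B the discounted value of collecting +1 forever is 1 / (1 - 0.95) = 20, which exceeds the goal bonus of 10, so the agent that maximises reward never finishes the task. The reward was meant to encourage progress, but what it actually pays for is staying alive.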

