Is there any simple optimal policy for an i.i.d. evolving MDP?

Joachim Arts

If the state distribution in each period is i.i.d. regardless of the decision taken, then any policy is optimal. This is probably not the answer you are looking for. In most MDP models, the decision you take affects the transition probabilities to future states. Perhaps you can give more details about your problem, and then I can give a more helpful answer.

Huasen Wu

Thank you Joachim! I am focusing on the resource-constrained case, where the problem becomes more difficult. Consider a $T$-horizon MDP in which the states $X_t$ are i.i.d. over time. In each time slot, I choose an action from {transmit, wait}. If I transmit at $t$, I receive $X_t$ units of utility and pay 1 unit of energy. But I have only $P$ units of energy ($P < T$). What is the optimal policy (a simple policy that can be expressed in closed form, rather than by calculating the cost-to-go function), or a near-optimal policy? Thanks!

Håkan Warnquist

It is not i.i.d. because the amount of energy left, P, is part of the state, and your choice of action influences the value of P. If the domain of X_t is enumerable, then the problem can be solved using a standard dynamic programming solution in time |X_t|*P*T, where |X_t| is the number of values X_t can take.
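The dynamic programming solution mentioned above could be sketched roughly as follows. This is only an illustration under stated assumptions: X_t takes finitely many values with known probabilities, and the names `solve_dp`, `values`, `probs`, `V`, and `threshold` are hypothetical, not from the thread. The augmented state is (t, p), with p the remaining energy, and the recursion is V[t][p] = E[max(X + V[t+1][p-1], V[t+1][p])].

```python
def solve_dp(values, probs, T, P):
    """Backward induction for the transmit/wait problem (illustrative sketch).

    Assumes X_t is i.i.d. on the finite set `values` with probabilities `probs`.
    V[t][p] is the expected total utility from slot t onward with p energy
    units left, evaluated before X_t is observed.
    threshold[t][p] is the smallest utility worth transmitting for at (t, p).
    """
    # Terminal row V[T][.] and the zero-energy column V[.][0] stay at 0.
    V = [[0.0] * (P + 1) for _ in range(T + 1)]
    # With no energy left, transmitting is impossible: threshold is infinite.
    threshold = [[float("inf")] * (P + 1) for _ in range(T)]

    for t in range(T - 1, -1, -1):
        for p in range(1, P + 1):
            keep = V[t + 1][p]       # continuation value if we wait
            spend = V[t + 1][p - 1]  # continuation value if we transmit
            threshold[t][p] = keep - spend
            # Transmit exactly when x + spend >= keep.
            V[t][p] = sum(q * max(x + spend, keep)
                          for x, q in zip(values, probs))
    return V, threshold


# Small example: X uniform on {0, 1}, two slots, one unit of energy.
V, threshold = solve_dp([0, 1], [0.5, 0.5], T=2, P=1)
```

The two nested loops over t and p, together with the sum over the support of X, give the |X_t|*P*T running time. The optimal rule at each (t, p) is itself a threshold rule: transmit when X_t is at least V[t+1][p] - V[t+1][p-1].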

Thanks Hakan. Yes, one can view this problem from that perspective. However, I am wondering whether there are simple policies that achieve good performance when $T$ and $P$ go to infinity in proportion. I think the policy that uses the average resource constraint should be asymptotically optimal (as $T$ and $P$ go to infinity), as in E. Altman, Asymptotic properties of constrained Markov decision processes, 1993. I am not sure whether there are other policies with even better convergence performance.

Also, this is in fact an optimal multiple stopping problem. Are you aware of any policies for such a problem?

Unfortunately, I have little experience with stopping problems other than the basic one, so I am a bit out of my comfort zone. But for your problem, with T and P going to infinity proportionally, perhaps you can do something with the quantile function of X. One solution could be a policy where you transmit when X>Q(1-P/T), where Q is the quantile function of X. Then you will on average consume P/T units of energy per time slot, and the expected utility per transmission is E(X|X>Q(1-P/T)). However, this is just quick guesswork. I have no idea if it is optimal.
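The quantile-threshold idea above might be sketched as follows. This is a rough illustration, not a tested recommendation: it assumes a finite distribution for X, and the helper names `quantile`, `threshold_policy_stats`, and `simulate` are hypothetical. For a distribution with atoms, the probability of exceeding the threshold can be smaller than P/T, so the budget may not be fully used.

```python
import random


def quantile(values, probs, u):
    """Smallest x with P(X <= x) >= u, for a finite distribution."""
    cum = 0.0
    for x, q in sorted(zip(values, probs)):
        cum += q
        if cum >= u - 1e-12:  # small tolerance for floating-point sums
            return x
    return max(values)


def threshold_policy_stats(values, probs, T, P):
    """Threshold Q(1 - P/T), transmit probability, and E(X | X > threshold)."""
    thr = quantile(values, probs, 1.0 - P / T)
    p_tx = sum(q for x, q in zip(values, probs) if x > thr)
    if p_tx == 0:
        return thr, 0.0, 0.0
    mean_tx = sum(x * q for x, q in zip(values, probs) if x > thr) / p_tx
    return thr, p_tx, mean_tx


def simulate(values, probs, T, P, seed=0):
    """Run the policy once; stop transmitting when the energy runs out."""
    rng = random.Random(seed)
    thr = quantile(values, probs, 1.0 - P / T)
    energy, total = P, 0.0
    for x in rng.choices(values, weights=probs, k=T):
        if energy > 0 and x > thr:
            total += x
            energy -= 1
    return total, P - energy  # (utility collected, energy used)


# Example: X uniform on {1, 2, 3, 4}, with P/T = 1/4.
thr, p_tx, mean_tx = threshold_policy_stats([1, 2, 3, 4], [0.25] * 4, T=4, P=1)
total, used = simulate([1, 2, 3, 4], [0.25] * 4, T=400, P=100)
```

Under this sketch the expected total utility is roughly P * E(X | X > Q(1-P/T)) when the budget is not exhausted early. Comparing the simulated total against the dynamic programming value for the same instance is one way to gauge how close the static threshold comes to optimal.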