I'm working on an time-discrete Dec-MDP where my states are defined by a network's topology at each moment t. Basically, the nodes are agents in 3D space and the edges are links between them (if they are close enough to communicate) so the topology is dynamic. But, the environment we're modeling with this Dec-MDP is ruled by deterministic differential equations: the agents move following preset trajectories and the actions taken by the agents (taking shots of some targets if they are in their trajectories, and communicating with other agents) doesn't affect the state of the environment (network's topology). The reward however depends on the task execution speed and quality, the latency of communication, the battery charge of the agents, and some other factors. I'm not sure if considering the topology of the network as the state of the environment makes sense, or if it's enough at least to describe the environment. Should I consider other factors?