I want to know how can we analyse whether a cloud task is dependent or independent. what is the benchmark for determining whether a task is dependent or independent in nature?
Dear Sir, Thank you for your valuable suggestion. As far as first part is concerned, tasks are executed based on given order and the tasks priorities are known in advance. so , i am not concerned about it. However, as you have written in the second part, is relevant to my topic. Data(task A) intersection Data (task B)? i got this, if intersection of two entities is Null, the tasks are independent otherwise dependent in nature. But what i actually need is : on what criteria do we calculate the intersection of two tasks? Please elaborate it in depth. Thank you sir
Most cloud deployments have tools which show dependency graphs between tasks and their impact on storage and network resources. Cloudera is a leader in Hadoop space for providing visualizations of resources and file system, which solve quite what you are looking for.
In-order to understand the impact of dependency of tasks and how they could help optimize resource, I would link you to the following:
1. LDM: Lineage-aware Data Management in multi-tier storage systems
Preprint LDM: Lineage-aware Data Management in multi-tier storage systems