To reduce intra/inter data center traffic, having knowledge of inter-VM communication traffic can be very useful. Using inter-VM traffic information, we can effectively place related VMs on the same server/cluster/rack to reduce cloud traffic.
Question is: Is it possible to predict inter-VMs traffic of a task that its VMs should communicate to each other (e.g., HPC) or not? If yes, I am looking for a reference. Because some papers assumed having an inter-VM traffic matrix without mentioning how they get this matrix. I would like to stress this point that I talk about processing-intensive applications, not web services. Thanks.