My C++ app is already parallelized on Linux using MPI. The work is split across N CPUs, but some of the data is duplicated in every process, and for technical reasons this data cannot easily be split across the MPI processes. How can I use MPI_PUT and MPI_GET to reduce the total memory consumption of my program? Can you help me with a simple example?