What type of C++ code are you looking for (i.e., what libraries) for the implementation, and what architecture (e.g., Shared Memory Machines)? So for libraries I mean things like MPI, OpenMP, CUDA, OpenCL. Any others if not a part of this list? These details may help researchers help you with your problem.