Hello dear all,

In CUDA, I launch a child kernel from a parent kernel, and I want all threads of the child kernel to finish before the parent kernel continues with its remaining operations. How can I do this? The child kernel has many threads in many blocks, and I use a 1D grid of 1D blocks.

Thank you very much
