Hello all,

What is the optimal number of threads per block to choose for a CUDA kernel? Is there a rule of thumb to follow before running experiments?

Thank you very much
