You can choose how many cores are used. You can also pin each process to a single core, which may further improve parallel performance. Run the code outside the GUI as well, and it should run faster still.
Nested parfor loops are not allowed. Since your requirements seem large, it may be worth your time to convert your code to another programming language that supports OpenMP.
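For a rough idea of what the OpenMP route looks like, here is a minimal sketch in C of a two-level loop parallelized as a single unit, which is roughly what a nested parfor would have been asked to do. The function name `fill_products` and the workload (each cell gets `i * j`) are made up for illustration and are not taken from your code.

```c
#include <stddef.h>

/* Hypothetical workload: out[i * cols + j] = i * j for every cell.
 * `out` must point to at least rows * cols doubles. */
void fill_products(double *out, int rows, int cols)
{
    /* collapse(2) merges the two loops into one iteration space and
     * divides it among the threads. If the compiler is run without
     * OpenMP support, the pragma is ignored and the loops run serially. */
    #pragma omp parallel for collapse(2)
    for (int i = 0; i < rows; ++i) {
        for (int j = 0; j < cols; ++j) {
            out[(size_t)i * cols + j] = (double)i * (double)j;
        }
    }
}
```

With GCC this would be compiled with `-fopenmp`. The thread count can then be set through the `OMP_NUM_THREADS` environment variable, and `OMP_PROC_BIND=true` asks the runtime to keep threads bound to their cores. Whether binding helps depends on the machine and the workload.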