The Scala influence in NetLogo would help with such a version. For now, all I've seen is distributed BehaviorSpace runs (parameter sweeping on one model). A Python example:
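A minimal sketch of that kind of sweep, assuming NetLogo's headless launcher (`netlogo-headless.sh`) is on your PATH and that the model file already defines one BehaviorSpace experiment per slice of the sweep. The model name, experiment names, output directory, and worker count are hypothetical placeholders, not taken from any published script.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

# Assumption: name/path of NetLogo's headless launcher; adjust for your install.
NETLOGO_HEADLESS = "netlogo-headless.sh"


def build_command(model, experiment, out_dir):
    """Return the argument list for one headless BehaviorSpace run."""
    table = Path(out_dir) / f"{experiment}.csv"
    return [
        NETLOGO_HEADLESS,
        "--model", str(model),
        "--experiment", experiment,   # experiment name as defined in the model
        "--table", str(table),        # one results table per experiment
        "--threads", "1",             # one thread per process; we parallelise outside
    ]


def run_one(command):
    """Run a single command and return its exit code."""
    return subprocess.run(command, check=False).returncode


def run_sweep(model, experiments, out_dir, workers=4):
    """Run each experiment as its own NetLogo process, several at a time."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    commands = [build_command(model, e, out_dir) for e in experiments]
    # Threads are enough here: each worker just waits on an external process.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        exit_codes = list(pool.map(run_one, commands))
    return dict(zip(experiments, exit_codes))
```

Calling something like `run_sweep("model.nlogo", ["sweep-a", "sweep-b"], "results")` would launch one process per experiment. On a cluster, the same command lines could be handed to the job scheduler instead of a local thread pool.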
I'm unsure if or when there will be a parallel NetLogo, but there are alternatives, such as Repast HPC (high-performance Repast): http://repast.sourceforge.net/repast_hpc.html and FLAME GPU: http://www.flamegpu.com/ There also appears to be a language named HLogo, proposed in 2013, which is a parallel version of NetLogo, but I am unsure of its availability: http://dspace.library.uu.nl/handle/1874/284708
A colleague and I have published some scripts for this. Here's his first one: http://resources.modelling4all.org/guides/running-experiments-on-a-cluster-of-computers
However, to my knowledge there isn't a parallel, distributed version of NetLogo in the works. Repast HPC is likely your best bet if you're looking for that.