Hello everyone,
I've implemented a FEM code in MATLAB, and now I'd like to make it faster. Since I have to perform nonlinear dynamic analysis, I have a lot of iterations, each of which requires assembling the full stiffness matrix K.
What I would therefore like to do is parallelize the assembly of K. So far I have tried parpool and spmd. spmd gave poor results; parpool performed reasonably well (a speedup of about 2x, despite using 10 cores), but only below a certain number of elements. Above that threshold, the parallel computation (on 14 cores) takes as much as 10 times as long as the single-core version.
I understand this may be related to overhead in the communication between the "master" and the workers and/or in the slicing of variables, but I can't seem to get the hang of it...
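To make the question concrete, below is a minimal sketch of one common way to structure this kind of assembly. It is illustrative only, not the code I am actually running: it assumes the parallel loop is a parfor, and the 1D bar mesh, the names conn, EA_L, I, J, V and the constant 2x2 element matrix are all hypothetical stand-ins for the real mesh and element routine.

```matlab
% Hypothetical 1D bar mesh: nEl two-node elements, one DOF per node.
nEl  = 1000;                        % number of elements (assumed)
conn = [(1:nEl)', (2:nEl+1)'];      % element connectivity (assumed)
nDof = nEl + 1;                     % total number of DOFs
EA_L = 1.0;                         % uniform axial stiffness / length (assumed)

% Each element contributes a 2x2 block, i.e. 4 (row, col, value) triplets.
I = zeros(4, nEl);
J = zeros(4, nEl);
V = zeros(4, nEl);

parfor e = 1:nEl
    dofs = conn(e, :);              % conn is sliced by row, not broadcast whole
    Ke   = EA_L * [1 -1; -1 1];     % stand-in for the real element routine
    [c, r] = meshgrid(dofs, dofs);  % r(i,j) = dofs(i), c(i,j) = dofs(j)
    I(:, e) = r(:);                 % sliced outputs: one column per element
    J(:, e) = c(:);
    V(:, e) = Ke(:);
end

% One sparse() call on the client; repeated (i,j) pairs are summed.
K = sparse(I(:), J(:), V(:), nDof, nDof);
```

The idea in this sketch is that each iteration only reads its own row of conn and writes its own column of I, J and V, and K is built once outside the loop, so no worker ever needs a copy of the whole K. Whether this is where my overhead comes from is exactly what I am unsure about.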
Does anyone have suggestions and/or can point me to some useful material on this specific matter?
Thank you all in advance,
Jacopo