Hi everyone,

I would like to speed up my STRUCTURE analysis by splitting the work across two computers, or two instances running simultaneously. I want to test K = 1 to 10 clusters.

Therefore, I would like to work with the same data, but instead of searching K = 1 to K = 10 in a single job, I would split the job across two instances of STRUCTURE: one running K = 1 to 5 and the other running K = 6 to 10. Again, the input data and all other settings would be exactly the same for both instances.
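
To make the split concrete, here is a minimal Python sketch of what I have in mind. The input file name `data.str`, the output prefix `results` and the helper names are hypothetical. The `-K`, `-i` and `-o` options are the command-line overrides as I understand them from the STRUCTURE documentation, so please check them against your version. The script only prints the commands and does not launch anything.

```python
def split_k_values(k_min, k_max, n_instances):
    """Split the K values k_min..k_max into n_instances contiguous chunks."""
    ks = list(range(k_min, k_max + 1))
    size, extra = divmod(len(ks), n_instances)
    chunks, start = [], 0
    for i in range(n_instances):
        # The first `extra` chunks get one additional K value.
        end = start + size + (1 if i < extra else 0)
        chunks.append(ks[start:end])
        start = end
    return chunks


def build_commands(chunk, input_file="data.str", out_prefix="results"):
    """Build one STRUCTURE command per K (file names are placeholders)."""
    return [
        ["structure", "-K", str(k), "-i", input_file, "-o", f"{out_prefix}_K{k}"]
        for k in chunk
    ]


if __name__ == "__main__":
    # Two instances, K = 1..10, same input and settings for both.
    for n, chunk in enumerate(split_k_values(1, 10, 2), start=1):
        print(f"Instance {n}:")
        for cmd in build_commands(chunk):
            print("  " + " ".join(cmd))
```

Each instance would then work through its own list of commands, with the same mainparams and extraparams files in the working directory.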

I have tried this with a small dataset: the clustering obtained after merging the results of the two instances was the same as the clustering from a single run on the original data.

I know that I can also speed up STRUCTURE by reducing the burn-in length or the number of MCMC reps, but that is a separate issue.

I would be happy to get feedback from the community on whether this approach is appropriate or whether it could give me wrong results.

Thank you.

Said
