How should I initialise the mpirun parameters for running a quantum espresso 'scf' calculation for the best parallel computing performance?

More Nabasindhu Das's questions See All

For an in-vitro drug release study, what molecular weight cut-off (MWCO) dialysis bag is required for a 117 kDa protein?

kindly reply me. Thanking you in advance.

05 August 2024 7,727 4 View

How can we identify (in silico) the interacting amino acid residues or the nucleotides involved in the Protein-Protein / Protein-RNA interaction?

Hello! everyone, I am trying to study in silico Protein-Protein and Protein-RNA interactions. Now, is there any tool with which I can identify the interacting amino acid residues or the...

14 July 2024 950 2 View

Any suitable Animal models to evaluate Mast cell stabilizers?

I am looking for an in-vivo model to evaluate standardized herbal extract for its potential as a mast cell stabilizer.

14 July 2024 779 0 View

What is the mathematical expressionof theoretical yield strength calculated considering solution strengthening in stainless steel 316L?

In the expression for solid solution strengthening model that has been widely used in the literature use a linear form consisting of strength coefficient, and concentration of elements in wt.%....

11 July 2024 4,476 3 View

How to insert cholesterol inside DPPC membrane by replacing same number of DPPC molecules in GROMACS?

I am trying to insert cholesterol inside the DPPC membrane by using gmx insert_molecules -replace. But number of DPPC molecules replaced is much higher than the number of cholesterol inserted?...

30 June 2024 4,160 2 View

How to synthesize Dichloro(p-cymene)ruthenium(II) dimer in a good yield?

Suggest some procedure where i can synthesize [Ru(p-cymene)Cl2]2 in a good yield.

23 June 2024 9,244 2 View

Analysis of microbial biomass for nutrient cycling?

It will be an honor, if anyone can suggest the standard methodology for analyzing nutrients(N,P,K) in soil microbial biomass

23 June 2024 3,850 4 View

Can i use polystyrene as a matrix in a hybrid COMPOSITE?

fiber powders will be used as a reinforcement.

23 June 2024 10,027 3 View

After expert validation of the research tool for data collection, what should be done first after piloting. Reliability analysis or factor analysis?

Which would be prefered first (factor analysis or reliability analysis of a Likert type scale)

19 June 2024 9,378 4 View

Which media will produce more biomass in saccharomyces cerevisiae?

Dear all, For higher biomass for saccharomyces cerevisiae which media is suitable in shake flask? 1. YPD media 2. Basal salt media and what is the optimum C/N ratio ( 5:1 /10:1). Thanks &...

18 June 2024 2,106 0 View

Is it possible to plot the atom-projected band structure using GPAW?

Hi, I'm currently working on a project where I need to plot the atom-projected band structure using GPAW. I've been able to calculate the band structure for my material, but I'm having trouble...

07 August 2024 269 3 View

Separation of organic acids-HPLC?

Hello What should be done to separate and identify organic acids in HPC when their RetTime is the same?Like oxalic acid with Propanoic Acid.or acids that have a very close RetTime.

07 August 2024 8,782 3 View

AUX gas reading problem on QE with full MS and PRM method in one run?

Dear QE-users, In the method where full MS positive mode and PRM mode are used, we always get an incorrect auxiliary gas reading (41 instead of 25). This only happens in this method; other...

06 August 2024 4,953 0 View

How to use Density Functional Theory to calculate carrier mobilities of solid system?

Hello, everyone. I have tried to determine carrier motilities of some materials, by Density Functional Theory, using Quantum ESPRESSO. There are a few methods to do it, like a package called...

04 August 2024 8,894 1 View

People weight in Oaxaca Blinder Decomposition on R?

Hello guys! Do you have experience running a Oaxaca-Blinder decomposition on R applying person weights. How do you suggest doing it? I have a variable PERWT which gives more information on how...

04 August 2024 6,033 0 View

Which test should be used to study association among demographic profile and awarness level?

i have to study the awareness and adoption level of cloud computing in a district of India. i also want to use association among demographic variables like gender, age, education, income etc and...

02 August 2024 2,420 3 View

How to perform EEG source analysis on each trial of data separately?

Hello Everyone I have a question about structure for connectivity analysis on sources. My goal: preprocess and cut data into trials create headmodels, using template MRI file perform source...

30 July 2024 2,744 1 View

How gold procurement?

Please note it

29 July 2024 4,920 3 View

How to add Cr parameter in autodock?

When I run autogrid4 it says: autogrid4: ERROR: Unknown receptor type: "Cr" -- Add parameters for it to the parameter library first! Look forward to your reply.Thank you so much!

29 July 2024 488 0 View

Why running a restart analysis in Abaqus in Ubuntu OS gives an error as attached when running the same job in Windows doesn't give any error and runs?

I am trying to run a restart analysis, which imports deformed configurations of parts from a generated ODB file. It runs fine in Windows OS but when I try to run it in Linux OS, it is giving some...

29 July 2024 9,572 3 View

Mohammad Saeed Bahramy

There are different way to get the best parallelization performance from QE. Here are a few tips:

1. If your calculation consists of a large number of k-points, npool can strongly enhance the process. It must be a divisor of the total number of k-points. I personally set it to 4.

2. In combination with npool, you can further improve the performance using the bgrp command. It allows the program to split the KS states across the selected group of processors.

For example, assuming you have 32 cores you can try something like this,

mpirun –np 32 pw.x –npool 4 –bgrp 4 –input pw.in

3. You can also try to parallelize the matrix diagonalization routines, using ndiag command. Choose it such that ndiag

Nabasindhu Das

Hi Mohammad,

Thanks for your answer. I tried to implement it and it reduced the runtime by a factor of three. However, I have a few another ancillary questions, which I hope you could help on:

1. Can we judge the amount of parallelization by evaluating the difference between wall time and the cpu time?

I think a good way to know that parallelization happens, is that the difference between them is low. However, sine a lot of that time difference is accounted by MPI communications, I think using too many cores is a bad idea, right ?

2. In the output file, this is shown: Subspace diagonalization in iterative solution of the eigenvalue problem: a serial algorithm will be used

This changes by using ndiag. Is there some criteria to know when to use serial and when to use parallel for this process?

Much thanks for your previous answer, and hoping you could shed some light on these as well.

Dear Nabasindhu Das

Glad to hear you've made some improvements. Regarding your questions:

1. Yes, ideally there should be a linear down-scaling between the wall time and the number of cores (or cpu time). However, due to the lagging happening during the inter-node MPI communications, usually as the number of cores increases you start to notice a rapid deviation from linear scaling. This simply means after reaching a threshold, increasing cores is not going to help you (or in some cases actually can slow down the whole calculation). For small systems, I usually use only 16 to 32 cores and for big ones I personally have never gone beyond 128 cores (mind that this is just a personal preference due to my system settings).

2. QE used to do parallel subspace diagonalization in its earlier versions by default. The recent versions however do not do this by default, as it does not necessarily perform better than the serial one. As you already found out, you can explicitly turn this option on with the command-ine "-ndiag N". There's no clear criterion When you should choose one over another. Usually, if you're dealing with large matrices parallel subspace diagonalization works better, provided that your inter-node connections are fast.

Hope this information helps and good luck with your parallelizations.