
For background, I am trying to run two independent and separate .conf files, each on a different GPU. These are NAMD molecular dynamics simulations.

I am running on a remote HPC cluster. I log in correctly and request an interactive session with: qrsh -l gpu,A100,cuda=2,h_rt=12:00:00
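
As a first sanity check (a minimal sketch, assuming nvidia-smi is available on the compute node), I confirm inside the session that two distinct A100s were actually allocated and see what the scheduler exported:

# List the GPUs visible to this session; there should be two distinct A100s.
nvidia-smi --query-gpu=index,name,uuid --format=csv

# Some clusters pre-set CUDA_VISIBLE_DEVICES for the job; check before overriding it.
echo "CUDA_VISIBLE_DEVICES=${CUDA_VISIBLE_DEVICES:-<unset>}"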

When I go to run my files, I launch one of:

CUDA_VISIBLE_DEVICES=0 ~/bin/namd3 +p1 +setcpuaffinity +devices 0 config.conf > log.log

CUDA_VISIBLE_DEVICES=1 ~/bin/namd3 +p1 +setcpuaffinity +devices 0 config.conf > log.log

(+p1 is used because this is the GPU-resident mode of NAMD; that is not important to the issue at hand. With CUDA_VISIBLE_DEVICES=1, the single visible GPU is renumbered to 0, so +devices 0 is correct in both cases.)
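
One suspicion: with +p1 and +setcpuaffinity, both processes may pin their single worker thread to the same CPU core, which would serialize them even though the GPUs are separate. Below is a sketch of an alternative launch I am considering, which uses Charm++'s +pemap option to give each run its own explicit core (the core indices and file names here are placeholders, not my actual files):

# Run both sims concurrently, each on its own GPU and its own CPU core.
# +setcpuaffinity +pemap N pins the worker thread to core N, so the two
# processes cannot land on the same core.
CUDA_VISIBLE_DEVICES=0 ~/bin/namd3 +p1 +setcpuaffinity +pemap 0 +devices 0 sim1.conf > sim1.log 2>&1 &
CUDA_VISIBLE_DEVICES=1 ~/bin/namd3 +p1 +setcpuaffinity +pemap 1 +devices 0 sim2.conf > sim2.log 2>&1 &
wait   # block until both background runs finish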

However, running them at the same time slows both simulations down dramatically. Normally, a .conf file takes 1 hour to run on a single A100 GPU card. But when I run both simultaneously, even though they are on different A100s, each one is estimated to take 4.5 hours to finish.

I am hoping to find a solution, because the two runs should finish in about 1 hour total: each sim is on a different GPU, so they should not be interfering with one another. Setting CUDA_VISIBLE_DEVICES did not fix it. Below is my GPU usage during the two sims running on separate GPUs.
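
For anyone trying to help diagnose this, a sketch of how I would narrow down the contention (assuming nvidia-smi and standard Linux tools are available on the node): if the GPUs show low utilization while the two namd3 processes report the same core, the bottleneck is CPU affinity rather than the GPUs.

# Per-GPU utilization sampled once per second (the "sm" column is compute %).
nvidia-smi dmon -s u

# Which CPU core each namd3 process is currently running on (PSR column).
ps -o pid,psr,pcpu,comm -C namd3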
