I have a Navier-Stokes-based Numerical Wave Tank (NWT) code that has been parallelized using MPI (Message Passing Interface). I am running this code on a single multicore node/computer (an i7 machine with 4 GB of RAM).
The problems I am currently simulating (using my parallel NWT code) are two-dimensional and not memory-intensive (the maximum number of computational cells is < 10^5); in other words, the serial version of the code is also able to simulate these problems (albeit more slowly) without running into memory issues.
I reckon that the computing architecture I am using (a single node) is better suited to OpenMP-based parallel codes (shared-memory model) than to MPI-based codes (distributed-memory model).
So my question is this: would the speedup (ψ) be adversely affected if a non-memory-intensive CFD code were parallelized using MPI and then run on a single multicore machine/node (say, an i7)?
P.S.: I ask this question because I am struggling to achieve linear speedup (ψ = number of cores) with my code; currently ψ_max < 4 on eight cores, irrespective of the nature of the problem being simulated.
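For context on the numbers above: Amdahl's law alone can already cap ψ well below the core count, independent of MPI vs. OpenMP. The sketch below (plain Python, just illustrative arithmetic, not part of my NWT code) inverts Amdahl's law to show what serial fraction would be consistent with the observed ψ ≈ 4 on eight cores.

```python
def amdahl_speedup(f, p):
    """Amdahl's law: speedup on p cores if a fraction f of the
    runtime is perfectly parallelizable (the rest is serial)."""
    return 1.0 / ((1.0 - f) + f / p)

def parallel_fraction(psi, p):
    """Invert Amdahl's law: given a measured speedup psi on p cores,
    return the implied parallelizable fraction f."""
    return (1.0 - 1.0 / psi) / (1.0 - 1.0 / p)

# Observed: psi = 4 on p = 8 cores
f = parallel_fraction(4.0, 8)
print(f"implied parallel fraction f = {f:.3f}")          # ~0.857
print(f"speedup ceiling as p -> inf: {1.0/(1.0 - f):.1f}x")  # 7.0x
```

So even a modest ~14% serial/communication fraction would limit ψ to 4 on eight cores and to 7 asymptotically; on a single node, memory bandwidth shared between ranks typically adds to that serial-like overhead.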