When we want to check the impact of a variable on another variable, which statistical techniques should we apply?

Sal Mangiafico

When you say "impact", do you mean to imply that there is a cause-and-effect relationship? Or just that there is some kind of relationship (e.g. correlation)? Or perhaps that knowledge about the real world could lead one to conclude there is an "impact"?

Proloy Barua

@Annu Annu, I agree with @Salvatore S. Mangiafico. Please clarify whether you mean "correlation" or "causation" by the "impact of one variable on another variable". Regression is commonly used when a cause-and-effect relationship is assumed, although regression alone does not establish causation.

Annu Annu

Can we generalise the impact of one variable on another by using 'correlation' only? I ask because in my objective I didn't specify that I would establish 'causation', i.e. a cause-and-effect relationship.

Panos Petsas

Since you want to generalize the impact only with the term of "correlation":

Correlation is a two-way street! If variables X and Y are highly correlated, you can say that there is a relationship between them, either positive or negative!

- You can claim, for example, that in your sample, records with high values of X tend to have high (or low) values of Y.

- But you cannot claim that higher values of X result in higher (or lower) values of Y! Or the opposite!

- Therefore, correlation does not imply an "impact" of one variable on the other; it just reveals their relationship!

> If you want to check for possible relationship between two variables, X and Y, the first thing you should do is plot them! The graph would reveal whether there is a relationship or not!

> You can use a correlation coefficient, such as Pearson or Spearman correlation (you can search for them on Wikipedia). These can give you some answers, depending on your question and data.

> You can always use regression techniques to model their relationship. Based on what you see in the plot, you might find that they have a quadratic relationship, in which case you should include the variable X^2 along with the original X to get a better fit!
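As a rough illustration of the three suggestions above, here is a minimal sketch in plain Python. The data are made up, and the helper names (pearson, spearman, polyfit, r_squared) are written here for illustration only; they are not taken from any particular library. It shows why plotting comes first: with a U-shaped relationship both correlation coefficients are zero, while a regression on X and X^2 fits perfectly.

```python
import math

def pearson(x, y):
    """Pearson correlation: strength of the linear relationship."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def ranks(values):
    """Average ranks (1-based), so tied values share the same rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def solve(a, b):
    """Solve a small linear system by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [u - f * v for u, v in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def polyfit(x, y, degree):
    """Least-squares polynomial coefficients [b0, b1, ...] (normal equations)."""
    powers = range(degree + 1)
    a = [[sum(v ** (i + j) for v in x) for j in powers] for i in powers]
    b = [sum((v ** i) * w for v, w in zip(x, y)) for i in powers]
    return solve(a, b)

def r_squared(x, y, coef):
    """Share of the variation in y explained by the fitted polynomial."""
    my = sum(y) / len(y)
    pred = [sum(c * v ** k for k, c in enumerate(coef)) for v in x]
    ss_res = sum((w - p) ** 2 for w, p in zip(y, pred))
    ss_tot = sum((w - my) ** 2 for w in y)
    return 1 - ss_res / ss_tot

# Made-up data (an assumption for illustration): a perfect U-shaped relationship.
x = [-3, -2, -1, 0, 1, 2, 3]
y = [v ** 2 + 1 for v in x]

r_pearson = pearson(x, y)
r_spearman = spearman(x, y)
r2_linear = r_squared(x, y, polyfit(x, y, 1))      # X only
r2_quadratic = r_squared(x, y, polyfit(x, y, 2))   # X and X^2

# Made-up monotone but curved data: the ranks agree perfectly, a straight line does not.
x_mono = [1, 2, 3, 4, 5, 6]
y_mono = [v ** 3 for v in x_mono]
r_pearson_mono = pearson(x_mono, y_mono)
r_spearman_mono = spearman(x_mono, y_mono)

print(r_pearson, r_spearman, r2_linear, r2_quadratic)
print(r_pearson_mono, r_spearman_mono)
```

In practice one would more likely use an established statistics package for this; the hand-written versions are only meant to show what each quantity measures.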

I generally agree with Panos Petsas, but I think it's also fair to invoke the word "impact" if you can reasonably assume there's some causal relationship. Say I tell you there's a correlation between ice cream consumption and air conditioner use. No one thinks that one causes the other. But if I say there's a correlation between daily air temperature and air conditioner use, it's fair to say that one impacts the other, because we know from life experience that people use the air conditioner more when the daily air temperature is higher.

Overall, I wouldn't worry about using the word "impact" in your research objective. Instead, just report your findings as fairly as possible, whether you think there's an "impact" or not.

James R Knaub

Panos said "...the first thing you should do is plot...." Yup. However, in doing regression, one is the predictor and the other is the response, so beware that the regression may well have omitted variable bias. The best predicted-y is from the best combination of predictors - not too many and not too few, and just the right ones. But you can still see how one predictor acts alone, though it may be somewhat different in combination with other predictors. Also, sometimes, one predictor is best.
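To make the omitted variable point concrete, here is a minimal sketch with made-up, noise-free numbers. The variable names and the coefficients 2 and 3 are assumptions chosen for illustration. When a second predictor that moves with x1 is left out, the slope on x1 alone absorbs part of its effect.

```python
def slope(x, y):
    """Ordinary least-squares slope of y regressed on a single predictor x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

# Hypothetical data: y depends on two predictors, x1 and x2, with no noise.
x1 = [-2.0, -1.0, 0.0, 1.0, 2.0]
z = [1.0, -1.0, 0.0, -1.0, 1.0]            # the part of x2 unrelated to x1
x2 = [0.5 * a + b for a, b in zip(x1, z)]  # x2 partly tracks x1
TRUE_B1, TRUE_B2 = 2.0, 3.0
y = [TRUE_B1 * a + TRUE_B2 * b for a, b in zip(x1, x2)]

slope_y_on_x1 = slope(x1, y)     # regression that omits x2
slope_x2_on_x1 = slope(x1, x2)   # how strongly x2 moves with x1

# With x2 omitted, the slope on x1 picks up x2's effect through their association.
expected_short_slope = TRUE_B1 + TRUE_B2 * slope_x2_on_x1

print(slope_y_on_x1, expected_short_slope)
```

Here the one-predictor slope comes out near 3.5 rather than the true 2, which is how one predictor acting alone can look different from the same predictor in combination with others.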

James R Knaub, you are absolutely right! I assumed that there is only one predictor variable (X). If there are many predictors, your approach is the most suitable!

David A. Jones

There are other possibilities besides regression. But selecting an appropriate approach depends both on how much data you have and on what fits into your overall objective for your immediate project.

For example if both:

(i) you have a lot of data;

(ii) you want an easily interpretable report....

then you could aim to show estimated probability density functions of one variable, conditional on another variable being in particular classes. This would allow a visual presentation of the size of any effect in comparison to the size of underlying unexplained variations.

Other versions of this might use box-plots to show a visual assessment. Other visual approaches might include multi-coloured scatter plots to attempt to deal with several variables.
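A box-plot style summary of one variable conditional on classes of another can be sketched as follows. This is only an illustration: the data, the cut points and the function names are made up, and it uses the quantiles function from Python's standard statistics module.

```python
import statistics
from collections import defaultdict

def five_number_summary(values):
    """The numbers a box plot is drawn from: min, quartiles, max."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": median,
            "q3": q3, "max": max(values)}

def summarise_by_class(x, y, cut_points):
    """Group y by the class that x falls into, then summarise each group."""
    groups = defaultdict(list)
    for xv, yv in zip(x, y):
        label = sum(xv >= c for c in cut_points)  # 0 is the lowest class of x
        groups[label].append(yv)
    return {label: five_number_summary(vals)
            for label, vals in sorted(groups.items())}

# Made-up data: 12 observations, x split into three classes at 5 and 9.
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]
y = [3, 5, 4, 6, 8, 7, 9, 10, 12, 11, 14, 13]

summary = summarise_by_class(x, y, cut_points=[5, 9])
for label, stats in summary.items():
    print(label, stats)
```

Comparing how far the medians shift between classes with the spread inside each class gives a visual sense of the size of the effect relative to the unexplained variation.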

But to select a good approach you need to start with thinking about what would be a good way of presenting any results. There may be too much emphasis nowadays on "significance tests" rather than on relating the size of any "impact" to the real-world situation.

Say we have

y = f(x) + e,

verified by a graphical residual analysis and a cross-validation to be reasonable. Then we see the "impact" of x on y: say x^3 predicts y, or whatever other function of x is found to perform well, the simplest being a ratio estimator.

But we may have other predictors needed, so that

y = f(x) + (predicted-y - f(x)) + e

is appropriate. Then we still see the "impact" of x, by whatever function, in whatever combination with other predictors.

Remember that e (or better, epsilon) often has a larger sigma associated with larger predicted-y values.
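As a rough sketch of the above, here is a ratio estimator checked by leave-one-out cross-validation, in plain Python with made-up numbers. The function names and the comparison against a "predict the mean" baseline are assumptions added for illustration. The residuals in this toy data were chosen to spread out as x grows, to mirror the remark about sigma.

```python
def ratio_fit(x, y):
    """Ratio estimator: slope b for y = b * x (a line through the origin)."""
    return sum(y) / sum(x)

def ratio_predict(b, x_new):
    return b * x_new

def mean_fit(x, y):
    """Baseline that ignores x and always predicts the mean of y."""
    return sum(y) / len(y)

def mean_predict(m, x_new):
    return m

def loo_cv_error(x, y, fit, predict):
    """Leave-one-out cross-validation: mean absolute prediction error."""
    errors = []
    for i in range(len(x)):
        x_train = x[:i] + x[i + 1:]
        y_train = y[:i] + y[i + 1:]
        params = fit(x_train, y_train)
        errors.append(abs(y[i] - predict(params, x[i])))
    return sum(errors) / len(errors)

# Made-up data: roughly y = 2x, with errors that grow with x.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.1, 3.9, 6.2, 7.8, 10.3, 11.7]

b = ratio_fit(x, y)
residuals = [yi - b * xi for xi, yi in zip(x, y)]

cv_ratio = loo_cv_error(x, y, ratio_fit, ratio_predict)
cv_mean = loo_cv_error(x, y, mean_fit, mean_predict)

print(b, residuals)
print(cv_ratio, cv_mean)
```

Plotting the residuals against the predicted values would be the graphical residual analysis; the cross-validation error says how well the chosen f(x) predicts points it has not seen.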

Cheers.

Ronán Michael Conroy

The word impact suggests that we are presenting a measure of effect size. And yes, there are many measures of effect size, some derived from regression models of various sorts, and others not so derived.

For example, we can measure the effect of a treatment on a binary outcome such as recovery using the relative risk, the odds ratio, the number needed to treat (and the number treated needlessly!), the preventable fraction in the treated etc.
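For a binary outcome such as recovery, several of these measures can be computed from the same four counts. A minimal sketch, with made-up counts and a hypothetical function name:

```python
def effect_sizes(recovered_treated, n_treated, recovered_control, n_control):
    """Common effect-size measures for a treatment and a binary outcome."""
    p_t = recovered_treated / n_treated   # recovery proportion, treated
    p_c = recovered_control / n_control   # recovery proportion, control
    risk_difference = p_t - p_c
    return {
        "relative_risk": p_t / p_c,
        "odds_ratio": (p_t / (1 - p_t)) / (p_c / (1 - p_c)),
        "risk_difference": risk_difference,
        # Patients to treat, on average, for one extra recovery.
        "number_needed_to_treat": 1 / risk_difference,
    }

# Made-up trial: 30 of 50 treated patients recover, 20 of 50 controls recover.
result = effect_sizes(30, 50, 20, 50)
for name, value in result.items():
    print(name, round(value, 3))
```

The same data give a relative risk of 1.5, an odds ratio of 2.25 and a number needed to treat of 5, which is why the question has to be specified before choosing which one to report.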

The measure of effect size is determined not by the statistical model but vice versa. We have to specify the question to know the effect size estimate that will best answer it.

Max Beran

The way the question is put implies some potential agency of X over Y, and perhaps also a not so hidden third variable over which both are changing. An obvious current case would be the impact of CO2 (X) on global mean surface temperature (Y), where both are changing through time. In such a case it is helpful to work with delta-X versus delta-Y to help reduce the effect of the third variable, time. It is also useful to work with lagged cases, such as this year's delta-X and last year's delta-Y, to assist with judging the direction of the agency. A lagged correlogram can be very revealing, suggesting, as in this case, a two-way influence where both X and Y feed into and are fed from some wider system.
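The differencing and lagging steps can be sketched as below, in plain Python. The two series are made up (they are not CO2 or temperature data), and the function names are assumptions for illustration. In this toy example the changes in Y follow the changes in X one step later, so the raw levels correlate strongly simply because both trend upward, while the lagged correlogram of the differences shows which one leads.

```python
import math
from itertools import accumulate

def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def diff(series):
    """First differences: delta[t] = series[t + 1] - series[t]."""
    return [b - a for a, b in zip(series, series[1:])]

def lagged_correlation(dx, dy, lag):
    """Correlation between dx[t] and dy[t + lag].

    Positive lag pairs each change in X with a later change in Y;
    negative lag pairs each change in Y with a later change in X.
    """
    if lag >= 0:
        a, b = dx[:len(dx) - lag], dy[lag:]
    else:
        a, b = dx[-lag:], dy[:len(dy) + lag]
    return pearson(a, b)

# Made-up step changes in X; Y's change one step later is half of X's, plus 1.
steps_x = [1, 3, 2, 5, 1, 4, 2, 6, 1, 3]
steps_y = [1.5] + [0.5 * v + 1.0 for v in steps_x[:-1]]

# Both level series trend upward through "time".
x = list(accumulate(steps_x, initial=0))
y = list(accumulate(steps_y, initial=0.0))

r_levels = pearson(x, y)                  # inflated by the shared trend
dx, dy = diff(x), diff(y)
correlogram = {lag: lagged_correlation(dx, dy, lag) for lag in range(-2, 3)}

print(r_levels)
for lag, r in correlogram.items():
    print(lag, round(r, 3))
```

A clear peak at a positive lag, as here at lag 1, points to X leading Y; comparable peaks on both sides would be more in line with a two-way influence.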