Is causation a confirmation for correlation in social science research? How is it different for natural sciences?

More Mukul P Gupta's questions See All

Why Do TDS and EC Increase with Larger Wastewater Volumes, While BOD and COD Decrease?

I have carried out MFC experiments on three different volumes, 50, 500 and 1000 mL of wastewater. Results after MFC treatment shows that TDS and EC are more in larger volumes of water i.e. TDS and...

09 August 2024 9,621 0 View

How to enrich pig excreta for increasing nutrient quality organically ?

Pig slurry is rich in major and minor nutrients. Is there any way to improve / Enrich its manure quality to be used in agriculture organically ? please share your knowledge.

09 August 2024 5,605 2 View

Is it possible to plot the atom-projected band structure using GPAW?

Hi, I'm currently working on a project where I need to plot the atom-projected band structure using GPAW. I've been able to calculate the band structure for my material, but I'm having trouble...

07 August 2024 269 3 View

Unusual intensity drop in some sections of chromatograms in DDA?

Hi, we have measured tryptic peptides using both DDA and DIA method on QExactive. In DDA replicates i saw unusual intensity drops occurring at the same sections of chromatograms in DDA replicates...

07 August 2024 3,218 4 View

Leaf area of tomato ?

Hi How can this equation Ln(LA) = 1.038 + 0.89 ln(X) be applied to calculate the leaf area of a tomato? Can you explain with an example and what is the substitution of Ln and ln?

06 August 2024 2,508 2 View

Why did the authors extrapolate a phenotype that they experimentally proved in one bacterial strain across the whole genus of the organism?

I aim to be as skeptical as possible regarding whether a pair of orthologous genes results in the same phenotype in their different but related bacterial organisms under similar environmental...

05 August 2024 6,787 4 View

How to preform densitometry on SDS-page bands?

I ran a SDS-page of a bacterial lysate and I want to quantify protein concentration in a specific band. I was thinking of using a standards ladder or make some standards are different...

05 August 2024 9,805 3 View

XRD Analysis is showing only Calcium carbonate. It is not showing other compounds. Can anyone help me get the other compounds?

XRD Analysis is showing only Calcium carbonate. It is not showing other compounds. Can anyone help me get the other compounds

04 August 2024 3,019 3 View

Which solvent is better to dissolve with secondary metabolites extracted from fungi?

I work on MCF7 cell cell for anticaner purpose and I wa to do drug preperation the drug ( secondary metabolites extracted from Aspergillus) My question which solvent is better with these secodary...

03 August 2024 4,725 2 View

What are the limitations and challenges of using machine learning for predicting concrete compressive strength in practical applications?

Machine learning (ML) has shown great potential in predicting the compressive strength of concrete, an important property for structural engineering. However, its practical application comes with...

03 August 2024 2,546 2 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

GC-MS retention index prediticon?

Hello experts, Does anyone know any free software about retention index prediction ?

08 August 2024 7,403 2 View

Is there an alternative to a multinomial regression which allows the DV to be non mutually exclusive?

I am trying to analyse data from a survey examining what variables affect teachers perceived barriers to incorporating technology into their classroom. I have 5 predictor variables however my DV...

06 August 2024 1,752 3 View

In order to run Multinomial Logistic Regression, is it required that the data be in the long format?

I am using unit level data (IHDS round 2) & Stata 17

06 August 2024 5,725 2 View

Could you please suggest methods to compare free protein and immobolized protein binding properties?

I have an antibody binding generic protein and I need to compare its activity in a free and immobolized form. I understand that there are a number of methods to determine Kd value of a free...

05 August 2024 5,311 0 View

How to report results of Generalised Linear Mixed Models in a journal article?

Hi everyone, If you have written or come across any papers where Generalised Linear Mixed Models are used to examine intervention (e.g., in mental health) efficacy, could you please share the...

04 August 2024 4,130 4 View

Request a single Lecture notes for math as detailed as this that I can find in one place?

- The Existence/Uniqueness of Solutions to Higher Order Linear Differential Equations - Higher Order Homogenous Differential Equations - Wronskian Determinants of $n$ Functions - Wronskian...

03 August 2024 2,366 0 View

Where can I find free research instruments for Nursing?

for eg. transition shock scale

01 August 2024 5,998 0 View

Why do open and free science in a world where science is not open and free?

Because I have realized that the world tends more and more to do open and free science and there is a trend more and more to choose free databases, free tools and open access platforms.

01 August 2024 10,046 1 View

Normality assumption for linear regression is The assumption of normality is whether for residual errors or predictor variavble?

When we conduct linear regression, there are several assumptions. The assumption of normality is whether the residual errors are normally distributed, not whether a predictor is normal?

31 July 2024 6,164 3 View

Ariel Linden

Hi Mukul,

Interesting question!

I would argue that if there is a causal relationship between two variables, then there will have to be a correlation between them.

In thinking of this within a simple regression framework, you would have Y = a + b(treatment), where the beta coefficient will tell us whether there is a statistically significant treatment effect of treatment on Y (note that this is assuming that the data are from an RCT, and that indeed the relationship is causal). As such, this relationship also represents a correlation between Y and treatment.

However, if these data are not from an RCT, we would still have the same regression (albeit with added covariates to control for confounding), but we can only assume that the relationship is causal. Thus we may have a correlation between Y and treatment, but the causal relationship is based on the assumption that we have controlled for all sources of confounding something that in practice we can never know). Conservative commentators will argue in this case that we can claim correlation, but not causation, by the fact that the data are not derived from a randomized trial.

I hope this helps

Ariel

Jessica Williams-Nguyen

Ariel, correct me if I'm wrong here, but I believe there could be relatively rare cases where two factors (e.g. treatment, which I'll call X, and Y) that are causally related could be empirically uncorrelated. This could occur if there are two causal pathways between X and Y mediated by different factors (e.g. X->A->Y and X->B->Y). If one of these pathways has a negative effect, the other has a positive effect and the absolute magnitude of these effects is equal or very nearly equal, we would observe no correlation between X and Y. In reality, such situations will be very rare, but we are left unable to say that causation always implies correlation.

Hi Jessica,

You bring up a good point. From a theoretical standpoint, there would have to be a 100% indirect effect, that is, X impacts M and M impacts Y, but X does not directly impact Y. From a practical standpoint I have never seen such a case where there is absolutely no direct effect of X on Y.

So from a theoretical perspective, there could be a scenario where there is causation but no correlation, however in the real world this is a highly unlikely scenario.

By the way, I am attaching a paper I recently wrote on mediation which discusses these issues in a more specific case .

Article Using mediation analysis to identify causal mechanisms in di...

Thank you, Ariel! A very nice paper. I look forward to the increasing application of mediation analyses to diverse fields. On the original question, I will also note that in highly nonlinear systems, we also cannot rely on the assurance that simple correlation will be present when there is a true causal link. Here is a short and accessible editorial from physicist Mark Buchanan discussing the issue.

Mukul P Gupta

Appreciate your answers and responses to my quite a naive question Jessica and Ariel.

In non linear relationships or for that matter, in the simplest of a non linear relationship like a quadratic relation of the kind y = a + bx + cx2 the derivative will be of the kind dy = (b + 2cx).dx, a linear kind of function.

What would we say about correlation between y and x.

Krishnan Umachandran

Correlation to Causation - Stress-coping, self-medication, or tension-reduction for smoking behavior.

http://www.uic.edu/depts/psch/kasseldocs/1292270.pdf

Dr Gupta, in the quadratic example you pose, whether you could find a correlation would depend on where in the state space and with what tools you are trying to look for one. It should be straightforward to find a quadratic relationship, given that you know this is correct functional form of the relationship. If you want to find a correlation on the linear scale, there are some parts of the curve where this would be possible. But failure to find a linear relationship doesn't mean there's no causation. In the attached graph, there is indeed causation between x and y. I know because I simulated the data as y = 2.5 - x^2 with some stochastic noise added. Yet we can't detect the relationship with linear regression. Of course, in this simple case, a look at the graph would quickly set us on the right path. In more complicated systems, I think it can be more of a challenge to assure we have an appropriate functional form. Is this any help?

Thanks Jessica for taking the time to respond.

My question hints at the challenge that we might face in analyzing big-data where simple linear relationships may not exist. This may be the time to look at other ways of understanding data than the association-correlation-causation-regression methods.

I agree. This an issue I've been thinking about quite a bit lately. It seems to me that the investigation of complex, nonlinear and even chaotic systems is both the promise and the peril of "big data". In epidemiology, we have embraced graphical models, especially directed acyclic graphs (DAGs) as a way to sort through the multitude of interacting associations and improve our estimates of causal links. But I haven't seen them applied quite as widely to dynamical systems.

On moving beyond association, I'm not sure quite how. I don't think I have yet to come upon a empirical method that is not relying on association in form or another to infer causation (not that I've come upon them all). As statistician Edward Tufte said, "Correlation is not causation, but it sure is a hint." Simulation modeling is one way to tackle the problem from the other end, i.e. understand the generative process from first principles making plausible assumptions and see if you can explain observed phenomenon. But that also has its issues.

I see very interesting theoretical work on this issue coming out of the field of ecology. Perhaps this blog post I stumbled upon recently would interest you.

https://dynamicecology.wordpress.com/2014/06/30/steven-frank-on-how-to-explain-biological-patterns/

Abdunasir Sideeg

I absolutely agree with Ariel Linden's statement: "..from a theoretical perspective, there could be a scenario where there is causation but no correlation, however in the real world this is a highly unlikely scenario."