R² is the proportion of the variance of the data that is "explained" by the model. There is no deeper interpretation. It roughly gives an impression of how closely the data points are located around the model curve, relative to the range spanned by the predictors. This is almost never a very useful measure. The only exception might be in analytical fields, where the quality of a calibration curve might be assessed by R² (R² must be very close to 1; otherwise the data are not suited). I would be happy to learn if there is some practical use for R² I am not aware of.
There are two things much, much more interesting:
(i) the estimated parameter values (e.g. in a simple linear regression: the slope of the regression line) and
(ii) the residual variance or a similar measure that tells us how close an observation can be expected to the model prediction.
(i) is often interesting because it tells us how strong the estimated effect of the predictor is. A confidence interval for the estimates can be interpreted as a range of parameter values that are compatible with the observed data. If only the direction of the effect is of interest, one may give a p-value instead, which would indicate whether the confidence interval is completely on one side of the "null" or whether the data are compatible with parameter values on either side of the "null".
(ii) is sometimes interesting, particularly when the model is used for predictions.
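As a toy sketch of the point above (made-up numbers, plain Python, not data from the paper), here is how (i) the slope, (ii) the residual standard deviation, and, for comparison, R² come out of a simple linear regression:

```python
import math

x = [4.5, 5.0, 5.5, 6.0, 6.5, 7.0]        # e.g. soil pH (hypothetical)
y = [12.0, 15.0, 14.0, 18.0, 17.0, 21.0]  # e.g. species richness (hypothetical)

n = len(x)
mx, my = sum(x) / n, sum(y) / n
sxx = sum((xi - mx) ** 2 for xi in x)
sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))

slope = sxy / sxx                    # (i) the estimated effect of the predictor
intercept = my - slope * mx
resid = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
sse = sum(r ** 2 for r in resid)
resid_sd = math.sqrt(sse / (n - 2))  # (ii) typical distance of an observation from the line
sst = sum((yi - my) ** 2 for yi in y)
r_squared = 1 - sse / sst            # proportion of variance "explained"

print(slope, resid_sd, r_squared)
```

The slope and residual SD are in the units of the problem (richness per pH unit; richness), which is exactly why they carry more practical meaning than the unitless R².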
Looking back at the figure already reveals a problem with reporting R²:
In the lower left diagram, R² for Tem/Bor forest is 0.2, and for agriculture it is 0.19. These values seem similar. However, the slope for agriculture is steeper than that for Tem/Bor forest. Hence, changing the pH has a larger effect on species richness in agriculture. This is completely lost when looking only at the R² values. Of course, to make a point of this one would need to reject the hypothesis that the effects in both environments are equal. I don't know if this was the aim and/or done.
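A hedged illustration of this (invented numbers, not the figure's data): if one group's responses are simply three times the other's, the slope triples while R² is identical, so similar R² values say nothing about similar effects.

```python
def fit(x, y):
    """Simple OLS; returns (slope, R-squared)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    b0 = my - slope * mx
    sse = sum((yi - (b0 + slope * xi)) ** 2 for xi, yi in zip(x, y))
    sst = sum((yi - my) ** 2 for yi in y)
    return slope, 1 - sse / sst

x = [4, 5, 6, 7, 8]
y_a = [10, 13, 11, 15, 14]
y_b = [3 * v for v in y_a]  # same relative scatter, three times the effect

print(fit(x, y_a))  # modest slope
print(fit(x, y_b))  # triple the slope, identical R-squared
```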
Formal and informal guidelines for R-squared as a standardized effect size measure vary across fields. In some fields like mine (psychology), researchers are often quite happy when they find an R-squared around .20 (even a multiple R-squared derived from multiple predictor variables!). In other fields, a value of .20 may be seen as a small effect.
Cohen (1988) provided effect size guidelines for (bivariate) Pearson correlations r, according to which r = .1 would be a small effect, r around .3 a medium effect, and r around .5 or larger a large effect. According to Cohen's guidelines, an R-squared of .20 would constitute a medium to large effect (r = .447).
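The conversion used above is just the square root: in the bivariate case, R-squared equals r², so R² = .20 lands between Cohen's "medium" (.3) and "large" (.5) thresholds for r.

```python
import math

r_squared = 0.20
r = math.sqrt(r_squared)        # bivariate case: r is the square root of R-squared
print(round(r, 3))              # 0.447 -- between Cohen's medium (.3) and large (.5)
```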
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (pp. 77-83).
Experts and professors, thank you very much for your generous responses. The importance attached to particular statistics seems to vary by discipline and research direction. I have also recently come across research that goes beyond linear interpretation and challenges the idea that good fit (a higher R²) is evidence of causality, for example nonlinear time series analysis and empirical dynamic modeling, though I have not yet studied these in depth. Here are several articles for your reference and for later readers.
[1] Chang, C.-W., Miki, T., Ye, H., et al. Causal networks of phytoplankton diversity and biomass are modulated by environmental context. Nat Commun 13, 1140 (2022).
[2] Ye, L., Tan, L., Wu, X., Cai, Q., & Li, B. L. Nonlinear causal analysis reveals an effective water level regulation approach for phytoplankton blooms controlling in reservoirs. The Science of the Total Environment, 806 (2021).
[3] Review on Causality Detection Based on Empirical Dynamic Modeling