Is there any study suggesting that a model with such low values is still usable in social science, or should I disregard the linear relationship? (Outliers were removed from the data, and the sample size is 250.)
When analyzing individual (not aggregated) data, such low values are not unusual - you have to decide whether the result is practically useful and whether the assumptions behind the analysis have been met. Individuals are typically very heterogeneous in their attitudes, actions and behaviours.
I am reminded of a famous clinical trial of the effect of taking aspirin on heart attacks - the effect was so dramatic that the trial was stopped and the placebo group advised to take aspirin. And yet the odds ratio of a heart attack for placebo compared with aspirin was a rather lowly 1.83, and the R2 was a puny 0.0011; yet this was sufficient for action.
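The arithmetic behind those two numbers can be reproduced from the trial's 2x2 table. The counts below are approximate figures for the aspirin arm of the Physicians' Health Study (roughly 104 heart attacks among 11,037 on aspirin and 189 among 11,034 on placebo); treat them as illustrative rather than exact:

```python
import math

# Approximate 2x2 table: rows = placebo/aspirin, cols = heart attack / none
a, b = 189, 11034 - 189   # placebo: heart attack, no heart attack
c, d = 104, 11037 - 104   # aspirin: heart attack, no heart attack

# Odds ratio of a heart attack, placebo vs aspirin
odds_ratio = (a / b) / (c / d)

# R^2 for a 2x2 table is the squared phi coefficient
phi = (a * d - b * c) / math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
r_squared = phi ** 2

print(round(odds_ratio, 2), round(r_squared, 4))  # → 1.83 0.0011
```

So a relationship that explains about a tenth of one percent of the variance was still strong enough to stop a trial on ethical grounds.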
Your argument is strengthened if you are testing a hypothesized relationship rather than going on a fishing expedition, and if you have tried to take account of theoretically relevant confounders. Epidemiology has moved to some extent from 'what are the causes of this outcome?' to 'does this potential cause have an effect?'.
I would also add that if you are modeling binary (0 and 1) outcomes, it is exceedingly difficult to achieve high R2 values, because the predicted probabilities are very unlikely to be exactly 1 and 0!
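A quick simulation illustrates the point: even when we score the outcomes with the true probabilities that generated them (the best any model could possibly do), the squared correlation with the 0/1 outcome stays far below 1. This is a sketch with made-up parameters, not a claim about any particular dataset:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

x = rng.standard_normal(n)
p = 1 / (1 + np.exp(-2 * x))   # true probabilities (logistic, slope 2)
y = rng.binomial(1, p)         # observed 0/1 outcomes

# R^2 of the *true* probabilities against the binary outcome
r2 = np.corrcoef(p, y)[0, 1] ** 2
print(round(r2, 2))            # stays well below 1 despite a perfect model
```

The gap is the irreducible Bernoulli noise around each probability, which no predictor can absorb.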
Finally, we have to accept that there are outcomes where chance genuinely plays a large part - we now have evidence, for example, that luck plays a bigger part in some cancers than genes and lifestyle.
Such a model will not capture the essence of the study; I would rather say that this sort of relationship should not be reported as it stands. I would prefer to recheck the variables (especially the dependent one) or the model form (it may be non-linear, if you are sure a relationship exists).
Kelvyn's observations are worth looking into. It may also be worthwhile to account for any hierarchical structure in the data (if there is one), treating the various nesting levels as random effects. Such a regression model, with the X variables treated as fixed-effect terms, may be fitted using REML. While this may or may not affect R-sq, it will provide a regression model that the data "actually" support. Depending on the purpose of the model, R-sq may or may not be the right statistic for judging its suitability. For example, if the purpose is to use the model for "prediction" (not explanation), you could try some kind of cross-validation to measure the predictive accuracy of the model in terms of prediction error.
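A minimal sketch of that cross-validation idea, using plain NumPy least squares on simulated data (the coefficients, noise level and fold count here are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 250, 3
X = np.column_stack([np.ones(n), rng.standard_normal((n, k))])
y = X @ np.array([1.0, 0.5, -0.3, 0.2]) + rng.standard_normal(n)

# 5-fold cross-validated root-mean-squared prediction error
folds = np.array_split(rng.permutation(n), 5)
sq_errors = []
for test_idx in folds:
    train_idx = np.setdiff1d(np.arange(n), test_idx)
    beta, *_ = np.linalg.lstsq(X[train_idx], y[train_idx], rcond=None)
    resid = y[test_idx] - X[test_idx] @ beta
    sq_errors.append(resid ** 2)

cv_rmse = np.sqrt(np.concatenate(sq_errors).mean())
print(round(cv_rmse, 2))   # should sit near the noise SD of 1
```

Unlike R-sq, the cross-validated error is in the units of the outcome, which makes "is this model useful for prediction?" a much more concrete question.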
I agree with the previous scholars - you might need to re-evaluate the research model based on an additional literature review. The following are guidelines for R-squared (R2) values:
1) Falk and Miller (1992) recommended that R2 values should be equal to or greater than 0.10 for the variance explained of a particular endogenous construct to be deemed adequate.
2) Hair et al. (2011) and Hair et al. (2013) suggested that in scholarly research focusing on marketing issues, R2 values of 0.75, 0.50 and 0.25 for endogenous latent variables can, as a rough rule of thumb, be described as substantial, moderate and weak respectively.
If the basic objective is to examine the effect of one or two variables on another variable, one has to look at the sign and statistical significance of the explanatory variables - a low R square may not matter much; what matters is this significance. For a fully specified model the objective is to predict the behaviour of the dependent variable on the basis of the explanatory variables - here a poor R square means that the explanatory power of the model is very low: many explanatory variables have been left out of the analysis and/or the model is mis-specified.
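The distinction is easy to demonstrate: with a large sample, a tiny effect can be highly significant while R square stays negligible. A sketch with simulated data (the slope, noise level and sample size are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5_000
x = rng.standard_normal(n)
y = 0.1 * x + rng.standard_normal(n)   # true slope 0.1, noise SD 1

# Simple OLS slope, its standard error, t statistic, and R^2
xc, yc = x - x.mean(), y - y.mean()
beta = (xc @ yc) / (xc @ xc)
resid = yc - beta * xc
se = np.sqrt(resid @ resid / (n - 2) / (xc @ xc))
t_stat = beta / se
r2 = 1 - (resid @ resid) / (yc @ yc)

print(round(t_stat, 1), round(r2, 3))  # very large t, R^2 near 0.01
```

The slope is estimated many standard errors away from zero, yet the model explains only about one percent of the variance - both statements are true at once.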
Thank you for your insights, especially Kelvyn Jones.
A statistician advised me that in order to increase the R square I should run a correlation analysis between the items of the two variables, remove the items with weak correlations, and then run the test again. The R square should then increase substantially.
But I prefer to keep the items as they are and will give a justification, thanks to your help.
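For reference, the screening that statistician described boils down to an item-total correlation check. The sketch below uses simulated items - four loading on a common factor plus one deliberately pure-noise item - so every number here is hypothetical; whether dropping items this way is defensible is a separate question:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 250
factor = rng.standard_normal(n)

# Four items load on the common factor; item 5 (index 4) is pure noise
items = np.column_stack(
    [factor + 0.5 * rng.standard_normal(n) for _ in range(4)]
    + [rng.standard_normal(n)]
)

total = items.sum(axis=1)
item_total_r = [np.corrcoef(items[:, j], total)[0, 1] for j in range(5)]
weakest = int(np.argmin(item_total_r))
print(weakest)   # the noise item (index 4) shows the weakest correlation
```

Note that selecting items to maximize R square on the same sample capitalizes on chance; any such screening should ideally be validated on fresh data.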
Gary King has a nice rant about the limits of R2 as a guide to assessing model quality:
King, Gary. "How not to lie with statistics: Avoiding common mistakes in quantitative political science." American Journal of Political Science (1986): 666-687.
If you are using a binary/ordinal model and relying on a pseudo-R2, the measure is even worse. At last check, Stata was discouraging people from reporting the pseudo-R2 statistic. There are better ways to evaluate model fit for binary/ordinal models (e.g. separation plots):
Greenhill, Brian, Michael D. Ward, and Audrey Sacks. "The separation plot: a new visual method for evaluating the fit of binary models." American Journal of Political Science 55.4 (2011): 991-1002.
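The idea behind a separation plot can be sketched numerically: sort the observations by fitted probability and check whether the observed 1s pile up at the high end of the ordering. This toy version uses simulated fitted probabilities and summarizes the separation with the mean rank of the events (all names and numbers are illustrative, not the authors' implementation):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 1_000
p_hat = rng.uniform(size=n)      # stand-in for fitted probabilities
y = rng.binomial(1, p_hat)       # outcomes consistent with the fit

order = np.argsort(p_hat)        # the separation plot's left-to-right ordering
sorted_y = y[order]

# For a well-calibrated fit, events concentrate at high fitted probabilities
mean_event_rank = np.mean(np.nonzero(sorted_y)[0]) / n
print(round(mean_event_rank, 2)) # > 0.5 means events sit to the right
```

The published version plots the sorted outcomes as colored stripes, which makes the same concentration visible at a glance.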
I would say an r-squared of 5% is pretty low, even in a social science context, but your underlying theory and assumptions should drive the model specification and selection. Keep in mind that r-squared is just one of several things you must pay attention to when judging a model's fit to the data.
To some degree, this is an issue of statistical versus substantive significance. A key point about statistical significance for R-sq is that it depends on sample size: even a very low R-sq can be statistically significant if the N is large enough.
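That dependence on N shows up directly in the overall F test for R-sq. Below, the same R-sq of 0.05 with one predictor is tested at two sample sizes; the 5% critical values quoted in the comments are approximate textbook figures, used here only for comparison:

```python
def f_stat(r2, n, k):
    """Overall F statistic for a regression with k predictors and n cases."""
    return (r2 / k) / ((1 - r2) / (n - k - 1))

# Same R-sq = 0.05, one predictor, two sample sizes
f_large = f_stat(0.05, n=250, k=1)   # ≈ 13.1, far above the ~3.9 critical value
f_small = f_stat(0.05, n=30, k=1)    # ≈ 1.5, below the ~4.2 critical value

print(round(f_large, 1), round(f_small, 1))
```

With N = 250 (the questioner's sample size), an R-sq of 0.05 is comfortably significant; with N = 30 the identical R-sq would not be.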
By comparison, substantive significance is always somewhat subjective. Han Ping Fung reports on some typical standards for substantive significance. The basic argument here is that if you fail to explain any variance, then something is wrong with your theoretical model or your measures -- or the concept you are trying to explain really is random and nothing will explain it.
I suspect the statistician you consulted was suggesting that something is wrong with your measures and that you should look at those variables more closely. In particular, if they are scales that you constructed, something might have gone wrong in that process. Given how frequently that does happen, I would "up vote" the suggestion to look more closely at the correlations between the individual items.
Are you looking at continuous data? Scatterplots can give you great insight into your data and be far more helpful than R-square. I'd also suggest looking into the "variance of the prediction error" and, especially for simple linear regression, the standard error of the estimated slope. These would be more informative.
Are you using multiple regression? Then adjusted R-square is better, but it is still not a great measure; various factors can influence it.
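The adjustment matters because plain R-square never decreases when you add predictors, even pure noise ones. A small simulation contrasting the two (sample size, seed and the 20 junk predictors are arbitrary choices for illustration):

```python
import numpy as np

def r2_and_adjusted(X, y):
    """R^2 and adjusted R^2 for an OLS fit with intercept."""
    X1 = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    r2 = 1 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))
    n, k = X1.shape[0], X1.shape[1] - 1
    adj = 1 - (1 - r2) * (n - 1) / (n - k - 1)
    return r2, adj

rng = np.random.default_rng(5)
n = 100
x = rng.standard_normal(n)
y = 0.5 * x + rng.standard_normal(n)

junk = rng.standard_normal((n, 20))   # 20 irrelevant predictors
r2_base, adj_base = r2_and_adjusted(x[:, None], y)
r2_junk, adj_junk = r2_and_adjusted(np.column_stack([x[:, None], junk]), y)

# Plain R^2 rises with the junk predictors; the adjusted version is penalized
print(r2_junk > r2_base, adj_junk < r2_junk)
```

The penalty term (n - 1)/(n - k - 1) is what keeps adjusted R-square honest about model size, though it still inherits the other limitations discussed above.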
Sure, I agree with the above answers. Further, you can consider the significance of the relationship between the variables. It depends on your assumptions and on practical issues. I have also had an experience like this.