Using GLM (or OLS) with a categorical predictor variable (15 countries), how do I compare with the grand mean instead of using a reference category?

Oliver Perra

Hello,

I am not entirely sure this answers your question, but in Stata you have the option to run a regression without the intercept, which then allows you to use all levels of the predictor variable:

http://www.ats.ucla.edu/stat/stata/library/anova_comp.htm#1regnoint

Harald Lang

Hi!

Oliver's idea in more detail:

Demean the dependent variable, i.e. subtract its grand mean, such that their sum equals 1.

Then run your regression without an intercept and with a dummy for each country. Now the coefficients measure each country's difference from the grand mean.

Cheers -- Harald
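
A minimal sketch of the recipe above, in Python with NumPy rather than Stata or SPSS. The three countries and their values are made-up toy data, and the grand mean used here is the plain mean over all observations (so the demeaned values sum to zero, as the correction further down the thread notes).

```python
import numpy as np

# Hypothetical toy data: 3 countries with unequal sample sizes.
country = np.array(["A", "A", "A", "A", "B", "B", "C", "C", "C"])
y = np.array([1.0, 2.0, 3.0, 2.0, 5.0, 7.0, 9.0, 8.0, 10.0])

# Step 1: demean the outcome with the observation-level grand mean,
# so that the demeaned values sum to zero.
y_demeaned = y - y.mean()

# Step 2: build one dummy column per country and NO intercept column.
levels = np.unique(country)
X = (country[:, None] == levels[None, :]).astype(float)

# Step 3: ordinary least squares without an intercept.
coef, *_ = np.linalg.lstsq(X, y_demeaned, rcond=None)

# Each coefficient is that country's mean minus the grand mean.
for name, b in zip(levels, coef):
    print(f"{name}: {b:+.3f}")
```

With a single categorical predictor, each coefficient is just the country mean of the demeaned outcome, so the regression mainly adds standard errors and tests.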

Jochen Wilhelm

Or simply use "sum-to-zero" contrast coding.

http://statsmodels.sourceforge.net/devel/contrasts.html

http://faculty.nps.edu/sebuttre/home/R/contrasts.html

http://en.wikipedia.org/wiki/Contrast_%28statistics%29

http://www.ats.ucla.edu/stat/r/modules/dummy_vars.htm

http://www.stat.cmu.edu/~hseltman/309/Book/chapter13.pdf
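
A rough sketch of what sum-to-zero (deviation) coding does, built by hand with NumPy so the design matrix is visible. The data are the same kind of made-up toy values as before, and the variable names are illustrative only; statistical packages build this matrix for you.

```python
import numpy as np

# Hypothetical toy data: 3 countries with unequal sample sizes.
country = np.array(["A", "A", "A", "A", "B", "B", "C", "C", "C"])
y = np.array([1.0, 2.0, 3.0, 2.0, 5.0, 7.0, 9.0, 8.0, 10.0])
levels = np.unique(country)
k = len(levels)

# Sum-to-zero coding: k-1 columns; the last level is coded -1 in every column.
contrast = np.vstack([np.eye(k - 1), -np.ones(k - 1)])  # shape (k, k-1)
idx = np.searchsorted(levels, country)                   # level index per row
X = np.column_stack([np.ones(len(y)), contrast[idx]])    # intercept + contrasts

beta, *_ = np.linalg.lstsq(X, y, rcond=None)
intercept = beta[0]

# The effect of the last level is not estimated directly; it follows
# from the constraint that all effects sum to zero.
effects = np.append(beta[1:], -beta[1:].sum())

print("intercept:", round(intercept, 4))
for name, e in zip(levels, effects):
    print(f"{name}: {e:+.4f}")
```

In this one-predictor sketch the intercept equals the unweighted mean of the country means, so each effect is a deviation from that value rather than from the mean over all observations.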

Emilio José Chaves

Thomas, I imagine that you have population data for each country. This gives you 15 subgroups, from which you get each country's mean and the grand mean. First examine how your main variable is distributed across the total population of the 15 nations. Then do the same for each nation. You can then compare each national mean with the grand mean. emilio

Correction of my previous answer:

". . . such that their sum equals zero"

A comment on Jochen's answer:

The "sum-to-zero" model requires that you have the same number of observations for each country. Otherwise you will not be comparing with the grand mean.

Hi. I differ from Harald. We only need to know each country's fraction of the total population; it does not matter whether the countries' data come from samples of different size N. We only need to weight each nation's mean by its fraction of the population of the 15 countries. Thanks, emilio

Thomas Hansen

Thanks for the helpful advice! The best option seems to be to demean the dependent variable. However, I then need to weight for the large differences in sample size, otherwise the grand mean will be too heavily influenced by a few countries. I am not sure how to do this weighting in SPSS, but I should be able to figure it out :) Cheers guys!

I agree with Emilio. What I was trying to say was that if you restrict the coefficients to "sum to zero" (as is often done, a.f.a.i.k.), then this is not equivalent to comparing with the grand mean unless you have an equal number of observations for each country, rather you will compare with a weighted mean, the weights being proportional to the number of observations.

I think we are in agreement?

On second thought, the right answer on the issue discussed by Emilio and me depends on the definition of "grand mean". Does it mean that each country gets the same weight, or that each observation gets the same weight?
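
To make the two definitions concrete, here is a small sketch with made-up samples of unequal size (the country labels and values are hypothetical). It computes the grand mean once with equal weight per observation and once with equal weight per country.

```python
from statistics import mean

# Hypothetical samples with unequal sizes.
samples = {
    "A": [1.0, 2.0, 3.0, 2.0],
    "B": [5.0, 7.0],
    "C": [9.0, 8.0, 10.0],
}

country_means = {c: mean(v) for c, v in samples.items()}

# Definition 1: every observation gets the same weight
# (large samples dominate).
grand_mean_obs = mean([x for v in samples.values() for x in v])

# Definition 2: every country gets the same weight
# (the mean of the country means).
grand_mean_country = mean(list(country_means.values()))

for c, m in country_means.items():
    print(c, round(m - grand_mean_obs, 3), round(m - grand_mean_country, 3))
```

With balanced samples the two values coincide; with unbalanced samples they differ, and so do the deviations reported for each country.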

Yuanzhang Li

I have a somewhat different opinion. In statistics, when we compare two or more means from different groups, those groups should be mutually exclusive: no subject should belong to two different groups. When you compare the mean of one country with the mean of all countries (the grand mean), the subjects in that country are a subset of the whole population. The two groups are no longer independent.

If you really want to do so, you should treat all countries together as the whole population and each country as a sample. The grand mean is then a known (true) parameter with no standard error. Then use a one-sample t-test or a normal (z) test to test whether the mean of a given country equals the population mean (grand mean).

You can use the means option or lsmean (adjusted mean) to get an output data set with the mean, standard deviation, and sample size or standard error. Then a few simple statements will calculate the t or z and p values.

In addition, pairwise comparisons (say, using means/) are more meaningful than comparisons with the grand mean.

You may use ODS output to get those estimations.
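
The same calculation can be sketched outside SAS. The function below is a hypothetical helper, not a SAS or SPSS routine: it treats the grand mean as a fixed, known value and returns the test statistic together with a two-sided p-value from the normal approximation. The sample values are made up.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def one_sample_test(values, grand_mean):
    """Return (statistic, two-sided p-value from the normal approximation)."""
    n = len(values)
    se = stdev(values) / sqrt(n)               # standard error of the country mean
    stat = (mean(values) - grand_mean) / se    # distance from the fixed grand mean
    p = 2 * (1 - NormalDist().cdf(abs(stat)))  # z-based p-value (an approximation)
    return stat, p


# Hypothetical sample for one country and a hypothetical grand mean.
sample = [9.0, 8.0, 10.0, 9.5, 8.5, 9.0]
t_stat, p_value = one_sample_test(sample, grand_mean=5.2)
print(round(t_stat, 2), p_value)
```

For small samples the t distribution with n - 1 degrees of freedom is the better reference; `scipy.stats.ttest_1samp` does that directly if SciPy is available.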

Harald, I understand "grand mean" as the weighted average of all nations with respect to their populations. If one nation has 2 million people and the 15 nations together have 100 million, then its fraction is 0.02. Thomas is concerned that the "grand mean will be too heavily influenced by a few countries", but that is normal when some nations have a very high mean of the variable and a large fraction of the total population. I assume that the samples, though different in size, are representative of each nation, so each national mean should be well estimated. Thanks, emilio
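
A short sketch of this population-weighted definition. The populations and country means are invented for illustration; only the 2-million-out-of-100-million fraction mirrors the example in the post.

```python
# Hypothetical populations (in millions) and hypothetical country means.
populations = {"A": 2.0, "B": 38.0, "C": 60.0}
country_means = {"A": 4.0, "B": 5.0, "C": 6.0}

total = sum(populations.values())
fractions = {c: p / total for c, p in populations.items()}

# Population-weighted grand mean: each national mean is weighted by
# that nation's share of the combined population.
grand_mean = sum(fractions[c] * country_means[c] for c in populations)

# Each country's deviation from the population-weighted grand mean.
deviations = {c: country_means[c] - grand_mean for c in populations}
print(fractions["A"], round(grand_mean, 2), deviations)
```

Note that the sample sizes do not appear anywhere: under this definition they affect only how precisely each national mean is estimated.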

If you are interested, see table 3 of this paper:

http://www.statistica.unimib.it/utenti/rimoldi/DEMOGRAFIA%20REGIONALE/PTS1_1314/Special%20Issue%20Loneliness%20Article.2011.pdf

The paper does not specify what is meant by "grand mean", but I speculate that it simply refers to the mean level of loneliness across all 12,248 individuals. The paper includes data from 14 countries, with N ranging from about 300 to 1100.

If this is the grand mean for all 12,248, then when comparing with it some countries should have a negative difference and some a positive one. All differences in Table 3 are positive for the 14 countries.

Remember this is LOGISTIC regression, so 1 = no change. So 1 = grand mean. Unless I misunderstand.

So, they are ORs.

I think they are coefficients and not ORs.

If you use SAS logistic regression, the default design matrix uses the 1, -1 (or 1, 0, -1) coding system. It will show 13 parameters; the 14th equals (sum of the 13 parameters) × -1, i.e. the sum of all 14 is zero.

If country is the only predictor, each parameter is approximately, but not exactly, a comparison with the grand mean of all 12,248. In multiple logistic regression it is not; it is a comparison with an adjusted grand mean (exp(intercept + mean effect of the continuous factors + 0 for the categorical factors)).

Again, we do not usually report exp(parameter of a country) as the OR for that country versus the grand mean. We need to report exp(parameter of country i - parameter of country j), etc.
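
As a sketch of the last two points, the snippet below recovers the omitted parameter from the sum-to-zero constraint and then forms a pairwise odds ratio. The coefficients are invented log-odds values for four hypothetical countries, not estimates from the paper.

```python
from math import exp

# Hypothetical sum-to-zero coefficients on the log-odds scale for all
# but the last country (the software reports only these).
estimated = {"A": 0.40, "B": -0.10, "C": 0.25}
last = "D"

# The omitted country's coefficient is minus the sum of the others,
# so that all coefficients sum to zero.
coef = dict(estimated)
coef[last] = -sum(estimated.values())


def odds_ratio(i, j):
    """Odds ratio of country i versus country j: exp(b_i - b_j)."""
    return exp(coef[i] - coef[j])


print(round(coef[last], 2))
print(round(odds_ratio("A", "B"), 3))
```

Because the pairwise OR depends only on the difference of two coefficients, it does not change with the choice of reference or coding scheme.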