How to determine which cell is contributing to significance on a chi-square test?

Jochen Wilhelm

The chi² statistic is the sum of (observed-expected)²/expected over all cells. You can plot a heatmap of these per-cell values. This gives you a good impression of which cells contribute most to the chi² value.
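As a minimal sketch of this idea, the per-cell contributions can be computed directly. The counts below are the 4 x 2 table entered in the SPSS code posted later in this thread; the function name `chi2_contributions` is made up for illustration, and only the Python standard library is used.

```python
def chi2_contributions(table):
    """Return (expected, contributions, chi2) for a two-way table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    # Expected count under independence: row total * column total / N
    expected = [[r * c / n for c in col_totals] for r in row_totals]
    # Per-cell contribution to chi-square: (O - E)^2 / E
    contrib = [[(o - e) ** 2 / e for o, e in zip(obs_row, exp_row)]
               for obs_row, exp_row in zip(table, expected)]
    chi2 = sum(sum(row) for row in contrib)
    return expected, contrib, chi2


# Counts taken from the SPSS example in this thread (rows r = 1..4, columns c = 1..2).
observed = [[26, 31], [12, 32], [27, 18], [8, 7]]
expected, contrib, chi2 = chi2_contributions(observed)

for i, row in enumerate(contrib, start=1):
    print(f"row {i}: " + "  ".join(f"{v:6.3f}" for v in row))
print(f"chi-square = {chi2:.2f}")
```

The printed grid is what a heatmap would display: the larger a cell's value, the more that cell adds to the overall statistic.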

Sal Mangiafico

A slightly modified approach to the one Jochen Wilhelm describes is to use the adjusted standardized residuals (ASR) from the analysis. These are based on the calculation for (observed - expected)/sqrt(expected), but they are adjusted for the row and column totals. You can find the formula for these easily online†, and software packages often produce them.**

The advantage here is that the results are on a scale similar to that of z-scores, and so are relatively easy to interpret. That is, an ASR > 1.96 or < -1.96 suggests that the cell is contributing to the effect, and an ASR > 2.58 or < -2.58 suggests that it is contributing more strongly. You might think of these as analogous to z-scores, so that these two cut-offs are analogous to alpha levels of 0.05 and 0.01, respectively.

† e.g. https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1269&context=pare

** Some software may report the unadjusted standardized residuals, (observed - expected)/sqrt(expected). It's not always obvious from software documentation if the reported values are the adjusted or unadjusted standardized residuals. EDIT: From the discussion below, SPSS reports the unadjusted standardized residuals by default, but you can request the adjusted.
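To make the difference between the two kinds of residual concrete, here is a small sketch that computes both for the 4 x 2 table from the SPSS example in this thread. It assumes the usual Haberman form of the adjustment, (observed - expected)/sqrt(expected × (1 - row total/N) × (1 - column total/N)); the function name `standardized_residuals` is hypothetical, and only the Python standard library is used.

```python
import math


def standardized_residuals(table):
    """Return (unadjusted, adjusted) standardized residuals for a two-way table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    unadjusted, adjusted = [], []
    for obs_row, r in zip(table, row_totals):
        sr_row, asr_row = [], []
        for o, c in zip(obs_row, col_totals):
            e = r * c / n  # expected count under independence
            # Unadjusted: (O - E) / sqrt(E)
            sr_row.append((o - e) / math.sqrt(e))
            # Adjusted: also divide by sqrt((1 - r/N) * (1 - c/N))
            asr_row.append((o - e) / math.sqrt(e * (1 - r / n) * (1 - c / n)))
        unadjusted.append(sr_row)
        adjusted.append(asr_row)
    return unadjusted, adjusted


# Counts taken from the SPSS example in this thread (rows r = 1..4, columns c = 1..2).
observed = [[26, 31], [12, 32], [27, 18], [8, 7]]
sr, asr = standardized_residuals(observed)

for i, (s_row, a_row) in enumerate(zip(sr, asr), start=1):
    s = "  ".join(f"{v:6.2f}" for v in s_row)
    a = "  ".join(f"{v:6.2f}" for v in a_row)
    print(f"row {i}: unadjusted [{s}]   adjusted [{a}]")
```

For row 2 the adjusted values come out at about -2.8 and 2.8, while the unadjusted values stay below 1.96 in absolute value, so which residual your software reports changes the conclusion.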

Mika Mika

Sal Mangiafico, how do I interpret this? The Pearson chi-square test is significant (p = 0.018), but the standardized residual for blood group b is less than 1.96/2. What does this mean?

Mika Mika,

You don't need to divide that z-like value by 2. For alpha = 0.05, z for alpha/2 = 1.96.

EDIT: From the discussion below, SPSS reports the unadjusted standardized residuals by default, but you can request the adjusted.

EDIT: If you want to use the unadjusted standardized residuals, there are a couple of options. You could go with a cut-off analogous to an alpha of 0.10 for this approach: z for alpha/2 = 0.05 is 1.65. Only yes-b would meet this criterion. ... What I would probably do, though, is use no specific criterion and just note that it's b and o that have relatively large standardized residuals (say, > 1, or ≥ 1.3). (A z of 1.3 corresponds to a two-sided alpha of a little under 0.2.) It's really these four cells that are driving the significant difference in counts from the expected. ... Also notice the sign of the residuals: the count for b-yes is less than expected, while the count for o-yes is more than expected.

Bruce Weaver

Sal Mangiafico is right. SPSS will report 3 different residuals if you ask for them:

RESID. Residuals. Residuals are the difference between the observed and expected cell counts.

SRESID. Standardized residuals.

ASRESID. Adjusted standardized residuals (Haberman, 1978).

Source: https://www.ibm.com/docs/en/spss-statistics/26.0.0?topic=crosstabs-cells-subcommand-command

Code for the table shown above is provided below. HTH.

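NEW FILE.
DATASET CLOSE ALL.
* One record per cell: r = row, c = column, n = cell count.
DATA LIST LIST / r c n (3F5.0).
BEGIN DATA
1 1 26
1 2 31
2 1 12
2 2 32
3 1 27
3 2 18
4 1 8
4 2 7
END DATA.
* Weight by n so that each record counts as n observations.
WEIGHT BY n.
* Request counts, expected counts, and all three types of residual.
CROSSTABS r BY c
  /STATISTICS=CHISQ
  /CELLS=COUNT EXPECTED RESID SRESID ASRESID.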

Okay, so the residuals in the results posted by Mika Mika are the unadjusted standardized residuals. If you calculate the adjusted standardized residuals (as per the paper linked in my previous response), the values will be larger in magnitude. Specifically, the absolute values for b will be > 2.58 and those for o will be > 1.96. EDIT: Bruce Weaver has posted SPSS code to request the adjusted standardized residuals.

Hi Sal. My table using SPSS 26 is attached.

The details about how SPSS computes the various residuals can be found in the Algorithms manual. Go to the page linked below and do a Ctrl-F search to find the PDF.

https://www.ibm.com/support/pages/node/874712

The relevant portion of the documentation is shown in the second attached PNG file.

HTH.

Thanks, Bruce Weaver . Those are the results I am getting with R and SAS, which apparently use the same calculation. The free PSPP software I was using was giving totally different results. I've used that software like twice, and it always gives me guff. EDIT: I did submit a bug report to them. I've never used SPSS much, but I do support the effort to produce a free product which mimics the basic analyses in SPSS.

Sal Mangiafico why did you divide alpha by 2? The residuals are standardized. Should I check the "adjusted" option in the residuals section? Bruce Weaver

Hi, Mika Mika ... Yes, check Adjusted Standardized, and see if that returns the same values that Bruce Weaver included (e.g. the ASRs for b are -2.8 and 2.8). ... You divide alpha by two because you are conducting something analogous to a two-sided hypothesis test. That is, before you collected the data, you didn't know that b-yes would be lower than expected and that b-no would be higher than expected. So, by analogy with a hypothesis test, for alpha = 0.05, you are comparing to a 0.025 probability of getting a value that extreme on the high end and a 0.025 probability of getting a value that extreme on the low end. (Both under the assumption that the null hypothesis is true.) This figure may help: https://www.jahjournal.org/viewimage.asp?img=JApplHematol_2014_5_1_27_131823_u1.jpg
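The link between alpha and the z cut-offs used in this thread can be checked with a short sketch. It uses `statistics.NormalDist` from the Python standard library; the helper names `two_sided_critical_z` and `two_sided_p` are made up for illustration.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def two_sided_critical_z(alpha):
    """z cut-off that leaves alpha/2 in each tail of the standard normal."""
    return STD_NORMAL.inv_cdf(1 - alpha / 2)


def two_sided_p(z):
    """Probability of a standard normal value at least as extreme as z, in either tail."""
    return 2 * (1 - STD_NORMAL.cdf(abs(z)))


for alpha in (0.10, 0.05, 0.01):
    print(f"alpha = {alpha:.2f}  ->  |z| cut-off = {two_sided_critical_z(alpha):.3f}")

# Going the other way: the two-sided tail probability for a given residual.
for z in (1.3, 1.96, 2.8):
    print(f"|z| = {z:.2f}  ->  two-sided p = {two_sided_p(z):.3f}")
```

Splitting alpha across both tails is what turns alpha = 0.05 into the familiar 1.96 rather than the one-sided 1.645.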

A.Ömer Toprak

The cell which has the greatest (O-E) squared.

Sal Mangiafico Hello, thank you so much.

Thanks Omer.