Dear colleages,

Eta squared is dominantly used in reflecting the explaining power the same manner as a squared partial correlation coefficient (R2p ) from multiple linear regression: as the proportion of remaining variance in the dependent variable. In the binary case, eta equals product-moment correlation coefficient (PMC), and PMC is known to underestimate the true correlation in an obvious manner. Hence, consequently, eta squared seems to understimate the true explaining power in an obvious manner.

I have studied some options as alternatives for PMC. Goodman - Kruskal gamma (G) seems quite an appealing alternative in the binary case: it reaches correctly the extreme values 1, 0, and -1, and it is robust against many sources of systematic mechanical errors in the calculation process (such as proportions of 1s and 0s, discrepancy between the scales of the variables, and the distribution of the latent variables). Because G seems to be an appealing alternative to PMC (and eta), maybe G squared could be an option for eta squared?

My question is: how should we interprete G squared? Because G reflects strictly the probability rather than covariance, it may not be interpreted as the proportion of remaining VARIANCE in the dependent variable. But what could it reflect? Ideas?

More Jari Metsämuuronen's questions See All
Similar questions and discussions