How do covariates inform ability estimates in psychometric models?

10 October 2014 3 8K Report

As my question may sound a bit bulky, please consider the following example:

You want to estimate the latent abilities of p participants who have answered i items in a questionnaire. The individual answers to the different items can be ascribed with the following model:

Answer_i,p = u_i + u_p; u_i ~ Normal(0, sd) and u_p ~ Normal(0, sd)

where u_i and u_p are randomly varying parameter manifestations of item difficulty and participant ability, respectively.

Suppose you expect (from previous studies) that participant ability is driven by some covariate C that you can assess and/or experimentally manipulate. A regression of u_p on C does not suggest any significant association between participant ability and this covariate, although the effect (beta) points numerically to the expected direction.

Therefore you try to inform your psychometric model by including a (fully standardized) covariate C as a second level predictor of participant ability:

Answer_i,p = u_i + u_p; u_i ~ Normal(0, sd), u_p ~ Normal(beta*C, sd)

Surprisingly, the beta of C now differs significantly from zero, as the estimates of participant ability have changed. Thus the estimates of participant ability become informed by a covariate, which did not have any predictive value before.

I repeatedly encountered this phenomenon. Nonetheless I must admit, that it remains quite elusive to me. Moreover this seems to introduce some kind of arbitrariness in the interpretation of predictors for latent abilities. Is the covariate indeed informative with regard to participant ability or is it not?

Does anyone have a solution to this issue? Proably such situations should be dealt with by appropriate model comparision strategies (i.e., one should refrain from interpreting estimates of such nested effects in isolation)?

Daniel Wright

I am not sure I understand the problem. C is related to both answer and u_p. The two models are:

logit(answer_ip) = beta0 + u_i + u_p,

logit(answer_ip) = beta0 + beta1 C_p + u_i + u_p,

C_p accounts for some of the variation in answering for the people, and therefore u_p changes. Is the question about the meaning of u_p changing depending on what other variables are included in the model?

Robert Miller

Thank you, Daniel. That's exactely what I meant (if you identify u_p in the lower model by participant ability, i.e., u_p + beta1*C_p).

u_p in the upper model represents the estimate of participant ability that is supposed to comprise the variance bound by C_p. A subsequent regression of the conditional means of u_p on C_p results in a weaker beta1 as compared to the lower model. Such two-step procedures seem to be quite common in personality psychology.

In the lower model u_p (+ beta1 C_p) increases the dispersion of participant ability (u_p in in the upper model) while preserving its rank order, which finally results in a larger estimate of beta1. This result can be confirmed by applying a subsequent regression as outlined above (which is kind of self-evident as u_p = beta1 C_p + residuals by definition)

In consequence, the same covariate may not significantly predict participant ability in the upper model, whereas in the lower model it may actually have a predictive value. Thus both approaches may suggest completely diametral interpretations, although they measure the same quantity (participant ability).

Daniel Wright

I wouldn't worry about statistical significance. In the example you gave, C accounts for some of person ability in the top model, and some of the response accuracy in the bottom. Depending on how the computer has done the estimation, the u_p may differ. They (what statisticians call the condition modes, but which people often refer to as the person residuals) are estimates that for most models borrow information from each other. This means some accuracy values that are very different from the others will be estimated as not as different. They are often called shrunken estimates, and a great paper on it is http://statweb.stanford.edu/~ckirby/brad/other/Article1977.pdf (it has a baseball example, and it is the playoff season here in the US, go Dodgers). If ignoring significance it looks like they tell the same story, that would be fine.

This doesn't get exactly to your question about what to do, but that will depend on your purpose. Do you want to measure ability, or ability after conditioning on something.

How to learn more about SPSS and its Application?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

How are iso-frequency contours plotted?

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

How does grain and grain boundary affect the ceramic when studying its dielectric properties?

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?