Rewriting this with more detail...
Hi!
I ran an experiment in which participants took a test of 16 yes/no (binary) questions.
They were tested at either 7 days or 28 days (two between-subjects groups, RI-7 and RI-28).
Analyzed by subjects, test scores are higher at RI-7.
RI-7: 5.5
RI-28: 3.6
A t-test confirms that this difference is significant: t = 2.171, p = .032
Analyzed by items, RI-7 scores are still higher.
RI-7: 515 incorrect - 269 correct
RI-28: 596 incorrect - 172 correct (more incorrect, fewer correct).
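To make the item-level comparison concrete, here is a small sketch that turns the counts above into proportions correct and an odds ratio. The variable and function names are my own, and the odds ratio is defined here as odds of a correct answer at RI-28 relative to RI-7, which is only one of several possible conventions.

```python
import math

# Item-level counts copied from the post: (incorrect, correct) per group.
item_counts = {"RI-7": (515, 269), "RI-28": (596, 172)}

def summarize(incorrect, correct):
    """Return total responses, proportion correct, and odds of a correct answer."""
    total = incorrect + correct
    return {
        "total": total,
        "prop_correct": correct / total,
        "odds_correct": correct / incorrect,
    }

item_summary = {group: summarize(*c) for group, c in item_counts.items()}

# Odds ratio for a correct answer, RI-28 relative to RI-7 (assumed convention).
raw_odds_ratio = (
    item_summary["RI-28"]["odds_correct"] / item_summary["RI-7"]["odds_correct"]
)
raw_log_odds_ratio = math.log(raw_odds_ratio)

for group, s in item_summary.items():
    print(f"{group}: n = {s['total']}, proportion correct = {s['prop_correct']:.3f}")
print(f"OR (RI-28 vs RI-7) = {raw_odds_ratio:.3f}, log OR = {raw_log_odds_ratio:.3f}")
```

With these counts the unadjusted odds ratio is below 1 and its log is negative, meaning lower odds of a correct answer at RI-28. This ignores the clustering of responses within participants and items, so it is only a descriptive check, not a substitute for the mixed model.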
However, when put into a GLMM (binomial, logit), RI-7 scores come out significantly lower, no matter what I do and which other variables I put in or take out.
An example from one model:
RI effect: F = 28.542, p = ; RI-28 coefficient = -.625, OR = .535, p =
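As a reading aid for that output, here is a minimal sketch of how a logit coefficient maps to an odds ratio, and how the same effect looks if the outcome or reference category is coded the other way round. Which response category the software treats as the modeled event is not stated above, so both directions are shown; the variable names are hypothetical.

```python
import math

# Coefficient for RI-28 as reported in the model output above.
coef_ri28 = -0.625

# On the logit scale, the odds ratio is the exponential of the coefficient.
model_odds_ratio = math.exp(coef_ri28)

# If the modeled event (e.g. "incorrect" instead of "correct") or the
# reference group were flipped, the sign of the coefficient would flip
# and the odds ratio would become its reciprocal.
flipped_odds_ratio = math.exp(-coef_ri28)

print(f"OR as reported: {model_odds_ratio:.3f}")
print(f"OR with reversed coding: {flipped_odds_ratio:.3f}")
```

An odds ratio below 1 for RI-28 means lower odds of whichever outcome the model treats as the event at RI-28 compared with RI-7, so the direction of the effect depends on checking that coding in the software's output.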