I am doing a corpus investigation (in linguistics) and I used a mixed effect analysis with the individual speakers as random intercepts. I have 24 speakers representing about 1800 tokens. In addition I also use the specific words that appear in the test as random intercepts. I have about 280 words, some of which occur only once.

When I run the regression with only the speakers as random intercepts, I have a number of statistically significant independent variables. However, when I include the words, some of these are kicked out. When I draw a contingency tables of the independent variables that are kicked out, the results for the t-test is typically p < 0,05.

So my question is: are these variables not significant or do I have too many random factors?

Similar questions and discussions