Hi all 

This is a question posed out of curiosity more than anything else, but I'd appreciate any answers that helped me confirm my hunch it's a bad idea. 

I have a multilevel dataset with individuals (n=600) nested with neighbourhoods (n=248) nested within counties (n=30). Because there are too few individuals per neighbourhood (mean = 2.4), it is not possible to consider this level in a ML analysis because there are too few individuals per neighbourhood to derive valid inferences. However, I can fit a (less-than-ideal) model of individuals within counties where there are sufficient numbers per county to permit valid inferences, although the data are unbalanced. 

My question is this, if I fit a random intercepts model on my outcome, with individuals nested within counties, can I still include fixed-effects of environmental variables at the "neighbourhood-level"? I think the standard errors for any effect sizes will be too wide, but what other reasons might there be affecting the validity of interpreting fixed effects from a level not specified in the random effects part of the equation? The fixed effects at this level would provide a more accurate characterisation of individuals' exposure to environmental characteristics than the equivalent variables derived (and averaged) over the entire county. 

Thanks in advance for helping me understand this issue. Any directions to reading on this topic are gratefully received. 

Thanks

James

More James B Kirkbride's questions See All
Similar questions and discussions