I've come across this analytical issue a few times in the last couple of months. It's an Aquaculture oriented design and the question pertains to the most appropriate statistical analysis of what is a common issue. There are lots of complexities we could add, but I will make it a simple example.

Question: Is the performance of a larval fish affected by its rearing Temperature (High, Low) and Food (high, low)?

Method: Get parents to breed, pop fertilised eggs (1000) from a number of parents (mixed) randomly into 12 tanks, and then randomly assign the tanks to one of the 4 treatment combinations, with 3 tanks per Temp x Food combination.

Rear for 28 days, and then test the performance of the larvae - let's just say we test maximum swimming speed (our variable).

Analysis.

Now here's the issue.

The problem we have is an obvious non-independence among fish sampled from the same tanks (because they interact and produce social hierarchies). We can sample and assess swimming for multiple fish from the same tank, but all we are doing is getting a better idea of the position of the true mean of the overall population in the tank.

A typical GLM approach would be to simply nest the Tank effect, so we have two fixed effects (Temp, Food), their fixed interaction, and a Tank(Temp*Food) term. This would effectively use the means within the tanks as the replicates to test the main effects, thereby isolating the effects.

The alternative approach that I have seen recently is to use a linear mixed effects model, with Temp and Food as fixed effects, and Tank as a random effect.

To me this latter approach, while it may account for some uniqueness in the Tank, cannot account for the fact that the fish within the tank are interacting, influencing eachothers development and so do not represent independent estimates of what is going on. If you look at the df, it has a similar sensitivity to a GLM with no nested Tank term, which seems overly sensitive given the amount of real information. Maybe I'm just stuck in a GLM mindspace, but it strikes me as a design issue (i.e., telling the LME model that fish are reps when they actually aren't)..

I would appreciate someone much wiser statistically than me commenting on this train of thought.

Thanks

Mark

More Mark Ian Mccormick's questions See All
Similar questions and discussions