The pig sign data (the dependent variable) is repeatedly measured over 4 time periods at 17 separate sites. The habitat variables (the explanatory variables) are time-invariant, but have been measured for each of the 17 sites. I've appropriately reorganized my data to the person-period format. I'm using RStudio to run glmm and lmm analyses, and have included a random intercept of time nested within site and time, respectively. I see that the software will only run the pig sign data from the rows that correspond with the habitat data, which makes the other pig sign data irrelevant (cutting my sample size by 75%, and making my time factor irrelevant). Is it statistically sound to augment my data by repeating the same habitat data for all time periods and run that against the time-varying pig sign data?

I don't see any other way to deal with this in the literature and I sure could use a little assistance with this stats issue.

More Patience Knight's questions See All
Similar questions and discussions