I'm currently trying to familiarize myself with linear mixed models (with normally distributed continuous outcome, no generalized versions for the time being). One particular question I'm currently struggling with is the following:

Given a dataset resulting from measuring both the continuous outcome and a continuous exposure at t1 and t2, a set of time-invariant covariates, and a set of time-varying covariates, the latter again measured at t1 and t2. Covariates might include both quantitative and categorical variables. Assume there are no missing values.

Now, I usually would use an ordinary baseline-adjusted linear regression of the outcome at t2 on all time-invariant covariates, the time-varying covariates at t1, the exposure at t1, and the outcome at t1.

If I am interested in estimating the causal effect of the exposure (and assuming my covariates include all relevant confounding factors), would it make sense to apply a mixed model instead? Is there a mixed model specification where the interpretation of the fixed effect regression coefficients would be similar to that of the ordinary baseline-adjusted model? Knowing this would help me get an understanding of the overlap / differences in what those models actually do. I calculated a random intercept model of the outcome on time and all covariates (assuming compound symmetry, i.e. no heterogeneous variances), however I suppose the coefficients do not represent the same comparison as in the baseline-adjusted model as the model as the mixed model includes additional information (time-varying covariates at t2)?

I'm familiar with the distinction between wide and long datasets and how to transform them so that is not an issue.

I would greatly appreciate any suggestions or hints about pertinent literature sources (specific to this question).

More Hans Bauer's questions See All
Similar questions and discussions