Hello,

I plan to perform a linear mixed model analysis to look at change in cognitive function over 5 follow up waves. I am interested in looking at how this change is influenced by dietary and urinary sodium as well as other covariates like age, gender, blood pressure etc. Dietary and urinary sodium have only been collected at baseline, whilst other covariates have been collected at baseline and at each follow up wave.

I am aware of how to restructure the data set to the "long" format, creating an index variable for Time. However, as mentioned, certain variables have only been collected at baseline and are important as I need to include them in the data set in order to exclude cases. For example, I would like to exclude anyone who has a diagnosis of dementia at baseline and the corresponding variable only has one 1 timepoint (data collected at baseline).

I am looking for advice on the following:

1) How to restructure the data set so these kind of variables fit & make sense within the long format?

2) Then, how can I select cases based only on certain baseline variables and then perform the analysis on the remaining cases, across all waves?

3) Are LMM's the best approach here? Or would you suggest an alternative method.

Thanks.

More Andrea McGrattan's questions See All
Similar questions and discussions