I'm interested in studying the effects of two independent variables (Group and Task) on Dependent Variable X. However, due to unforeseen circumstances I've got baseline differences between the Groups that I'm studying (e.g Visual Acuity). Visual Acuity has 5 levels. FYI: I've got 11 and 15 subjects in each of the group.
Just as a form of control, I intend to include Visual Acuity as a fixed effect to examine if there were any possible interactions. However, my biggest problem is that if I include a Group * Task * Visual Acuity interaction in the model, I'm cautious over fishing of data. Firstly, I do not have a specific hypothesis regarding Group * Task * Visual Acuity interaction. Secondly, because of the smaller sample size and number of levels in Visual Acuity - it appears I'm comparing the data of one participant to a group of 3 in one level for example (i.e. it appears pointless to be performing such analysis?)
I've search other websites and some have suggested that it may not be necessary to include every possible interactions and main effects. I was wondering if it make sense to just include these in my model since I'm neither interested in the main effect of Visual Acuity or even Group * Task* Visual Acuity interactions
(1) Group
(2) Task
(3) Group * Task
(4) Group * Visual Acuity
Additionally, I've tested out two models with REML here:
Model 1: (1), (2) and (3), Model 2: (1), (2), (3), (4)
Model 1 with AIC value at 6303:
(1) significant
(2) significant
(3) significant
Model 2 AIC value 6231:
(1) not significant
(2) significant
(3) significant
(4) not significant
If I include the (4) in model 2, what exactly am I actually doing to my data? Should I choose model 1 or 2? if there's anyone that could enlighten me on the above questions, that would be great!!