I'm interested in assess effects gene-gene and gene-environment interactions on disease susceptibility using binary logistic regression models. All examples I've seen so far on this issue build their models in the following form:

logit = B0 + B1X + B2Y + B3(X*Y)

And I understand it as an estimation of the interaction of X and Y variables weighting their isolated effects in the same model.

But what if I know that X and Y don't have any significant effect in isolation on the outcome, but I'm interested in testing just whether their interaction might have. Is it plausible to build a model just with the interaction term plus the confounding variables (logit = B0 + B1Z + B2(X*Y))?

Thanks in advance!

Similar questions and discussions