Hi biostatisticians,
I’ve created a binary logistic regression model with two continuous independent variables (the percentage of two cell types) to predict disease (disease present/not present) using SPSS. One of the independent variables (variable 1) is an already well described predictor, while the other one is new (variable 2). Conducting a hierarchical logistic regression analysis shows that the new variable (variable 2, percentage of new cell type) significantly improves the model. However, testing for the linearity of the logit (using a logistic model with interaction terms consisting of the variables x the natural logarithm of the variable, as e.g. described by Andy Field’s IBM SPSS Statistics) reveals, that the interaction term has a significance of p=0,033. Therefore, as I understand it, the assumption of the linearity of the logit is not met and the model is not valid and cannot be used this way.
What options do I have to deal with this/to transform the model?
Thanks a lot for your opinions and advice!