The outcome of interest is binary. There are 30 covariates, nearly two-thirds of which are categorical. My primary goal is to build a predictive model. A logistic regression without any interaction terms finds about half of the covariates statistically significant. However, the model does not seem adequate to me, since its misclassification rate is about 20%. I plan to explore other predictive models, but before doing so, I would like to know the best practices for incorporating interaction terms into the model.
Thank you in advance.