I am working on comparing binary logistic models with different subsets of predictors. The difference between areas under the ROC curves is never significant, while the difference in AIC values is greater than 4 for two models with the same number of predictors and the same degrees of freedom.