I am interested in using a split-sample technique to assess the internal validity of a logistic regression model. Many statistics have been used to compare the two groups (training sample and validation sample): MSE, deviance residuals, AUC, R2, adjusted R2, and so on.
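For concreteness, a minimal sketch of such a split-sample comparison might look like the following. It is only an illustration under stated assumptions: the data are simulated, the model is fitted by plain gradient descent rather than a statistics package, and every name (`fit_logistic`, `summarize`, the coefficients used to simulate the outcome) is hypothetical. It computes three of the measures mentioned above on each half: MSE of the predicted probabilities (the Brier score), mean deviance, and AUC.

```python
import math
import random

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def linear(w, xi):
    # Intercept w[0] plus the weighted sum of the predictors.
    return w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))

def fit_logistic(X, y, lr=0.5, epochs=500):
    # Batch gradient descent on the mean log-loss (a stand-in for a
    # proper maximum-likelihood routine; assumption for illustration).
    n, p = len(X), len(X[0])
    w = [0.0] * (p + 1)
    for _ in range(epochs):
        grad = [0.0] * (p + 1)
        for xi, yi in zip(X, y):
            err = sigmoid(linear(w, xi)) - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * g / n for wj, g in zip(w, grad)]
    return w

def predict(w, X):
    return [sigmoid(linear(w, xi)) for xi in X]

def brier(y, p):
    # Mean squared error between outcome (0/1) and predicted probability.
    return sum((pi - yi) ** 2 for yi, pi in zip(y, p)) / len(y)

def mean_deviance(y, p, eps=1e-12):
    # -2 * log-likelihood, divided by n so the two halves are comparable.
    ll = sum(yi * math.log(max(pi, eps)) + (1 - yi) * math.log(max(1 - pi, eps))
             for yi, pi in zip(y, p))
    return -2.0 * ll / len(y)

def auc(y, p):
    # Probability that a random event is ranked above a random non-event
    # (ties count one half); requires both classes to be present.
    pos = [pi for yi, pi in zip(y, p) if yi == 1]
    neg = [pi for yi, pi in zip(y, p) if yi == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))

def summarize(w, X, y):
    p = predict(w, X)
    return {"brier": brier(y, p), "mean_deviance": mean_deviance(y, p), "auc": auc(y, p)}

# Simulated data (hypothetical coefficients), generated in random order,
# so taking the first and second halves amounts to a random split.
random.seed(0)
X, y = [], []
for _ in range(400):
    x1, x2 = random.gauss(0, 1), random.gauss(0, 1)
    p_true = sigmoid(-0.5 + 1.5 * x1 - 1.0 * x2)
    X.append([x1, x2])
    y.append(1 if random.random() < p_true else 0)

X_train, y_train = X[:200], y[:200]
X_valid, y_valid = X[200:], y[200:]

w = fit_logistic(X_train, y_train)          # fit on the training half only
train_stats = summarize(w, X_train, y_train)
valid_stats = summarize(w, X_valid, y_valid)

for name in train_stats:
    print(f"{name:14s} train={train_stats[name]:.3f}  valid={valid_stats[name]:.3f}")
```

The model is fitted once, on the training half, and the same coefficients are then scored on both halves; the gap between the two columns is what the comparison is meant to reveal.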

Two papers that highlight this issue:

Snee, RD. Validation of regression models: Methods and Examples. Technometrics. 1977;19(4):415-28:

http://www.ams.jhu.edu/~castello/400/Handouts/ValidationInRegression.pdf

Steyerberg EW et al. Internal validation of predictive models: Efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001;54:774-81:

http://www.aliquote.org/cours/2012_biomed/biblio/Steyerberg2001.pdf

Do I need to calculate all of them? In your opinion, which of these measures are the most important to report?
