I have a question regarding the confidence intervals (CIs) of an AUROC calculated by merging/pooling the predictions from different test sets.

In one analysis, we used a sort of nested cross-validation approach, although the outer loop is more properly a test loop than a validation loop. The dataset was split into 5 folds, and the whole analysis was replicated five times, each time using four folds as the training set and the remaining fold as the test set. The same technique was applied in each of the five repetitions, with the hyperparameters optimized by an inner-loop cross-validation strategy (which varies among the five repetitions). The algorithm was then used to generate continuous predictions for the cases in the test set. These test predictions were not used to take any decisions or make any comparisons. The predictions of the 5 test sets were then pooled and the AUROC on the total test sample was calculated (a sketch of the procedure is shown below).
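To make the setup concrete, here is a minimal sketch of the pooling procedure as I understand it, assuming hypothetical data arrays X and y and using a logistic regression with an inner GridSearchCV as a stand-in for whatever estimator and inner-loop strategy was actually used:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, GridSearchCV
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# X, y = full dataset (hypothetical placeholders)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
pooled_true, pooled_pred = [], []

for train_idx, test_idx in outer_cv.split(X, y):
    # Inner loop: hyperparameter optimization on the four training folds only
    inner = GridSearchCV(LogisticRegression(max_iter=1000),
                         param_grid={"C": [0.01, 0.1, 1, 10]},
                         cv=5, scoring="roc_auc")
    inner.fit(X[train_idx], y[train_idx])

    # Continuous predictions on the held-out outer test fold
    pooled_pred.append(inner.predict_proba(X[test_idx])[:, 1])
    pooled_true.append(y[test_idx])

# Pool the five test folds and compute a single AUROC
pooled_pred = np.concatenate(pooled_pred)
pooled_true = np.concatenate(pooled_true)
pooled_auroc = roc_auc_score(pooled_true, pooled_pred)
```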

Given the procedure we used, is it correct to calculate the CIs of the pooled AUROC either with the usual asymptotic approach or via a stratified bootstrap that directly resamples from the distribution of the pooled test predictions (sketched below)?
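This is the first option I have in mind, a stratified bootstrap over the pooled predictions themselves (continuing from the hypothetical pooled_true / pooled_pred above; the number of resamples and the percentile CI are just illustrative choices):

```python
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
pos = np.where(pooled_true == 1)[0]
neg = np.where(pooled_true == 0)[0]

boot_aurocs = []
for _ in range(2000):
    # Resample positives and negatives separately to keep the class prevalence fixed
    idx = np.concatenate([rng.choice(pos, size=pos.size, replace=True),
                          rng.choice(neg, size=neg.size, replace=True)])
    boot_aurocs.append(roc_auc_score(pooled_true[idx], pooled_pred[idx]))

# Percentile 95% CI of the pooled AUROC
ci_lower, ci_upper = np.percentile(boot_aurocs, [2.5, 97.5])
```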

Or is the only correct way to calculate the CIs to bootstrap the training sets n times, retrain the algorithms, generate predictions on the test sets with the retrained algorithms, and finally recalculate the AUROC? The CIs would then come from the distribution of AUROCs obtained by repeating this procedure n times (see the sketch below).
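For clarity, this second option would look roughly like the following (again only a sketch under the same hypothetical X, y, outer_cv, and inner-loop setup as above; n = 200 repetitions is arbitrary):

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
splits = list(outer_cv.split(X, y))
boot_aurocs = []

for _ in range(200):  # n bootstrap repetitions of the whole training procedure
    true_all, pred_all = [], []
    for train_idx, test_idx in splits:
        # Bootstrap resample of this repetition's training set, then refit
        boot_idx = rng.choice(train_idx, size=train_idx.size, replace=True)
        model = GridSearchCV(LogisticRegression(max_iter=1000),
                             param_grid={"C": [0.01, 0.1, 1, 10]},
                             cv=5, scoring="roc_auc")
        model.fit(X[boot_idx], y[boot_idx])

        # Re-predict the (unchanged) outer test fold with the refitted model
        pred_all.append(model.predict_proba(X[test_idx])[:, 1])
        true_all.append(y[test_idx])

    # Pooled AUROC for this bootstrap repetition
    boot_aurocs.append(roc_auc_score(np.concatenate(true_all),
                                     np.concatenate(pred_all)))

# CI from the distribution of AUROCs across the n repetitions
ci_lower, ci_upper = np.percentile(boot_aurocs, [2.5, 97.5])
```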
