Dear colleagues, I see clear difference between the training and testing statistics in MDR, but I don't get (and could not find the answer in a bunch of publications - it is not even discussed mostly) the real meaning of the whole dataset statistics? I feel it like it's between training and testing statistics towards all my data. As 9/10 is in training and only 1/10 is in testing in each round of cross-validation, the whole dataset statistics is almost identical to the training statistsics. Yet, it reports the final results of the best-model statistical testing of my data. Thus it also feels like applying ROC-analysis to the whole dataset in order to report on the best-model properties. However, the training and testing statistics are only reported in most MDR-based publications. So what do we actually do with the whole dataset statistics? 

More Peter V. Zolotukhin's questions See All
Similar questions and discussions