Ok. I know how to the extent of statistical inference, when null hypothesis is discarded (data supports discarding, under certain risk...) , some event not random. But my question relates more to the extension of some conclusion to the mayor population.
Lets say we want to infer some conclusion on a very unbalanced population. Lets say a study on Parkinson's D. (PD is estimated to affect 100 -180 people per 100,000 of the population). In order to perform some test, we usually balance the sample.
Isn't this a good example of Dewey Defeats Truman case? There will be A LOT of subjects selectively discarded because of being from the control group.
However if the real group proportions are kept in the sample, even a random predictor would yield very high negative predictive value (yes) and probably truly good estimators for real Specificity and Sensitivity.
Perhaps someone could point me out a good work on this issue.