In the attached paper page 206, the author gave an example when the prior and posterior probabilities can provide invalid probabilities.

The example is as follows

Let us suppose that X1 corresponds to a rare but important target event like, e.g., some medical pathology. We are going to collect the cases supporting this event very accurately. However, we are not going to collect information about all the “healthy” cases as X0. In the medical data sets we can rather expect the 50:50 proportion between positive and negative examples. It does not mean, however, that Pr(X1) should be estimated as 1/2. It is questionable whether the posterior probabilities Pr(X1|E) should be derived from such data using estimation with |E| in denominator – it is simply difficult to accept that |E| is calculated as the non-weighted sum of |E ∩ X0| and |E ∩ X1|.

But really I'm unable to get the idea so can anyone elaborate on this example.

More Ahmed Hamed's questions See All
Similar questions and discussions