Suppose that an urn contains r red balls and w white balls. If n balls are removed from the urn without noticing their colors, then the probability, p, of drawing a red ball from the remaining balls in the urn is the same as the probability of drawing a red ball before removing the n balls; i.e., p = r/(r+w). How can we generalize this or relate it (for example) to information theory?