I'm trying to understand how resampling works and am struggling with some strange behaviour. Here is my case: I take a random set of 10 numbers between 15 and 25 and add 5 numbers between 1 to 5. So we have a total set of 15 numbers between 1 and 25, wherein 5 are low. I then create combinations out of this set. Say I leave out one number of the set and take the 15 resulting combinations as a group, a procedure known as Jackknifing. You can then calculate the median for each group. Taking all the groups together, out of these 15 medians, you can take the mean. But other than in Jackknifing you might continue to leave out numbers, until there are none left. So it yields many recombinations. If I do that, the mean of the medians decreases until it finally reaches the mean of the whole group. The following code gave me a sample with the mean of 16.66 and a median of 20. I understand the case with the overall mean and respectively all the numbers (median), but I can't explain the behavior in between.

--- r-code ----

sample_size

More Beni Rohrbach's questions See All
Similar questions and discussions