I am using bootstrap analysis for a set of data that I obtained from a Monte Carlo simulation.
Bootstrapping (statistics) allows random sampling with replacement from the original data set that I obtained from a Monte Carlo simulation. Thanks!
Interesting question, but I presume you are actually asking how many bootstraps you NEED to perform (since you can do as many as you like). It is important to realize that, whatever the number of bootstraps made, the final precision is still completely determined by your initial sample size. Bootstrapping is one method to assess a statistic computed from a sample. Very often we are interested in the accuracy of that statistic when used as a point estimate of a population parameter. Bootstrapping means we temporarily substitute the empirical probability distribution induced by the sample for the probability distribution defined by the population. Repeatedly taking a bootstrap sample just replaces the otherwise very tedious calculations with that empirical distribution that would be required to assess your initial statistic. Bootstrapping is not by definition taking a large number of bootstrap samples; that is just the practical way it is performed (i.e. by Monte Carlo simulation from the empirical distribution). Judging how many bootstrap samples you should take requires (as other contributors here have also said) consideration of, for instance, how large the variance of your statistic is (treating the empirical distribution temporarily as the true one), how skewed its sampling distribution is, and so on.
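To make that concrete, here is a minimal R sketch (my own illustration with made-up data, not the poster's setup) of Monte Carlo resampling from the empirical distribution to assess a statistic:

```r
# Bootstrap the median of a hypothetical sample by resampling from the
# empirical distribution (sampling with replacement from the data).
set.seed(1)
x <- rnorm(50)                            # stand-in for the original sample
B <- 2000                                 # number of bootstrap replicates
boot_medians <- replicate(B, median(sample(x, replace = TRUE)))
sd(boot_medians)                          # bootstrap estimate of the standard error
quantile(boot_medians, c(0.025, 0.975))   # simple percentile interval
```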
It can vary depending on many factors. However, in practice, 1,000 iterations are commonly performed. For statistics students, when NPC, SE, etc. are explained, 1,000 iterations usually give a sense of how, with every other condition held constant, the results can be interpreted with reference to the assumption of normality.
There are some other specialized tools that use different iteration numbers. For example, standardized low-resolution brain electromagnetic tomography (sLORETA) recommends using 5,000 iterations because its statistical non-parametric mapping (SnPM) analysis does not rely on the assumption of normality. However, to analyse the difference in R between two surface Laplacian maps (map1 vs. map2 and pool1 vs. pool2), just 200 iterations have also been used in the literature.
So, as I said, it is variable and depends upon many factors at hand.
Hope this helps...
You mean 1,000 replications, whatever the number of elements in my data?
I am not sure I understand what you want to do. Monte Carlo will already sample your parameter space sufficiently, so I see no need for bootstrapping the "one" dataset you got. Monte Carlo processes could easily give you 1000s to millions of datasets.
The rule of thumb is 1000. However, for a sample size N there are only (2^N - 2) possible sample combinations (assuming you ignore the original sample and the null sample), after which your new replications are simply repetitions and probably shouldn't be included. Hope that helps.
A maximum of 1,000 replicates is more than sufficient.
Cheers.
I think it depends on the complexity of the "statistics" you want to calculate from the samples. But 1000 is good enough.
JL Foulley, University of Montpellier II, France
The total number of bootstrap samples of size n out of an n-item data sample is simply the number of combinations of n items out of 2n-1. This is a particular case of the so-called Bose-Einstein statistics, which gives the number of ways of distributing n particles among g sublevels, here with g = n.
So I recommend that people who think 1,000 repetitions are enough read:
Pattengale, N., Alipour, M., Bininda-Emonds, O., Moret, B. and Stamatakis, A. (2009) How many bootstrap replicates are necessary? Research in Computational Molecular Biology. Lecture Notes in Computer Science, Springer Berlin, Heidelberg, pp. 184-200.
With 1,000 repetitions you often do not get the proper answer if you have a large enough number of variables.
Have a look at this paper:
Davidson, R & MacKinnon, J G (2000) Bootstrap tests: How many bootstraps? Econometric Reviews 19(1): 55-68.
I agree with Hugo Volkeart above, that your problem is not too clearly defined.
The minimal number of bootstrap samples you should take is dependent on the statistic you want to bootstrap and what aspect of the statistic you need to analyse and interpret.
For instance, to get confidence intervals for the mean, 100 bootstrap samples would be sufficient; for a confidence interval of the standard deviation you need more, say 500. But for a statistic like the cost-effectiveness ratio I use 50,000, since the underlying distribution of the statistic is Cauchy (because the denominator can be zero, the function contains a discontinuity).
If one is interested in the complete distribution of the statistic, not only in its confidence intervals, a large number of samples is needed to get a proper and precise impression (this occurs, for example, if one wants to detect outliers).
But so what? To calculate the statistic on a large number of bootstrap samples usually takes only a few seconds in R.
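As a rough illustration (arbitrary data and replicate counts, not taken from the references below), here is how the percentile interval for the mean versus the standard deviation settles down as the number of bootstrap samples grows:

```r
# Compare percentile intervals for two statistics at several values of B.
set.seed(42)
x <- rexp(40)                                    # made-up skewed sample
pct_ci <- function(stat, B) {
  reps <- replicate(B, stat(sample(x, replace = TRUE)))
  quantile(reps, c(0.025, 0.975))
}
for (B in c(100, 500, 5000)) {
  cat("B =", B,
      "| mean CI:", round(pct_ci(mean, B), 3),
      "| sd CI:",   round(pct_ci(sd, B), 3), "\n")
}
```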
(Note that Sean Clouston's calculation of the number of different bootstrap samples (see above) is incorrect, because it does not take into account (a) that the number of elements of the bootstrap sample is always N, and (b) that, since you sample with replacement, the same element of the original sample can occur several times in the bootstrap sample.)
For more on the subject see:
Advising on research methods: A consultant's companion
by Adèr, Mellenbergh and Hand (2008).
(website: www.jvank.nl/ARMHome)
The book has been indexed for Google books
(there is a link to Google books on the website, type BOOTSTRAP in this case as a search term).
Herman Adèr
It is not clear what the output of your Monte Carlo simulation is. Apparently you are producing a series of data from that process, which you call a sample (possibly one data point per trajectory of the simulation). If you could expand your explanation of your process, you might get a more useful suggestion.
Treatment of data series as input to or output from a Monte Carlo process is delicate.
Try 1,000; 10,000; and 100,000, and look at the results plotted. This helps you decide.
In R this is very fast.
R is a freely downloadable, high-quality statistical package derived from the S-Plus package.
Look at http://www.r-project.org/ to get hold of it. It has a wide range of additional packages, not all statistical,
including one for bootstrapping (boot).
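For example, a small sketch with the boot package (the data, statistic, and replicate counts are placeholders of my own):

```r
# Percentile confidence intervals for the mean at increasing numbers of replicates.
library(boot)
set.seed(123)
x <- rgamma(100, shape = 2)
mean_stat <- function(data, idx) mean(data[idx])   # boot's statistic(data, indices) form
for (R in c(1000, 10000, 100000)) {
  b <- boot(x, statistic = mean_stat, R = R)
  print(boot.ci(b, type = "perc")$percent[4:5])    # lower and upper percentile limits
}
```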
Hi,
I usually do 10,000 replicates but actually you could do many more depending on how long it takes on your computer.
But I also compute a bias estimator that shows me how far off my simulated (bootstrap) mean is from, say, the mean given by your original sample. I usually use Python or SAS to do the simulations.
ok bye for now.
We can perform an unlimited number of bootstraps on a sample of N elements.
Could anyone give advice on bootstrap analyses using R for phylogeny?
For genetic phylogenies, I would recommend RAxML; you can perform many bootstraps very quickly (faster than using R).
In general, I believe you can do as many resamplings as you like from a dataset without concern about exceeding N. That said, applying bootstrapping to a Monte Carlo-derived dataset surprises me and makes me think you may want to reconsider. Typically, Monte Carlo analyses themselves involve repeated samples from random number generators and generate an often very large number of output observations, so it strikes me as odd to take some or all of these and then build up a bunch more by resampling. You might rather just increase the number of observations created by the Monte Carlo process you are using.
The number of bootstrapped samples that you can generate from a sample of N elements is more a question of computing power than a statistical question. If you are thinking about this as a statistical question, then perhaps your concern is that you don't create the same bootstrapped sample twice. There is no way you could guarantee that you didn't create the same bootstrapped sample twice – that could occur by chance. However, the number of possible unique bootstrapped samples from a sample of size N is equal to N! (N factorial, or N * (N-1) * (N-2) * … * 2 * 1). This number gets large very quickly. For example, 12! is equal to 479,001,600 – nearly half a billion possible combinations – but 6! is only 720, while 7! is 5,040 and 8! is 40,320. So if you have a really tiny sample, such as N = 6, you might want to limit the number of bootstrapped samples to, in that case, 720, so that you could say the expected number of times each possible bootstrapped sample would occur equals 1. Of course, there would undoubtedly be some duplicate samples and some that did not occur, but with any number of samples over 720 you would be guaranteed to be generating duplicates, which would not contribute any new information to the analysis. However, with a sample size of 10 or more (over 3.5 million possible samples), there would be a relatively low probability of duplicate samples, even if you created 100,000 bootstrapped samples.
Dr Vanner, I do not agree with you: the number of different possible bootstrapped samples from a data set of size N is not N! but C(2N-1, N), the number of combinations of N items out of 2N-1, which is smaller than N! as soon as N > 5; see my comment from 2 days ago.
I do not see any other post from you, but I defer to your expertise.
Here is a copy of it.
JL Foulley, University of Montpellier II, France
The total number of bootstrap samples of size n out of an n-item data sample is simply the number of combinations of n items out of 2n-1. This is a particular case of the so-called Bose-Einstein statistics, which gives the number of ways of distributing n particles among g sublevels, here with g = n.
I understand you want to resample by bootstrap (BS) from a Monte Carlo (MC) sample. I am assuming that the MC sample comes from MC simulations that are relatively expensive (due to the type of process you are simulating), so you expect that BS resampling will help in getting confidence limits (CLs) and perhaps frequency distributions of the statistic that interests you. If this is the case, then the answer to your question about the number of BS samples depends on what assumptions or computations you are willing to make. If you intend to get the CLs of your statistic under the normality assumption, all you need is the standard deviation and the confidence level; if you need nonparametric estimates of the CLs of your statistic, you can obtain them from the frequency distribution (histogram), using the proper quantiles for the confidence level you wish to prescribe.
Regarding the number of BS samples, one plausible way to prescribe it is to rely on an adaptive convergence criterion: take b samples, estimate the CLs of the statistic of interest, generate 2b samples, and recompute the CLs; the optimal number of BS samples is the one for which the CL estimates converge to an acceptable precision. In the case of nonparametric estimates of the CLs based on the histogram, what you would see graphically is that as the number of BS samples increases, the histogram changes but tends to converge to a given histogram. The number of BS samples it takes to get a stable histogram is the optimum; increasing it further will not change your estimates.
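A rough R sketch of that adaptive doubling idea (my own illustration with placeholder data and an arbitrary tolerance, not a procedure taken from the papers below):

```r
# Double the number of bootstrap samples until the percentile limits stop moving.
set.seed(7)
x <- rlnorm(60)                                   # placeholder data
boot_ci <- function(B) {
  reps <- replicate(B, mean(sample(x, replace = TRUE)))
  quantile(reps, c(0.025, 0.975))
}
B <- 250
old <- boot_ci(B)
repeat {
  B <- 2 * B
  new <- boot_ci(B)
  if (max(abs(new - old)) < 0.01 || B > 1e6) break  # tolerance chosen arbitrarily
  old <- new
}
c(B = B, new)                                      # B at convergence and the stable limits
```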
One concern you should have, depending on the statistic whose distribution you are aiming to characterize by BS resampling, is the expected bias of the estimate; I recommend that you look at the papers below.
Best,
Efron, B., Tibshirani, R. (1986). “Bootstrap methods for standard errors, confidence intervals, and other measures of statistical accuracy.” Statistical Science 1(1), 54-77.
Efron, B. (1987). “Better Bootstrap Confidence Intervals.” J. Amer. Statist. Assoc. 82 171-185.
DiCiccio, T., Efron, B. (1996). “Bootstrap Confidence Intervals.” Statistical Science 11(3), 189-228.
From Jean-louis Foulley:
The total number of bootstrap samples of size n out of an n-item data sample is simply the number of combinations of n items out of 2n-1. This is a particular case of the so-called Bose-Einstein statistics, which gives the number of ways of distributing n particles among g sublevels, here with g = n.
So with 4 data points, the number of DIFFERENT bootstrap samples should follow the Bose-Einstein formula: (n+g-1)! / [n! (g-1)!]
If g = n:
(2n-1)! / [n! (n-1)!]
or, for n = 4:
= 7! / (4! × 3!) = (7×6×5×4×3×2×1) / [(4×3×2×1) × (3×2×1)]
= 5040 / (24 × 6) = 35
or should it be 4! = 4 * 3 * 2 * 1 = 24?
Just for a test, here are the results for n = 4:
AAAA
BBBB
CCCC
DDDD
AAAB
AAAC
AAAD
AABB
AACC
AADD
AABC
AABD
AACD
ABBB
ACCC
ADDD
ABBC
ABBD
ACCD
ABCC
ABDD
ACDD
ABCD
BBBC
BBBD
BBCC
BBDD
BBCD
BCCC
BDDD
BCCD
BCDD
CCCD
CCDD
CDDD
35 combinations. However, that does not mean that 35 bootstraps will give you these 35 different sets... Since bootstrapping is a random resampling procedure – WITH REPLACEMENT, starting each time again from the same dataset – one may have to perform many more than 35 resamplings in order to get one or more of each.
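For anyone who wants to check that count by machine rather than by hand, a short R sketch (my own) that enumerates all 4^4 ordered resamples and counts the distinct multisets:

```r
# Enumerate ordered resamples of {A, B, C, D}, sort each one, count distinct multisets.
x <- c("A", "B", "C", "D")
ordered_samples <- expand.grid(x, x, x, x, stringsAsFactors = FALSE)   # 4^4 = 256 rows
multisets <- apply(ordered_samples, 1, function(s) paste(sort(s), collapse = ""))
length(unique(multisets))   # 35, matching the enumeration above
choose(2 * 4 - 1, 4)        # 35, the Bose-Einstein / stars-and-bars count
```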
One does not need to get the different possible bootstrap samples... one needs to recreate a possible data space DISTRIBUTION.
How many samples do you need to have a representative distribution? In my experience, for phylogenetic analysis based on DNA or protein sequence data with up to several hundred polymorphic sites, 500 bootstrap resamplings are sufficient. If you do more resampling, all that may change is the numbers after the decimal point...
Since there is no agreement on what a reliable bootstrap value is (a 50% cut-off? 75%? 90%? 95%?), chasing the possible small changes in bootstrap values from larger bootstrap datasets is a waste of time. If a critical grouping of taxa is very close to the cut-off value you have decided upon (just a little bit above it, say 76% if you decided that 75% is your cut-off), it may be worthwhile to run a second bootstrap analysis (even with the same number of bootstraps) to confirm it. As each resampling is independent and unique, there is no guarantee you will see the same results, and if the evolutionary signal is not consistent, and thus not reliable, you may obtain different bootstrap values.
It is for sure 35, not 16. In fact, this number M of distinct samples increases very rapidly with the size n of the data set.
For instance, for n = 17, M = 1,166,803,110, so the probability of generating the same bootstrap sample twice by Monte Carlo becomes very small.
Sufficient Bootstrapping by Singh and Sedory (2011) will be sufficient to answer all the questions you people have. See here:
http://www.sciencedirect.com/science/article/pii/S0167947310003981
Your first wrong question will also be answered correctly! Because your question was not wrong, it was correct!
You people can also enjoy reading my new thinking: "Saddlestrapping"
http://www.lana.lt/journal/34/Singh.pdf
I've done some bootstraps using my own programs in Python and SAS and can say you can do as many as you want. Generally, though, 2,000 is a good starting point; then go to 10,000, but there is no reason you couldn't do 100,000 if your computer can handle it. Get a feel for the numbers and compare them to the straightforward statistics (mean, SD, etc.). The difference between the bootstrap and standard statistics will be your 'bias'.
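As a sketch of that bias idea (in R rather than Python or SAS, with made-up data): the bootstrap bias estimate is just the mean of the bootstrap statistics minus the statistic computed on the original sample.

```r
# Bootstrap bias estimate for the sample mean.
set.seed(99)
x <- rexp(30)                        # placeholder skewed sample
theta_hat <- mean(x)                 # statistic on the original sample
B <- 10000
boot_stats <- replicate(B, mean(sample(x, replace = TRUE)))
bias_hat <- mean(boot_stats) - theta_hat
c(original = theta_hat, bootstrap_mean = mean(boot_stats), bias = bias_hat)
```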
You are right, but from a sample of n = 5 we can have at most 5^5 = 3,125 bootstraps or saddlestraps. We cannot take as many as we like beyond 3,125.
True, Sarjinder, 5^5 is the number of different resamples, but we are not limited by this number. We can do as many boots as we want. What we want to find is the sample statistic, e.g. the mean and standard deviation. Because n = 5 is quite a small sample size, the Central Limit Theorem does not apply here, so we bootstrap instead, say 10,000 times, and work out the booted mean and SD (this would probably pass any peer review in journals, so you would get published).
If I accept your answer, then we are not using with-replacement sampling; the way you describe it, samples would be selected without replacement. These 5^5 = 3,125 are not combinations; they are the total number of samples given by the fundamental law of counting! If you take more than 3,125 samples, there is nothing new in the information of any additional sample. It was just an example with a small sample size. The total number of boots should not exceed n^n, whatever the sample size. If you take more than n^n boots, there is no new information in your boots. Why should I do it?
@Sarjinder I think I see where you are coming from and agree that the total number of distinct bootstrap resamples is n^n. However, I don't think I agree that bootstraps beyond n^n fail to provide any information. It is entirely possible that I sample n^n times and end up with floor(1.1*n^(n-1)) replicates of a given observation on my first pass, and then do it again and get floor(0.9*n^(n-1)) replicates of the same observation on my second pass. For small n, only after taking many iterations of n^n samples will I see that the average number of instances of a given observation stays close to the expected n^(n-1) when averaged across all iterations.
Think of it this way: what is the expected value of a statistic over all possible bootstraps? For simple random sampling with replacement (SRSWR), the sample mean is unbiased under the assumption that the total number of samples is N^n, where N is the population size and n the sample size. With SRSWR, the sample size n can equal the population size N; that is the reason we use SRSWR for the bootstrap. If we used without-replacement sampling, then only one sample of size n from a population of size n would be possible, because there is only one such combination.
@Dudley! I have a couple of e-mails like this:
Subject: About your paper "Sufficient Bootstrapping"
Dear Dr. Singh,
We are currently working on your paper "Sufficient Bootstrapping". The paper is very interesting. We have already adapted the method, together with the jackknife-after-bootstrap method, to detect influential observations.
@David! Yesterday I had no time. Let us take n = 20 (a reasonable sample size). Assume 500 boots are enough. The total number of possible distinct boots is 20^20 ≈ 1.04858E+26. Now two questions:
(a) What is the probability of including a repeated boot among the 500 boots?
(b) If you say the number of boots could be infinite (no limit), what will infinity divided by infinity be?
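One rough way to put a number on question (a), treating each of the n^n equally likely ordered resamples as a distinct boot (a birthday-problem style approximation of my own, not an exact answer):

```r
# Birthday-problem approximation to the chance of any repeated boot among B draws.
n <- 20; B <- 500
M <- n^n                            # about 1.05e26 equally likely ordered resamples
p_repeat <- B * (B - 1) / (2 * M)   # approximate P(at least one repeat)
p_repeat                            # roughly 1.2e-21, i.e. negligible
```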
Seems we are not communicating all that well. I was talking about frequency of a given observation in an n^n size set of resamples. You are talking about a "repeated boot", presumably a repeated set of n observations. Not the same thing. Unless we sat down across from each other I suspect it would take us a long time to come to a meeting of minds.
Nevertheless, now that I know the names some folks attach to the two approaches of bootstrapping (as per Wikipedia), perhaps I can clear things up a little. I agree that if using the "exact" method then additional boots past n^n are superfluous. However, if using the Monte Carlo method then it is possible to extract a tiny bit more information by increasing the number of resamples above n^n, since with this method it is highly likely that some observations occur more frequently in the grand sample than others.
Practically speaking, this question has little practical utility anyway since, when using the Monte Carlo method, you are almost surely going to run out of computer time and patience before you come anywhere close to n^n for statistically interesting datasets.
Think of it this way: what is the inclusion probability of a particular unit in a simple random with-replacement sample? To my knowledge, there are only three methods of counting: (i) the fundamental law of counting, (ii) permutations, and (iii) combinations!! If the bootstrapping method has any fourth rule of counting, then I have no idea!!
I've actually seen people try to bootstrap with as few as 3 or 4 observations... and that while testifying in court as an expert witness. So, although it should be a moot question, it really isn't. I'll qualify this as an academic discussion, but it is fun to think about. That said, I disagree with both Sarjinder and David. Here is my argument.
I looked at the simple example of a sample of size n = 3 with three unique values, say 7, 12 and 15, where the statistic of interest is the sample mean. Note that it actually matters whether the values are relatively prime or not, but let's work with these three, which are relatively prime.
I argue that there are n^n = 3^3 = 27 possible unique bootstrap samples (with replacement), and only 10 possible unique means, with unequal probabilities of occurrence. In this situation there is only one way to draw three 7s, so the exact percentile for a mean of 7 is 1/27, or 3.7%. This is the most extreme one-tail probability that is estimable. The second smallest mean among the 10 possible values is 8.67 (the mean of (7, 7, 12)), which can occur in just three ways, so the cumulative percentile associated with this value is (3+1)/27 = 14.8%. So it is mathematically impossible to estimate either the 5th or the 10th percentile, as the only possible values nearby are the 3.7% and 14.8% points.
So no matter how many thousands of bootstrap reps are drawn, the lower tail will be many many repeated 7s and 8.67s, so the naive percentile bootstrap estimate of the 5th percentile will actually be the 3.7th percentile and the 10th percentile estimate will actually be the 14.8th percentile.
These are large biases for tail probabilities. But then who would try to bootstrap 3 numbers? Maybe we do it by accident when we have small sample sizes with several repeated values.
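Reproducing that n = 3 calculation in R (the values 7, 12 and 15 come from the example above; the code itself is just an illustrative sketch):

```r
# All 3^3 = 27 equally likely ordered resamples of (7, 12, 15) and the exact
# distribution of the bootstrap mean.
x <- c(7, 12, 15)
grid <- expand.grid(x, x, x)
means <- rowMeans(grid)
table(round(means, 2)) / 27          # 10 distinct means with unequal probabilities
mean(means <= 7)                     # 1/27 = 3.7%: smallest attainable tail area
mean(means <= 26/3 + 1e-9)           # 4/27 = 14.8%: the next attainable percentile
```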
The real moral of the story is 1) don't use the percentile bootstrap, and 2) get to know your data before you do anything.
One other comment... for highly skewed data, none of the bootstrap methods work well with fewer than about 30 observations, or perhaps even more. I know this was not the question, but it is probably more interesting to think about the minimum sample size necessary to ensure that confidence limit estimates have the proper nominal coverage and are not too variable.
@John: Great! When dealing with any percentile (including a proportion), you are very much right that the sample size should be greater than 30; it is written in many textbooks. Sometimes I want to see all possible boots with my own eyes rather than as a histogram, so a small sample size helps to watch them all. It also helps for teaching in a class. Take n = 3, put all 27 possible boots on the blackboard in front of the students, make a frequency distribution table, and show them the histogram on the blackboard. Students enjoy it; they can see it with their own eyes! Later you show them a big data set on a computer, and then they can understand what you are trying to tell them. No doubt it is more or less of academic interest!
You can use bootstraps from the true condition in your research on research methodology.
Interpreting the question as:
How many [unique] bootstrap samples can be formed from a sample of N elements [where each of the N elements is distinct]?
The answer is (2N - 1) choose (N).
(This is exercise 8.4 from Larry Wasserman's *All of Statistics*.)
You can convince yourself of this by thinking about how many ways there are to put N balls into N buckets, using the 'stars and bars' counting method:
http://en.wikipedia.org/wiki/Stars_and_bars_(combinatorics).
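For comparison, the two counts being debated side by side (illustrative only): the number of distinct unordered bootstrap samples, choose(2N - 1, N), next to the number of ordered ones, N^N.

```r
# Distinct unordered bootstrap samples vs. ordered resamples for small N.
N <- 3:8
data.frame(N = N, distinct = choose(2 * N - 1, N), ordered = N^N)
```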
Then it is not with replacement sampling! It could be "stars and bars" sampling!
David,
This is correct, but to calculate the percentiles directly you also need to know how many ways each of the unique estimates can be obtained. That is why we have been talking about all N^N arrangements of the N (unique) values taken N at a time with replacement. For example, in my N = 3 example there are indeed 10 = (2N-1 choose N) distinct sample values, but they are not equally likely. In order to obtain the percentiles you need to work with all 27 (non-unique) samples.
You can get extra information out of more than n^n bootstraps. Many years ago (mid-1980s) I worked for the Dept of Water Resources in Sydney, and while taking a course on predicting the capacity of a dam to account for a long drought, we used bootstrap-type simulations of river flows. What we wanted to know was how many consecutive days there would be no flow; it is akin to asking how many consecutive heads come up on a coin. However, we then abandoned that method and instead used a Markov chain to get the long-run steady state of the dam from a transition matrix. The moral of the story is that there are better methods than bootstrapping.
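For what it's worth, a tiny sketch of that Markov-chain steady-state idea (the two-state transition matrix below is made up, not from the Sydney study):

```r
# Long-run steady state of a "flow / dry" chain from its transition matrix P,
# using the left eigenvector associated with eigenvalue 1 (pi P = pi).
P <- matrix(c(0.9, 0.1,              # P[i, j] = P(next state j | current state i)
              0.3, 0.7),
            nrow = 2, byrow = TRUE,
            dimnames = list(c("flow", "dry"), c("flow", "dry")))
e <- eigen(t(P))
pi_stat <- Re(e$vectors[, 1])
pi_stat <- pi_stat / sum(pi_stat)
round(pi_stat, 3)                    # long-run proportions of flow vs. dry days
```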
Could you please show us any extra information by taking a small example with n=3?
It will help us in academia! I would love to teach "Dudley's" methodology in my class.
Can bootstrapping handle small data, big data, bigger data, or the biggest data?
If so, how? If not, why not?
Can we differentiate between small data, big data, bigger data, and the biggest data?
If so, how? If not, why not?
Just to clarify:
There are (2n-1 choose n) distinct bootstrap samples. That is, sampling is with replacement, so the elements in each sample need not be unique (that is in fact the definition of a bootstrap sample), but the order of the elements does not matter.
If you used n^n samples, that would mean the order of the elements matters, which doesn't necessarily make sense!
For example, when n=3 there are 10 distinct bootstrap samples (not 27, see * below for explanation):
1 x1,x1,x1
2 x1,x1,x2
3 x1,x1,x3
4 x1,x2,x2
5 x1,x2,x3
6 x1,x3,x3
7 x2,x2,x2
8 x2,x2,x3
9 x2,x3,x3
10 x3,x3,x3
* A sample {x1,x1,x2} is the same as samples {x1,x2,x1} and {x2,x1,x1}. That is, these samples are not distinct.
See Efron, B & Tibshirani, RJ. 1993. An introduction to the bootstrap, page 49
See Sufficient Bootstrapping by Singh and Sedory. Efron and Tibshirani missed the point that there is a theory of distinct units in with-replacement sampling schemes. It was developed in the 1950s, whereas the bootstrap was developed in 1979. The problem is that people do not look at the literature and go with their own minds!
Dr. Singh, you made the point very clear. Students really need to see how the bootstrapping method runs inside the machine, so it is always better to demonstrate with a small sample such as n = 3.
Thank you for your clarifications. I think that answers the question.
It depends on the distribution of the underlying population and the objective of the analysis (regression, a 95% confidence limit on the mean, etc.).
In simulation studies I've conducted investigating coverage of confidence intervals, I've found that for right-skewed data with fewer than 30 or so observations the bootstrap methods tend to undercover the true mean, i.e. the bootstrap confidence interval captures the true population mean less often than the nominal confidence level.
Again, the actual performance is a function of the underlying distribution. Less-skewed distributions require fewer samples to develop reliable inference. I recommend conducting simulation studies with pilot data to plan for an adequate sample size when bootstrap analysis is anticipated.
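A compact version of that kind of coverage check (my own sketch with a lognormal population; the sample size, replicate counts and seed are arbitrary):

```r
# Estimate how often a percentile bootstrap CI for the mean covers the true
# mean of a right-skewed (lognormal) population with a small sample.
set.seed(2024)
true_mean <- exp(0.5)                      # mean of lognormal(meanlog = 0, sdlog = 1)
cover <- replicate(500, {                  # 500 simulated datasets
  x <- rlnorm(20)                          # small skewed sample, n = 20
  reps <- replicate(1000, mean(sample(x, replace = TRUE)))
  ci <- quantile(reps, c(0.025, 0.975))
  ci[1] <= true_mean && true_mean <= ci[2]
})
mean(cover)                                # often falls below the nominal 0.95 here
```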
But I don't think this was what the author was asking.
One such study:
Technical Report PERFORMANCE EVALUATION OF UCL ESTIMATION METHODS: WHEN DATA ...
The number should be "large" but less than n^n.
2^n - 2 is not generally the maximum number of possible mixes because (1) the sampling comes with replacement, and (2) the same set of observations in a different order is considered a unique subsample.
@ Lisa Kirkland
The estimator of the mean will still be unbiased; unfortunately, it will increase the variance of the sample means. Keeping 10 samples out of the 27 possible samples would not reduce the variance of the sample mean. Only SRSWR with n^n boots makes the estimator of the mean both unbiased and efficient. What will happen to the CLT? Taking fewer or more boots than n^n will certainly produce biased estimates: if the boots are fewer than n^n, no doubt there will be bias and less efficiency; if the boots are more than n^n, again there will be bias but more efficiency. The unbiasedness and efficiency will match the CLT only if the number of boots is n^n and the boots are taken using SRSWR. Yes, it was proven in the 1950s that keeping distinct units in SRSWR (or PPSWR) sampling leads to efficient results; that is what we named sufficient bootstrapping!
@John W. Kern: Just take a simple example and show all the steps. If someone knew the distribution, then there would be nothing left to estimate! Generally in mathematical statistics we let f(x) be known; if it is known, then what is left? That may be a very broad comment on my part!!! Well, I will wait for your reply!
Suppose I have a sample size N = 10 and the number of bootstrap replicates is 1,000, and I want to repeat this whole process a number of times (call it duplication/repetition). Is there any recommended number for that, like 10, 20, or 100?
@Eshan Amalnerkar: Compromise between bias and efficiency: The estimator of the mean will still be unbiased; unfortunately, it will increase the variance of the sample means. Keeping 10 samples out of the 27 possible samples would not reduce the variance of the sample mean. Only SRSWR with n^n boots makes the estimator of the mean both unbiased and efficient. What will happen to the CLT? Taking fewer or more boots than n^n will certainly produce biased estimates: if the boots are fewer than n^n, no doubt there will be bias and less efficiency; if the boots are more than n^n, again there will be bias but more efficiency. The unbiasedness and efficiency will match the CLT only if the number of boots is n^n and the boots are taken using SRSWR. Yes, it was proven in the 1950s that keeping distinct units in SRSWR (or PPSWR) sampling leads to efficient results; that is what we named sufficient bootstrapping!
Dear friends,
I have to compare two independent samples with N1 = 7 and N2 = 85. If I apply the bootstrap method, will it cause any problem because sample N1 has only 7 observations?
In the case of generating a phylogenetic tree (around 900 tips from 3 different species):
How accurate would it be to perform 100 bootstraps? Would it make much difference to use 1,000 or 10,000?