For example, the same model with 2 parameters is fitted to two data sets. Confidence intervals for each parameter are estimated by nonparametric bootstrapping. The results define two separate (one for each data set) 2-dimensional confidence regions for model parameters (each bootstrap gives a combination of 2 parameter values). The region overlap to some degree. What is the best way to estimate the p value for this overlap? In other words, at what level of significance do the parameter confidence regions generated based on each data set differ?