By critical values, do you mean statistic values for which you would reject normality? If so, and using simulation, conceptually you would just create lots of normal samples (rnorm in R) for the different sample sizes, calculate your test statistic for each, and then see what values would be appropriate for rejecting the hypothesis (e.g., one possibility would be 2.5% in each tail, but other possibilities exist). You could smooth the values a bit so that the relationship between sample size and critical values makes sense, or just increase the number of replications to get better precision. R can be slow for loops, so if your statistic is slow to compute you would probably want to avoid loops by using functions like replicate() or the apply() family of functions (see Wickham's http://adv-r.had.co.nz/).
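A minimal sketch of the idea in R, assuming sample skewness as a stand-in for whatever test statistic you are actually studying (swap your own statistic into stat()):

```r
# Simulate critical values for a normality test statistic.
# stat() is an illustrative choice (sample skewness); replace it with
# the statistic from the method you are evaluating.
set.seed(1)

stat <- function(x) {                # sample skewness
  m <- mean(x); s <- sd(x)
  mean(((x - m) / s)^3)
}

crit_vals <- function(n, reps = 10000, alpha = 0.05) {
  sims <- replicate(reps, stat(rnorm(n)))         # statistic under H0
  quantile(sims, c(alpha / 2, 1 - alpha / 2))     # 2.5% in each tail
}

sapply(c(10, 30, 100), crit_vals)    # critical values by sample size
```

Increasing reps tightens the Monte Carlo error in the estimated quantiles; you can also fit a smooth curve through the critical values across sample sizes afterwards.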
If you want to calculate these mathematically, it will depend on your test statistic.
These kinds of questions are very useful, but also very difficult. You might start by thinking about whether you want to sample a distribution, or if you want to sample a population that is drawn from this distribution. The latter would be more appropriate if the primary use for the new method will be in research where populations are typically small. I am not quite sure about the quantitative definition of small in this case. Certainly anything less than 1000, probably anything less than 10,000. Anything 10 million or greater would probably be large. Some of the answer depends on the variability in the data.
There are two approaches. Daniel's suggestion was to use normal distributions. Thus the null hypothesis is true in these cases. This is where I would start, and I would plot a graph that looks at sample size and the probability of rejecting the null hypothesis.
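That first approach might be sketched as follows; shapiro.test() is used here purely as a placeholder for the new method, and since the data really are normal, the curve estimates the type I error rate at each sample size:

```r
# Probability of rejecting normality when the null is true, by sample
# size. shapiro.test() is an illustrative stand-in for the new method.
set.seed(1)

reject_rate <- function(n, reps = 2000, alpha = 0.05) {
  mean(replicate(reps, shapiro.test(rnorm(n))$p.value < alpha))
}

ns    <- c(10, 25, 50, 100, 250)
rates <- sapply(ns, reject_rate)

plot(ns, rates, type = "b", xlab = "Sample size",
     ylab = "P(reject H0)", ylim = c(0, 0.1))
abline(h = 0.05, lty = 2)            # nominal alpha for reference
```

If the method is well calibrated, the plotted points should hover around the nominal alpha at every sample size.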
The other approach is to use other distributions. These can be friendly, like the uniform distribution (runif(100, -5, 5)) or the gamma distribution (rgamma(100, 5, 9)), or unfriendly custom distributions. Think about things like generating a population consisting of rnorm(5000) and rnorm(5000) + 6.4386 to give a bimodal distribution with two equal peaks. You then withdraw a sample from this population to test with your new method. Repeat; each time you can either generate a new population or resample the existing population. These custom distributions can be multimodal, skewed, and distorted to whatever degree you want. You can then run simulations to see how well the new method performs as the distributions become ever closer to normal. You can also see how well the new method performs as the two populations become more dissimilar. So how well does the new method work if you are comparing a normal population to a bimodal population where the two modes are within some distance of each other? That is, rnorm(5000) and rnorm(5000) + d, where d > 0 sets the distance between the modes.
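A short sketch of the population-then-sample workflow described above, with an illustrative mode separation d (the test call is again just a placeholder for the new method):

```r
# Build a bimodal "population" from two equal normal components whose
# modes are d apart, then draw a sample from it for testing.
set.seed(1)
d <- 4                                           # illustrative separation

population <- c(rnorm(5000), rnorm(5000) + d)    # two equal peaks
samp <- sample(population, 100)                  # resample the population

shapiro.test(samp)$p.value   # stand-in test; vary d and repeat
```

Shrinking d toward 0 makes the population ever closer to normal, so looping this over a grid of d values maps out how the method's power decays as the two modes merge.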
Timothy and I, I think, are answering slightly different questions, and it boils down to my first question: what you mean by critical values. I assume these were for null hypothesis significance testing of whether we can reject the hypothesis that the empirical distribution was drawn from a normal distribution. If you are using other distributions, then you are not testing the null hypothesis that the normal distribution is true.
However, if your goal is to get a confidence interval around the test statistic, then you would use whatever distribution you think is appropriate (perhaps via bootstrapping). I think this is what Timothy is addressing. Let's say your test statistic was kurtosis, which can be used to show non-normality. The standard errors and p-values given in many packages and textbooks assume a normal distribution (and you will get near those values with the approach I suggest), so they have problems when the null hypothesis is false if you use them to make confidence intervals. See the attached. It shows that using the empirical distribution (and bootstrapping) produces better confidence intervals (but because kurtosis involves taking the error to the fourth power, the estimates are still unreliable in many situations; there are more robust alternatives out there for tail-heaviness, see references in the attached).
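A minimal percentile-bootstrap sketch of that idea; the kurt() helper computing excess kurtosis is illustrative (packages such as moments provide similar functions):

```r
# Bootstrap a confidence interval for excess kurtosis from the empirical
# distribution, rather than relying on normal-theory standard errors.
set.seed(1)

kurt <- function(x) mean(((x - mean(x)) / sd(x))^4) - 3  # excess kurtosis

x <- rexp(200)                       # a skewed, clearly non-normal sample
boot_k <- replicate(2000, kurt(sample(x, replace = TRUE)))

quantile(boot_k, c(0.025, 0.975))    # percentile bootstrap 95% CI
```

The percentile interval here makes no normality assumption, which is the point; even so, as noted above, fourth-moment statistics remain noisy, and the interval will be wide unless n is large.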
Article Problematic standard errors and confidence intervals for ske...
Mostly I am trying to help Usama think about the problem in a broader context, or perhaps to facilitate a further refinement of the question (as Daniel has asked for).
It is not clear if Usama has invented a new method, or if he is trying to better understand a method developed by someone else. Maybe critical values are going to be used to construct the equivalent of a t-table as found in introductory statistics textbooks. The methodology.pdf manuscript that was included in the question was interesting. If that is the goal, then at least part of the relevant program was published in an appendix at the end of that manuscript. Contacting those authors directly might get more help targeted to this specific problem.
Thanks all. I pretty much know the steps theoretically, and most of your answers cover them, but I need the R code with an example for any test, if possible.
Dear Prof. Timothy, yes, I am trying to better understand a method developed by someone else, but under other conditions. The method comes with some steps to be implemented. I have e-mailed the authors, but no one has answered me yet.