Where the population is unknown, the sample size can be derived by computing the minimum sample size required for accuracy in estimating proportions by considering the standard normal deviation set at 95% confidence level (1.96), percentage picking a choice or response (50% = 0.5) and the confidence interval (0.05 = ±5). The formula is:
n = z 2 (p)(1-p)
c 2
Where:
z = standard normal deviation set at 95% confidence level
If key-variable of the population is quantitative, use n=Z2*s2/d2. Where: n - this is what are looking for (minimum sample size), Z - is the value of the distribution function (for tourism phenomenons you can calculate this value for alpha equals to 0,05), s - is the population standard deviation, and d - is acceptable standard error of the mean (it is up to you). Of course, you don't know the population (typical in tourism studies). So, I suggest to estimate s using results from pilot research. After the pilot research calculate s=s'*(n'/(n'-1))^0,5. Where: n' - is the sample size of pilot research, and s' - is the the standard deviation of sample of pilot research. I know that it's not a perfect solution. There is lot of limitations, e.g. using convenience sampling for pilot research.
Where the population is unknown, the sample size can be derived by computing the minimum sample size required for accuracy in estimating proportions by considering the standard normal deviation set at 95% confidence level (1.96), percentage picking a choice or response (50% = 0.5) and the confidence interval (0.05 = ±5). The formula is:
n = z 2 (p)(1-p)
c 2
Where:
z = standard normal deviation set at 95% confidence level
Sample size for unknown population maybe use the requirement of analysis tools. e.g. SEM or factors need minimum 100 samples or 5xn_variables and chosen by purposive way.
Tomasz Napierala's suggestion of a pilot study is appropriate. However, to avoid a very large error for small sample size estimation, use the "t" variate. "t" is dependent on the degrees of freedom, while "z" is not. Implementing a sample size equation in Excel and using equation solver to iteratively solve for 'n" with "T.inv" used for 't" is very easy to set up.
@Lloyd T. can u pls make your explanation a little more explicit. Please if u have any document that can be of help on this issue pls send to [email protected]
Determining the sample sizes involve resource and statistical issues. Usually, researchers regard 100 participants as the minimum sample size when the population is large. However, In most studies the sample size is determined effectively by two factors: (1) the nature of data analysis proposed and (2) estimated response rate.
For example, if you plan to use a linear regression a sample size of 50+ 8K is required, where K is the number of predictors. Some researchers believes it is desirable to have at least 10 respondents for each item being tested in a factor analysis, Further, up to 300 responses is not unusual for Likert scale development according to other researchers.
Another method of calculating the required sample size is using the Power and Sample size program (www.power-analysis.com).
we can calculate our needed sample size. This can be done using an online sample size calculator or with paper and pencil.
Your confidence level corresponds to a Z-score. This is a constant value needed for this equation. Here are the z-scores for the most common confidence levels:
90% – Z Score = 1.645
95% – Z Score = 1.96
99% – Z Score = 2.576
If you choose a different confidence level, use this Z-score table* to find your score.
Next, plug in your Z-score, Standard of Deviation, and confidence interval into the sample size calculator or into this equation:**
Here is an example of how the math works assuming you chose a 95% confidence level, .5 standard deviation, and a margin of error (confidence interval) of +/- 5%.
((1.96)2 x .5(.5)) / (.05)2 (3.8416 x .25) / .0025 .9604 / .0025 384.16 385 respondents are needed
Where the population is unknown, the sample size can be derived by computing the minimum sample size required for accuracy in estimating proportions by considering the standard normal deviation set at 95% confidence level (1.96), percentage picking a choice or response (50% = 0.5) and the confidence interval (0.05 = ±5). The formula is: n = z 2 (p)(1-p) c 2 Where: z = standard normal deviation set at 95% confidence level p = percentage picking a choice or response c = confidence interval
In order to answer your question, several remarks need to be incorporated:
1.The Cochran formula allows you to calculate an ideal sample size given a desired level of precision, desired confidence level, and the estimated proportion of the attribute present in the population.
2. Cochran’s formula is considered especially appropriate in situations with large populations. A sample of any given size provides more information about a smaller population than a larger one, so there’s a ‘correction’ through which the number given by Cochran’s formula can be reduced if the whole population is relatively small.
3.The Cochran formula is:
n0=(Z square x pq/e square)
Where:
· e is the desired level of precision (i.e. the margin of error),
· p is the (estimated) proportion of the population which has the attribute in question,
· q is 1 – p.
The z-value is found in a Z table.
4. If the population we’re studying is small, we can modify the sample size we calculated in the above formula by using this equation:
n= [n0/(1+((n0-1)/N)).
5. In order to estimate the sample size, three issues need to be studied, i.e. the level of precisions, confidence or risk level and the variability. Regarding the last issue, which your questions is concentrated the degree of variability in the attributes being measured refers to the distribution of attributes in the population.
6.The more heterogeneous a population, the larger the sample size required to obtain a given level of precision. The less variable (more homogeneous) a population, the smaller the sample size.
Note that a proportion of 50% indicates a greater level of variability than either 20% or 80%. This is because 20% and 80% indicate that a large majority do not or do, respectively, have the attribute of interest. Because a proportion of .5 indicates the maximum variability in a population, it is often used in determining a more conservative sample size, that is, the sample size may be larger than if the true variability of the population attribute were used.
Sample size refers to a number of factors, including the purpose of the study (Israel, 1992, p.3). Miaoulis and Michener (1976) have specified three main criteria to determine the appropriate sample size which are-
(1) The level of precision: It refers to the range in which the true value of the population is to be estimated (Israel, 1992, p.1). It is also called sampling error or margin of error. Generally acceptable margin of error in educational and social researches is 5% or 0.05 for categorical data, and 3% or 0.03 for continuous data (Krejcie & Morgan, 1970 quoted in Bartlett et al., 2001, p.45)
(2) The level of confidence or risk: It is based on ideas included under the Central Limit Theorem that when a population is repeatedly sampled, the average value of the attribute obtained by those samples is equal to the true population value (Israel, 1992, p.1). It is also called alpha level. The alpha level used in determining sample size in most educational research studies is either 0.05 or 0.01 (Ary, Jacobs, & Razavieh, 1996 quoted in Bartlett et al., 2001, p.45).
(3) The degree of variability: The degree of variability in the attributes being measured refers to the distribution of attributes in the population. The more heterogeneous a population, the larger the sample size required to obtain a given level of precision. The less variable (more homogeneous) a population, the smaller the sample size (quoted in Israel, 1992, p.2).
The Cochran formula allows you to calculate an ideal sample size given a desired level of precision, desired confidence level, and the estimated proportion of the attribute present in the population.
Cochran’s formula is considered especially appropriate in situations with large populations. A sample of any given size provides more information about a smaller population than a larger one, so there’s a ‘correction’ through which the number given by Cochran’s formula can be reduced if the whole population is relatively small.
The Cochran formula is:
📷
Where:
e is the desired level of precision (i.e. the margin of error),
p is the (estimated) proportion of the population which has the attribute in question,
For unknown population to calculate the sample size the population parameter is always taken as 50% with 5% margin of errors (p), z= 1.96 of 95% confidence interval
The sample size will therefore be
n = z2p(100-p)
ε2
Where
n= required sample size
Z= Critical value of the standard normal distribution for the 95% confidence interval around the true proportion which is 1.96
P= expected proportion of interest to be studied which is 50%, which is the prevalence for unknown previous prevalence.
ɛ= accepted margin of error on Proportion which is set at 3% (if the expected prevalence is above 20% and below 80%. The expected margin of error is set at 5% If the expected
if the expected prevalence is below 20% and above 80%. The expected margin of error is set at 3% . Therefore, since the previous prevalence is 50% of the unknown population which is above 20%, and below 80% the margin of error will be 5%.
Substituting in the above formula you get the final sample size for unknown population
For unknown population to calculate the sample size the population parameter is always taken as 50% with 5% margin of errors (p), z= 1.96 of 95% confidence interval The sample size will therefore be n = z2p(100-p) ε2 Where n= required sample size Z= Critical value of the standard normal distribution for the 95% confidence interval around the true proportion which is 1.96 P= expected proportion of interest to be studied which is 50%, which is the prevalence for unknown previous prevalence. ɛ= accepted margin of error on Proportion which is set at 3% (if the expected prevalence is above 20% and below 80%. The expected margin of error is set at 5% If the expected if the expected prevalence is below 20% and above 80%. The expected margin of error is set at 3% . Therefore, since the previous prevalence is 50% of the unknown population which is above 20%, and below 80% the margin of error will be 5%. Substituting in the above formula you get the final sample size for unknown population
I think that we should also consider the design effect(DE) and Non-Response Rate(NRR) while calculating sample size. So, there would be a slight variation in formula suggested by Mr. Brajesh i.e., a general formula used by most of the researchers. I would recommend you to read the book written by Leslie Kish on Survey Sampling. This would help you to use appropriate formula.
Actually I was not first come here to answer the question as it was also mine. But I couldn't able to write a question for my self and decided to continue here.
My research title is related to cultural tourism where my primary sample population will be both domestic and international tourists who use to visit different cultural heritage sites. Thus, they are unknown and I need your support to draw clear and easy sample size formula to determine the unknown sample size. Thank you!
The sample size was determined using single population formula for estimating single population proportion from the infinite population. The formula for calculating the sample size (n) would be:
n = (zα/2)square P (1-P)
d(Square)
P is the assumed highest population proportion prevalence
d is margin of error
z (a/2) is the Z-score at 95% confidence interval = 1.96
This is an online calculator that helps you to calculate the needed sample size. All you need to know first are the Confidence Level and Confidence Interval.
G power calculation is more reliable compared to Z score method given the credit in terms of either of the populations i.e. Known and unknown population