How to calculate sample size for a study of disease prevalence?

Hi,

There are many ways to determine the sample size. You can see this site:

https://www.qualtrics.com/uk/experience-management/research/determine-sample-size/?rid=ip&prevsite=en&newsite=uk&geo=FI&geomatch=uk

At the end of that page you can see the formula.

In general, every sample size estimate comes down to quantities such as the relative error, the interval length, and so on.

If that is not enough, you can send me a message for more help.

Have a good time.

Ramnath Takiar

One of the problems faced by most researchers is that they do not have information on the prevalence of the disease they want to study. In 1986, I was appointed Senior Research Officer at the Desert Medicine Research Center, Jodhpur, Rajasthan, India. The very first problem I faced was determining the sample size. The objective was to carry out a general health survey in the state and to determine the major health problems of the area. Not much information was available, and there was no single disease whose prevalence could be taken as the reference for determining the sample size. After much thought, I decided to choose a sample size adequate for studying any disease with a prevalence of 1%. A sample size chosen this way is also valid for all diseases with a prevalence above 1%. With this background, I come back to your problem. You may follow these steps:

1) The formula for calculating the sample size is n = 4pq/L^2

where p is the prevalence of the disease under study, q = 1 - p, and L is the tolerable error in the estimate of the prevalence.

2) Guess the prevalence of hydatidosis among the domestic cattle to be 1%.

3) Then n = 4*(0.01)*(0.99)/(0.002)^2 = 9900; here I have assumed L = 0.002 (that is, 20% of the prevalence, so the estimate is accurate to within 20%).

4) So the sample size for the study should be 9900 cattle.

5) Note that if you change the prevalence to 2%, 3% or 5%, the sample size will reduce accordingly, provided L stays at 20% of the prevalence.

6) For 2% prevalence, the above formula with L = 0.004 gives a sample size of 4900.

7) I hope this meets your immediate requirement. If you still have any queries, feel free to ask.

8) Note that when the expected prevalence is low, say 1% or 2%, using a smaller sample size than required will not give you optimum results.
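The steps above can be sketched in a few lines of Python. This is only an illustration, not part of the original answer: the function name `prevalence_sample_size` and the `relative_error` parameter are my own, and it assumes, as the worked figures in steps 3 and 6 imply, that L is taken as 20% of the guessed prevalence. The factor 4 in the formula is approximately 1.96^2, so the result corresponds roughly to a 95% confidence level.

```python
import math


def prevalence_sample_size(p, relative_error=0.20):
    """Sample size from n = 4pq / L^2, with L = relative_error * p.

    p              -- guessed prevalence as a proportion (0 < p < 1)
    relative_error -- tolerable error as a fraction of p (assumed 20% here)
    """
    if not 0 < p < 1:
        raise ValueError("p must be strictly between 0 and 1")
    if relative_error <= 0:
        raise ValueError("relative_error must be positive")

    q = 1 - p
    L = relative_error * p  # absolute tolerable error
    n = 4 * p * q / L ** 2

    # Round away floating-point noise before rounding up to whole animals.
    return math.ceil(round(n, 6))


if __name__ == "__main__":
    for p in (0.01, 0.02, 0.03, 0.05):
        print(f"prevalence {p:.0%}: n = {prevalence_sample_size(p)}")
```

For a prevalence of 1% this reproduces the 9900 of step 3, and for 2% the 4900 of step 6. If you would rather fix L as an absolute error (for example, plus or minus 1 percentage point whatever the prevalence), replace the line that computes L with that constant; the sample size will then no longer fall as the prevalence rises towards 50%.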

Giovane R Sousa

Hi Punya Ram Sukupayo

You can use either this online tool (a fast calculator), which is easy and user-friendly, or my annotated SAS code example below, in which you just need to vary the parameters of the power analysis to explore the consequences of varying assumptions. If I can be of any further assistance, please let me know.

Fast calculator:

https://www.dartmouth.edu/~eugened/power-samplesize.php?

title 'Power for comparison of proportions [estimation of odds ratio] between 2 cardiac AAb risk strata (total n=800)';

proc power;
  logistic
    alpha = 0.05
    /* distribution of a dichotomous variable for "high-risk group, yes/no":
       (proportion of people with a positive test, n trials [do not change]) */
    vardist("aab_pos") = binomial(0.08, 1)
    vardist("x1a") = normal(33, 7)     /* mu = mean, sigma = SD: "age" */
    vardist("x1b") = normal(114, 12)   /* mu, sigma: "sbp" */
    vardist("x1c") = normal(12, 6)     /* mu, sigma: "duration" */
    vardist("x1d") = normal(7.2, 0.9)  /* mu, sigma: "a1c" (renamed from the duplicate "x1c") */
    vardist("dkd") = binomial(0.12, 1) /* (proportion with microalbuminuria, 1) */
    /* you can add as many as you want, following this format
       [I recommend sticking to normal or dichotomous covariates] */
    testpredictor = "aab_pos"
    testoddsratio = 2.4                /* odds ratio between risk strata */
    /* odds ratio for each covariate: needs (# of vardist statements - 1) values,
       which should come from our 'pilot' data */
    covoddsratios = (2 2 12 2 2)
    responseprob = 0.12                /* prevalence of CVD or other outcome */
    ntotal = 800                       /* total cohort size */
    power = .;                         /* leave as missing (.), will output power */
run;
