Folks,
I have 15,000 address strings which I have written a piece of code which assigns them to broad geographic region e.g. American address assigned to North America. I would like to check the accuracy of my code however to do this I must manually check the strings. To do this I've taken a sample (SRS). I set a margin of error of 5%, a CI of 95% and a response distribution of 50%. Therefore, I assumed that I would assign the correct region half the time.
The issue here is that I am much more accurate then I anticipated. Of my sample of 375 372 (99%) where assigned to the correct location. This means I am unable to calculate lower and upper confidence intervals.
Is there any other types of formula I could use when your response distribution is skewed to a highly positive outcome? Or is the only remedy to simply increase my sample size?
The only way I can verify if something is correct is by manually checking therefore the need to draw.