1. I'm trying to use a sample dataset (attached) to estimate the optimal sample size.
I'm using the linked book chapter as a guide, using formula 7.1 (7.2) gives a similar result.
The resulting n is similar to the sample size. In fact, if I make random sub-sets the estimated n will be close to the actual sub-set size every time.
I was expecting a fairly stable value for n after a certain sampling size (which I want to determine). Instead the result points to the sample size used for the estimation: there is no conclusion to be taken, every tested sample size is the correct one.
The desired absolute error seems to be the most sensitive variable, but I'm unsure how to calculate a robust/adequate value for the sample, independent of variations in the test sample size.
2. The sample is not normally distributed, it's skewed to the left. The averaged values will be subject to a t-test. There are samples both in control and treatment that do not pass a normality test. But for this purpose, only the averages need to conform to a normal distribution, or the underlying values also need to be normal?
http://www.zoology.ubc.ca/~krebs/downloads/krebs_chapter_07_2013.pdf