I study behavior in C. elegans, and it is nearly impossible to control and account for the many factors that might influence behavior (temperature, age, time of day, humidity, vibration, etc, etc). If one or more of these factors has a particularly strong effect, then we are blind to it (mostly for practical reasons). As a matter of protocol, we always collect data on multiple days and combine into a single sample collection. So multimodal distribution in our sample wouldn't be surprsing, but I would consider it a consequence of technical limitations and not "real" biology for the specific behavior I'm studying. Based on published studies and my own experience with the specific behavioral assay (a populaiton-based chemotaxis assay), the data are generally normally distributed (as best as I can tell - haven't been motivated to test for normality, but suspect the data would pass as normal for the majority of samples). As such, the evidence appears to show that the data are normally distributed, and this is the assumption I make when choosing statistical tests. (Note: I use "sample" here as used in statistics, i.e., collecting a sample from a population to make inferences about the population...not to be confused with its use in most biomedical research settings where a "sample" represents collection of a single specimen.) My N is approximately 8-24 per group.

So, my inclination would be to assume normal distribution (even if on occasion, the sample distribution appears multimodal) given the fairly exhaustive published evidence that the behavior appears normally distributed. My assumption of normal distribution is for the population not the sample (especially given the relatively small N for each sample). Is it reasonable not to test for normality when a sample appears multimodal, especially in light of effects by factors that are difficult to control? I don't feel comfortable testing for normality when it is likely due to technical variation. This could lead to inconsistent application of statistical tests (where two independent samples collected for the same condition lead to applying tests assuming normal distribution for one and tests assuming multimodal distribution for the second). Thank you all for your responeses!

More Ricardo Frausto's questions See All
Similar questions and discussions