With multiple samples of data, eventually one sees different distributions, which arent necessarily 'normal distributions'. I have come across many transformation methods/functions (e.g. a logarithmic transform) to make data 'suitable' for predicting the value of a variable? However, using X variables to predict Y variables seems okay with normally distributed data, but if data isnt normally distributed (bi-modal population of X data for example), how do you predict Y variable values with certainty. After application of statistical tests like 1-way ANOVA, my data across independently tested samples, appear to be significantly different (alpha = 5%).