Hi RG community,
I have a dataset which has a non-normal distribution and I want to perform linear regression on it (using Graphpad Prism 9.3.1). Here is the process I follow:
1. First determine normality of data set using D'Agastino-Pearson omnibus normality test.
2. Find correlation coefficient (r): Pearson Correlation if data has normal distribution or Spearman Correlation if data has non-normal distribution.
3. Then I perform linear regression in Graphpad Prism and find out the fit equation and the goodness of fit (R-squared or R2).
Usually, for normally distributed data, the R-squared term numerically turns out to be square of correlation coefficient r (Pearson Correlation coefficient)
i.e., R-sqaured or R2 = (r)2
However, this is not true for non-normally distributed data. (Is this normal?)
Hence, I just wanted to get clarified on a few silly things: (haven't really worked much with non-normally distributed data)
1. Is it ok to have R-squared or R2 not equal to (r)2 for non-normally distributed data?
2. Is it ok to perform linear regression on non-normally distributed data as is or does it need some modification before linear regression can be performed?
I appreciate any help regarding this topic.
Thank you!