Hi RG community,

I have a dataset which has a non-normal distribution and I want to perform linear regression on it (using Graphpad Prism 9.3.1). Here is the process I follow:

1. First determine normality of data set using D'Agastino-Pearson omnibus normality test.

2. Find correlation coefficient (r): Pearson Correlation if data has normal distribution or Spearman Correlation if data has non-normal distribution.

3. Then I perform linear regression in Graphpad Prism and find out the fit equation and the goodness of fit (R-squared or R2).

Usually, for normally distributed data, the R-squared term numerically turns out to be square of correlation coefficient r (Pearson Correlation coefficient)

i.e., R-sqaured or R2 = (r)2

However, this is not true for non-normally distributed data. (Is this normal?)

Hence, I just wanted to get clarified on a few silly things: (haven't really worked much with non-normally distributed data)

1. Is it ok to have R-squared or R2 not equal to (r)2 for non-normally distributed data?

2. Is it ok to perform linear regression on non-normally distributed data as is or does it need some modification before linear regression can be performed?

I appreciate any help regarding this topic.

Thank you!

More Srujan Singh's questions See All
Similar questions and discussions