Hello, I am doing a multiple regression with one outcome measure and six predictors (say p1, p2, p3, p4, p5, and p6). The sample size is 140. When I perform a stepwise regression, I have 4 statistically significant predictors: p1,p2,p3, and p4 (p1 is the strongest and p4 the weakest). These are the results without any transformations of the raw data.

However, predictor p4 is not normally distributed, so I performed lg(10) of the predictor p4 and insert that predictor in the regression (instead of raw p4 data). This time, the results also indicate 4 statistically significant predictors but this time I have the following statistically significant predictors in the model: p4(lg10), p1, p5 and p3 (ranked by their size, p4(lg10) being the strongest and p3 the weakest).

So, the question is what should a possible write-up of these results be? Both of these models are theoretically sound but I wonder which one is „closer to the truth“.

Similar questions and discussions