I'd like you to participate in a simple experiment. I'd like to limit this to simple linear and multiple linear regression with continuous data. Do you have such a regression application with real data? Some cases with several independent variables might be helpful here. If you have done a statistical test for heteroscedasticity, please ignore that, regardless of result. We are looking at a more measureable degree of heteroscedasticity for this experiment.
To be sure we have reasonably linear regression, the usual graphical residual analysis would show no pattern, except that heteroscedasticity may already show with e on the y-axis, and the fitted value on the x-axis, in a scatterplot.
///////////////////////////////
Here is the experiment:
1) Please make a scatterplot with the absolute values of the estimated residuals, the |e|, on the y-axis, and the corresponding fitted value (that is, the predicted y value, say y*), in each case, on the x-axis. Then please run an OLS regression through those points. (In excel, you could use a "trend line.") Is the slope positive?
A zero slope indicates homoscedasticity for the original regression, but for one example, this would not really tell us anything. If there are many examples, results would be more meaningful.
2) If you did a second scatterplot, and in each case put the absolute value of the estimated residual, divided by the square root of the fitted value, that is |e|/sqrt(y*), on the y-axis, and still have the fitted value, y*, on the x-axis, then a trend line through those points with a positive slope would indicate a coefficient of heteroscedasticity, gamma, of more than 0.5, where we have y = y* + e, and e = (random factor of estimated residual)(y*^gamma). Is the slope of this trend line positive in your case?
If so then we have estimated gamma > 0.5. (Note, as a point of reference, that we have gamma = 0.5 in the case of the classical ratio estimator.)
I'd also like to know your original equation please, what the dependent and independent variables represent, the sample size, whether or not there were substantial data quality issues, and though it is a very tenuous measure, the r- or adjusted R-square values for the original linear or multiple linear regression equation, respectively. I want to see whether or not a pattern I expect will emerge.
If some people will participate in this experiment, then we can discuss the results here.
Real data only please.
Thank you!