02 February 2019 4 3K Report

I am using nonlinear regression models (SVM, Random Forest) on a pooled regression problem. I have high autocorrelation and some cross-sectional correlation and want to provide a performance metric like R^2 that is corrected from this issue. Or some variant of Newey-West that I can apply post estimation (directly in my errors or in the prediction and realized).

I also have an issue in reporting my R^2 cross-sectionally. My dependent variable varies between 0 and 1 and the cross-sectional average changes drastically in some periods. This results in the models having a negative R^2 suggesting they are very bad, but the scatter plot of the prediction vs. realized suggest a relationship between the two (for example the correlation is 0.5). What would be the best way to deal with this problem?

Thanks

Similar questions and discussions