I am doing a multiple regression on several variables (i.e. 13 vars). The variables have few continuous variables and many categorical variables. The dataset is for only a year involving 7891 school applicants for entrance exams. I am using the average scores as my dependent var. The R-squared of the multiple linear model is very low 0.022. I have attempted using other methods such as nonlinear OLS but the R-squared is still very low. Also, I attempted using quantile regression but I think it's not applicable for the current data at hand. Can I get any recommendations in terms of pre and post estimations methodologies for the current data? Thank you!

Similar questions and discussions