Hi!

My and my thesis partner have encountered some problem with our regression and would really appreciate some help / guidance.

We are evaluating a property tax reform and have downloaded tax data for the entire population affected by the reform. In our data set, the population is divided into different groups depending on income-level (discrete variable from 1-26), geographical region (discrete variable from 1-21), age-groups (discrete variable from 1-3) and gender (dummy variable). The tax data is collected yearly for some years before and some years after the reform.

We would like to use a regression model in order to see whether some of these variables (for example income-level) have had a significant effect on the change in tax payments due to the reform. Our problem is to find a proper regression model.

One thought we've had is to use our data as a panel data set. Then it is a short panel data set since N = 3 276 and T = 17. The problem we got then is that our discrete variables that we want to evaluate are time-invariant, which makes the fixed effects model difficult to use. In the same time we think that the unobserved component is correlated with our regressors, which makes the fixed effect more proper to use.

If someone has thoughts on what regression model to use for our data it would help us a lot. Also suggestions for good literature or papers that covers the same kinds of data set that we have would really help us.

Thanks!

Similar questions and discussions