I am facing issue around using a log form of a variable in regression as independent variable , that is, using log(z) and [log(z)]^2 as independent variables in regression specification along with other independent variables. people using it to explore non-linear kind of relationship. In my view, the [log(z)]^2 should ideally be redundant because of following

y=Bo + B1*log(z) + B2*[log(z)]^2+ ∑B_k*X_i + e ------ (1)

∂y/∂z= B1*(1/z)+2* B2*(1/z)

∂y/∂z= (B1+2*B2)(1/z)

Since we would end up at same specification of marginal impact of z even if specified above regression without the term [log(z)]^2 i.e.

y=Bo + B1*log(z) + ∑BkXi+ e --------- (2)

∂y/∂z= B1*(1/z)

So thinking about log(z) as if it is z is essentially creating lot of problem in the process. The problem, however, is that the expression (1) gives slightly higher R-squared and there is lot of adjustment in co-efficient value of constant term to accommodate the square term. My understanding is that the quadratic term of log(z) confuses the analysis more than it solves it. I would love to see responses my view of using quadratic term. Or Please let me know if I am doing something very silly and can not see through that.

More Md Zakaria Siddiqui's questions See All
Similar questions and discussions