Hello everybody,
I have a large data set of simulated data. At the moment I have 700,000 observations, but the number of observations can be increased if needed. The data set consists of one dependent (response) variable and 4 independent variables (predictors). I want to include all 4 predictors because all of them are essential for the predictions. My aim is to obtain a kind of response surface.
First, I started with multiple regression analysis using polynomials and interaction terms, but I don't know how to decide which model is the best. Do you have any suggestions?
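To make the model-comparison problem concrete, here is a minimal sketch of one possible way to compare polynomial models: fit each candidate degree on a training split and compare the error on held-out rows. It is written in Python with NumPy and runs on synthetic stand-in data, not on my simulation output. The helper names `poly_design` and `holdout_rmse`, the number of predictors, and the generating formula are all assumptions made up for illustration.

```python
import itertools
import numpy as np


def poly_design(X, degree):
    """Design matrix with an intercept and all monomials (pure powers and
    interaction terms) of the columns of X up to the given total degree."""
    n, p = X.shape
    cols = [np.ones(n)]
    for d in range(1, degree + 1):
        for idx in itertools.combinations_with_replacement(range(p), d):
            cols.append(np.prod(X[:, list(idx)], axis=1))
    return np.column_stack(cols)


def holdout_rmse(X, y, degree, train_frac=0.8, seed=0):
    """Fit by ordinary least squares on a random training split and
    return the root mean squared error on the remaining rows."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(len(y))
    n_train = int(train_frac * len(y))
    train, test = perm[:n_train], perm[n_train:]
    coef, *_ = np.linalg.lstsq(poly_design(X[train], degree), y[train], rcond=None)
    resid = y[test] - poly_design(X[test], degree) @ coef
    return float(np.sqrt(np.mean(resid ** 2)))


# Synthetic stand-in data (hypothetical): adjust n_predictors and replace
# X, y with the real simulated observations.
n_predictors = 4
rng = np.random.default_rng(1)
X = rng.uniform(-1.0, 1.0, size=(5000, n_predictors))
y = (1.0 + 2.0 * X[:, 0] - X[:, 1] * X[:, 2] + 0.5 * X[:, 3] ** 2
     + rng.normal(0.0, 0.1, size=5000))

# Held-out error for each candidate degree; a lower value is better.
scores = {degree: holdout_rmse(X, y, degree) for degree in (1, 2, 3)}
best_degree = min(scores, key=scores.get)
print(scores, best_degree)
```

Comparing on held-out rows rather than on the fitted rows matters here because a higher-degree polynomial always fits the training data at least as well as a lower-degree one, so the in-sample fit alone cannot rank the models.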
After that, I tried to use nonparametric regression (loess), but this seems to be applicable only for up to 4 predictors and I have five.
Do you know of methods that can help me analyze my data set and uncover the functional form relating the response to my predictor variables, y=f(x_1,x_2,x_3,x_4,x_5)?
Thank you very much in advance!
Best regards,
Florian