Good morning ,
I am working on a project and I need to predict the quantity of sales of a product based on the country (+10) , the city (+100) , the price, the month, the week of day , the period of the year (ex: holiday or not) etc....
My first question : What are the different tests I should do before doing a linear regression?
Secondly, do you think that Ridge regression is a appropriate method to use in this type of analysis?
Secondly: How should I prepare the data?
- Should I normalize the numeric data?
-Should I transform the categorical variables into dummy variables?
Finally, don't you have any explaination to the non normal residuals and the heteroscedasticity I found after predicting with the Ridge Regression on re-scaled data?
Thanks in advance for helping me.