15 September 2020 2 8K Report

Good morning ,

I am working on a project and I need to predict the quantity of sales of a product based on the country (+10) , the city (+100) , the price, the month, the week of day , the period of the year (ex: holiday or not) etc....

My first question : What are the different tests I should do before doing a linear regression?

Secondly, do you think that Ridge regression is a appropriate method to use in this type of analysis?

Secondly: How should I prepare the data?

- Should I normalize the numeric data?

-Should I transform the categorical variables into dummy variables?

Finally, don't you have any explaination to the non normal residuals and the heteroscedasticity I found after predicting with the Ridge Regression on re-scaled data?

Thanks in advance for helping me.

More Myriam Hong's questions See All
Similar questions and discussions