I am working on a data mining project and some of features varies between 1 to 1,000,000. Before training machines, I do the standardization for all the data features and do the rest of machine learning process (Regression method). The problem is to reduce some error of cource, but MAE, MSE, ... they are all metrics that bias the lines to bigger differences between train data and machine hyperplane (I hope used the right expression!). I decided to find the best fitted line to relative error ((x_estimation - x_test) / x_estimation). One of my friend said you can use some trick (maybe on data or model) that can help you to find your model faster.

I searched the web for hours, yet haven't found anything. He does not show me explicitly what that trick is, so I decided to ask you researchers if you got any useful hints, references, or python codes on github to clarify the problem for me, let me know.

I appreciate your helps.

More Masoud Masoumi Moghadam's questions See All
Similar questions and discussions