How to interpret root relative squared error with regards to prediction model of data mining? I have generated a data mining model using classification technique and the result shows that the model has 82.6872 %. what does that mean?
The root relative squared error is relative to what it would have been if a simple predictor had been used. More specifically, this simple predictor is just the average of the actual values. Thus, the relative squared error takes the total squared error and normalizes it by dividing by the total squared error of the simple predictor. By taking the square root of the relative squared error one reduces the error to the same dimensions as the quantity being predicted [1]. You may also be interested to know the difference between high relative and root relative errors. To know this, refer to [2].
The root relative squared error is relative to what it would have been if a simple predictor had been used. More specifically, this simple predictor is just the average of the actual values. Thus, the relative squared error takes the total squared error and normalizes it by dividing by the total squared error of the simple predictor. By taking the square root of the relative squared error one reduces the error to the same dimensions as the quantity being predicted [1]. You may also be interested to know the difference between high relative and root relative errors. To know this, refer to [2].
82.6872 % is the accuracy level of the model that you have created. I do not think that it is a very good model as there are some model with accuracy as high as 99%. Classification is a method of supervised machine learning. There are various such type of methods. Here are some of the list of methods which you can try
1. Bayesian classification.
2. Support Vector Machines.
3. linear regression.
4. logistic regression.
5. naive Bayes.
6. linear discriminant analysis.
7. decision trees.
8. k-nearest neighbor algorithm.
9. Neural Networks (Multilayer perceptron)
The attached file is how to compute the root relative squared error.
To make things much clear, if the RRSE = 82.6872 % , then the model is not quite good. But if the accuracy = 82.6872 %, this means the model is arguably very good. Knowing that, in this case, looking at the accuracy alone is not enough, you have to explore another aspects like F-measure, Kappa statistic, etc.
Hi Sobhan Sarkar: Thank you for the answer! Can you please provide a reference for the "Relative Absolute Error" or the "Root Relative Squared Error"? A book or publication link would be really helpful (rather than stackoverflow).