27 September 2023 5 3K Report

There may be more than one pattern of data across a single dataset, and variables are likely to cluster according to these patterns. The structure of such data is constructed via unknown latent groups would lead to unknown regression models. Consequently, the residuals of these models are assumed to be a mixture of normal distribution.

In the context of robustness, the main distribution of random errors is assumed to be a normal distribution that is contaminated by another mechanism or distribution. So, there are two ways to tackle such a problem, either trimming or using the weights function to reduce the impact of the random errors that follow the contaminated distribution.

Mixture regression has the same conceptual approximately, but it has another method to estimate the model parameters such as EM.

My question is what are the differences between mixture regression and robust multiple regression, and how can I take the decision which one I have to use with my data? Please, I'm not talking about the percentage of another distribution is 50%.

More Hassan Uraibi's questions See All
Similar questions and discussions