Dear experts,
I would appreciate if you help me to find a hint for my question.
Consider a routine example of having a dependent (y) and several independent (X's) variables. The variable y is recovery rate. Based on experience, we know that the X1 is the most important predictor and the greater the value of X1, the more recovery rate we have.
Is it reasonable we assign weights to the observations according to their value of X1 before fitting any models? if yes, do you know any references to support this idea? (For either the linear regression or machine learning methods)
P.S: I have found some book chapters and articles but their ideas behind assigning weights to the observations are as follows;
(1) to achieve more precise estimates by correcting for heteroskedasticity
(2) to achieve consistent estimates by correcting for endogenous sampling
(3) to identify average partial effects in the presence of unmodeled heterogeneity of effects.