To see about the accuracy, estimated variance of the prediction error is helpful, but for including bias from the model being for just a sample, not the population, you need something like cross validation. That helps to indicate the difference between a model for the sample and the unknown model for the population.
Galit Shmueli has an interesting reference to Hastie, et.al., top of page 6 in the following, which might interest you:
I made some notes to sort out your question, and review this myself.
Consider the following:
1) There is a "true" but unknown model for a given situation.
2) There is then a model format, or possible models, postulated, based on a sample and on knowledge of the subject matter. Coefficients are labeled with subscripted "beta" characters. (Famous quote by George Box notes models are 'wrong,' though some are 'useful,' is dependent upon the fact that it matters how well the model format you pick may or may not approach reality.) I think it is customary to use epsilon here instead of e.
3) There are estimates for these coefficients, and estimates for residuals, e, for a given model, based on the sample, using least squares or some other technique, and those are the estimates of the betas, each beta with a hat, or a star (Maddala's notation for WLS regression, for which OLS is a special, and often overused case), or we may use the letter "b," each with a subscript.
4) It sounds like you would use "u" in place of "epsilon," if you had an entire finite population to calculate betas. However, those betas are still for a postulated model, for a given finite population. Also it might be regarded that a superpopulation would generate the finite population as one possible result of a mechanism, so in that frame of thinking, you still don't have "true" betas, for your postulated model, which is not "true," either.
5) Amazingly, many models are very useful, and many of those are virtually "true."
6) Note that you will often see "u" or "e" used interchangeably from one model to another, or in the same model representing different levels, so notation can be confusing.
Of course in real life we can't have the collected data of population, then we should try with samples. Concerning epsilon, yes you are right and I attached what I meant. So, please have a look on my attachement and let me now about your thoughts. Thanks alot.
You can't know betas, only estimates of betas, so the sample approach is the way to go. - Cheers.
PS - At least, that's the way I see it. And in survey sampling, dependence upon a regression model is the basis of the conditionality principle. That is, we are dependent upon the sample we have. (That makes cross validation a good idea to check for bias from a misspecified model (as well as sampling variance and nonsampling errors of all kinds).) - So, I'd say "linear regression for samples," which is your second Q derivative idea.
I totally agree with you and of course we can't know betas and have formulas for betas, but some statisticians said: when taking partial derivative of Q according to betas and equalling them to zero (∂Q/(∂β0 )=-2∑〖yi-β0-β1 xi〗=0), we will have beta hats that are estimates of betas. please look at my previous attachement (first choice). Both of the choices leads to the same formulas of beta hat but the starting points are different. cheers.
I did look at your attachment before answering, and that was.the best I could make of it.
Also, note that n came up in both derivations, not N the first time, and n the next. That seems to indictate the second derivation is the way to go. But I have to say that I am not very good with notation, but I still think you want the "sample" approach notation here.
Isn't the whole book on the internet a copyright violation?
Anyway, one cannot sum over all N cases if you do not have all N observations. You can say that this is what would happen, but we approximate using a sample. Perhaps the author meant just that: "This is what would happen. Now we approximate with a sample."
(But as I noted earlier, by the way, even if you could use the entire population to determine betas, you still have a model, and a model won't be 100 percent correct.)
Sorry, but I'm not going to look at that book unless I know it was posted with the author's and publisher's permissions.
Concerning the book on the internet, I just made a search on google and wrote "applied linear regression analysis pdf" and directly found this book. So, I did not make any violation sir. Cheers.