Besides independency, normal distribution and homoscedasticity, why don't we also need the sample sizes among groups to be somewhat equal?

Imagine we have a factor with three conditions. Let's say sample sizes are n_1=20, n_2=100, n_3=100 and all mentioned requirements are fulfilled. Calculating the variance within groups and between groups combines the variance of all samples now. Accordingly, degrees of freedom are df_within = p-1 = 2 and df_between = n-p = 217. Don't we in average overestimate the degrees of freedom for n_1 and underestimate it for n_2 and n_3 then? Or is that the very reason why we can't actually say something about individual groups when calculating an ANOVA?

Thanks in advance,

Steffen

Similar questions and discussions