I am trying to model the energy content of goat milk on the basis of compositional data (e.g., protein, lactose, fat, etc..). All the data I am processing have been obtained from literature. Obviously, I have no individual data but only means. Unfortunately standard errors are not always available. So, a question arises, can I use mean values in linear regression analysis or should I perform another type of procedure to find the underlying relationship among the energy content and the other variables? Thanks in advance.