Does anyone know of a proof of the fact that by coding a memory-less information source X, the resulting information source Y has memory and H(X)=L*H(Y) where L=average code length?

Similar questions and discussions