In short, for ARQ and HARQ the energy consumption is directly related to the number of retransmissions required. However, that is not easy to model statistically, because it depends on many factors (various delays, algorithms, the channel, ...). Channel capacity, on the other hand, generally assumes essentially error-free transmission (error rate on the order of 10^-6) and already accounts for the effect of ARQ in your case.
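To make the link between retransmissions and energy concrete, here is a minimal sketch under strong simplifying assumptions that are mine, not from the papers below: every attempt fails independently with the same frame error rate, feedback is error-free, and each attempt costs a fixed energy. The function names and the `max_tx` limit are only illustrative. Real ARQ/HARQ systems (combining, delays, fading correlation) will deviate from this.

```python
def expected_transmissions(fer, max_tx=None):
    """Expected number of attempts per packet for simple (type-I) ARQ.

    fer    : frame error rate per attempt, assumed i.i.d. across attempts
    max_tx : optional limit on attempts (truncated ARQ); None = unlimited
    """
    if not 0.0 <= fer < 1.0:
        raise ValueError("fer must be in [0, 1)")
    if max_tx is None:
        # Geometric distribution: mean number of attempts is 1 / (1 - fer)
        return 1.0 / (1.0 - fer)
    if max_tx < 1:
        raise ValueError("max_tx must be at least 1")
    # Truncated geometric: sum of fer**k for k = 0 .. max_tx - 1
    return (1.0 - fer ** max_tx) / (1.0 - fer)


def expected_energy_per_packet(fer, energy_per_tx, max_tx=None):
    """Average energy spent per packet, assuming a fixed energy per attempt."""
    return energy_per_tx * expected_transmissions(fer, max_tx)


if __name__ == "__main__":
    for fer in (0.01, 0.1, 0.3, 0.5):
        n = expected_transmissions(fer)
        e = expected_energy_per_packet(fer, energy_per_tx=1.0)
        print(f"FER={fer:.2f}: avg attempts={n:.3f}, avg energy={e:.3f} (per-attempt units)")
```

Even in this idealized model the energy per packet grows as 1/(1 - FER), so a poor channel quickly dominates the energy budget.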
Several papers discuss practical transmission in green wireless communications, such as:
1. "Energy efficiency and spectral efficiency tradeoff in type-I ARQ systems," IEEE Journal on Selected Areas in Communications, vol. 32, no. 2, pp. 356 - 366, Feb. 2014.
2. "Optimum energy and spectral efficient transmissions for delay-constrained hybrid ARQ systems," IEEE Transactions on Vehicular Technology, 2014.
together with a practical frame error rate (FER) analysis:
3. "An accurate frame error rate approximation of coded diversity systems with non-identical diversity branches," IEEE International Conference on Communications (ICC), June 2014.
Increased channel capacity implies that you can transfer more information using the available resources. If the channel capacity is lower, you have to use more resources (more channels, more bandwidth, or higher-power amplifiers at the transmitter), and in every case more power is consumed.
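As a rough illustration of this trade-off, the sketch below uses the Shannon capacity of an AWGN channel, C = B log2(1 + SNR). The AWGN model and the function names are my assumptions for the example; a practical link with fading, coding loss, and amplifier inefficiency will need more power than this bound suggests.

```python
import math


def awgn_capacity(bandwidth_hz, snr_linear):
    """Shannon capacity in bit/s of an AWGN channel: C = B * log2(1 + SNR)."""
    return bandwidth_hz * math.log2(1.0 + snr_linear)


def required_snr(rate_bps, bandwidth_hz):
    """Minimum linear SNR needed to support rate_bps over bandwidth_hz."""
    return 2.0 ** (rate_bps / bandwidth_hz) - 1.0


if __name__ == "__main__":
    rate = 10e6  # target rate in bit/s (illustrative value)
    for bw in (20e6, 10e6, 5e6, 2e6):
        snr = required_snr(rate, bw)
        print(f"B={bw / 1e6:5.1f} MHz -> required SNR = {10 * math.log10(snr):6.2f} dB")
```

For a fixed target rate, shrinking the bandwidth makes the required SNR, and hence the transmit power, grow exponentially, which is the "more resources" cost described above.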