Deep neural networks have high representational capacity and have gained much success in recent years. However, with the ability of high representational capacity, it can also suffer from overfitting, especially when the data is not big. So, what are the strategies that could avoid overfitting deep neural networks (other than drop-out).