I have a Belief Network (Stochastic Neural Network) which is trained using Wake-Sleep algorithm (as defined in the attached paper). What is the best method to avoid overfitting of this model?
Especially, I am concerned about choosing number of the units (stochastic neurons) in each layer. How to choose an appropriate number of units, in terms of good generalization?
Note that my problem is totally unsupervised, i.e. there are no class labels for the data. Actually, I want to use the Belief Network as a generative model in a kind of "synthesis" problem (not classification).
http://search.proquest.com/openview/46b0529a5ebd4e1add1aebdb5bb74a31/1?pq-origsite=gscholar