I am looking for research on the topic of training Boltzmann machines, Deep Belief Nets or other generative models on audio samples. Ultimately I would like to train these on specific sounds and then have the network generate new sound samples using Gibbs sampling. The only research that I find are for training and generating music scores. The closest I can find is for speech recognition but it is very specific to speech and combined with HMMs.

Similar questions and discussions