Suppose we want to train a MLP neural network in order to learn the underlying probability density function (pdf) of some real-valued data. That is, the network must learn p(x), where x is a continuous random variable (possibly high dimensional vector). My proposed solution is as follows.

First a nonparametric density estimation method, such as Parzen (kernel) method, is used, and then its result is fed as training data to the neural network. More precisely, at each data point the value of pdf is obtained via the nonparametric estimation, and these point/value pairs are used to train the network in a supervised manner (i.e. using backpropagation). 

Is this method okay in theory, or I have missed something?

Regards.

More Amirhossein Ahmadian's questions See All
Similar questions and discussions