What makes an ANN successful in generalizing its decision to non previously seen inputs? From a mathematical point of view, there are unlimited numbers of multi-dimensional functions that can fit a given input-output sets. Are all these functions good "generalizators"?