It's called ``active learning'', cf. http://active-learning.net/ This doesn't, of course, mean that you won't get stuck in local ``extrema'' of the information transfer, since the approach consists in maximizing the local information transfer.
Active learning isn't limited to sequence ordering-that's the point. It's a procedure that is defined by the selection of examples so as to maximize the information transfer-which is one measure of ``high performance'' and, in fact, can be generalized to the maximization of any other objective function, the idea being that one doesn't present all the examples at once.
The order of data is defined by the dimensions and classes in the data set. The dimensions are the attributes affecting the response variable and the classes are different categories of the response variable. Its necessary to have a idea as to what is the order to data set as it helps you in identifying the number of hidden units in the neural network. Vague number of hidden units will either lead to under fitting or over fitting. The number of hidden layer units should be less than input units and larger than output units.