I am interested in the application of derivative-free methods, in particular pattern search algorithms, to the identification of nonlinear systems. I have found many application papers that use this kind of methods to identify the parameters of the plant, but I have not found too much information about the advantages of using those algorithms with respect to gradient-based methods. Obviously the results will depend on your model structure, but is there any paper comparing both alternatives in general?