I am trying to model a system which consists of a second order linear part (for example a set of mass spring damper with variable parameters for spring and damper) and a non-linear part which is a cascade of a derivative, a time delay, a static non-linearity (e.g. half-wave rectifier) and a low-pass system. I want to use neural networks for this purpose. I have tried time delay neural networks which were not successful. I am now trying to use RNNs (Recurrent Neural Networks). Since I am relatively new to this subject, I was wondering which network architecture are suitable for this purpose? (I have to use a limited number of parameters)