Why multi hidden layer NN with few neurons in each layer is a better approximator than a one hidden layer NN with many hidden neurons?

More Ali Namadchian's questions See All

Does Fokker-Planck equation give us the exact measure of the statistical ensemble of the trajectories of the stochastic differential equation?

It is known that the FPE gives the time evolution of the probability density function of the stochastic differential equation. I could not see any reference that relates the PDF obtain by the FPE...

09 October 2019 9,993 6 View

Is it possible to stabilize an Infinite-dimensional system by stabilization of the corresponding approximated finite-dimensional system?

we can approximate an infinite-dimensional system with a finite-dimension system, (for example by proper orthogonal decomposition) If we stabilize the approximate finite dimension of an...

09 October 2019 423 6 View

Is it necessary for engineers to know the definitions of mathematical spaces?

We know that mathematicians study different mathematical spaces such as Hilbert space, Banach space, Sobolev space, etc... but as engineers, is it necessary for us to understand the definition of...

09 October 2019 5,175 4 View

In stochastic systems(SDEs), Most of the time we assumed that the disturbance and noise vanish at the equilibrium point, is it reasonable assumption?

One of the main stability theories for stochastic systems is stochastic Lyapanuv stability theory, it is the same as Lyapanuv theory for deterministic systems. the main idea is that for the...

31 December 2018 4,463 4 View

Under which condition it is possible to change the dynamics of a nonlinear system to an arbitrary dynamic with feedback control input?

Imagine we have an ODE system x_dot=[f1(x,u), f2(x,u), f3(x,u),....fn(x,u)] where f1,..,fn are nonlinear functions of control input u and states x, x is member of R^n and u is member of...

31 December 2018 7,615 6 View

Is there a general framework to compute the reachability set of nonlinear affine-control systems?

for linear control systems x_dot=Ax+Bu the reachability set can be calculated using the Image of the controllability matrix, i.e R=([B AB A^2B,....,]) and reachability set=Im(R) when rank(R)=n,...

31 December 2018 2,037 3 View

Is it possible to prove the stability of a continuous time system from its corresponding discrete version?

In a simplest case imagine we have a continuous finite-dimensional dynamic system described by and ODE x_dot=f(x) (1) , Is it possible to prove the asymptotic stability of (1) by...

11 December 2018 5,461 30 View

Which one is a stronger stability condition? Lyapunov stochastic stability or detailed balance?

in most cases for continuous time stochastic systems which are modeled by SDE, the Lyapunov stability conditions can guarantee the stochastic stability of the system, another definition In...

11 December 2018 4,084 4 View

Is it possible to establish the stability results based on pseudo-spectra for a linear differential operator?

In the theory of the stability of the differential operators, one could prove the stability results based on spectra of an operator, (all eigenvalues must be negative for example). one problem...

10 November 2018 3,034 7 View

Which neural network structure is better for approximation of dynamical systems?

there are different kinds of neural networks. MLP, RBF, LSTM, recurrent, ... I have to approximate a dynamical system with neural network, which type of NN is more suitable for this task?

08 September 2018 2,340 5 View

ANY free software for reconstructing neurons in the microscopic image?

Hi everyone, I am working on brain slices for visualizing a protein in the soma and dendrites, using a fluorescence tag. However, I need a tool (not paid) for reconstruction of the whole neuron,...

04 August 2024 4,725 2 View

My frequency output shows no errors, however, there's no frequency as a result on gausview. Does anyone know what I'm doing wrong?

I'm trying to perform a frequency calculation using Gaussian via MOBAXterm. The output shows no errors, however, there's no frequency as a result on gausview. The option "vibrations" is not...

31 July 2024 631 4 View

How can we calculate the percentage of configuration interaction (CI) in the UV output data of the Gaussian program?

How can we calculate the percentage of configuration interaction (CI) in the UV output data of the Gaussian program? for example: Excited State 17: Singlet-A 5.1359 eV 241.41 nm...

28 July 2024 9,165 2 View

How to extract RNA from neurons?

Hi, I am a master's student who studies inflammation. I'm planning to extract RNA from the Vagal nerve or Dorsal Ganglion roots, but I got a question. It would be only a few 50ug per each nerve,...

23 July 2024 6,783 1 View

Is is possible to calculate the activation energy of Redox Reactions Using Gaussian?

Hi everyone, I'm working on calculating the activation energies for some redox reactions using Gaussian, Here are the reactions I'm interesting: Py•−+ 3O2 → Py + 3O2•− Py•− + 1O2 → Py + 1O2•− Is...

18 July 2024 4,418 3 View

Cresyl violet Nissl Staining ?

Kindly confirm how to distinguish glia cells and neurons using cresyl violet staining. Also, using cresyl violet nissl staining how to identify the apoptotic neurons ? Does this stain stains...

18 July 2024 2,420 0 View

Please, from the computational calculations of an organic compounds using Gaussian program?

Using DFT/B3LYP/6-311++G

17 July 2024 7,720 1 View

What is wrong with my input file?

im studing gaussian 16 with reading paper about I-131 Metaiodobenzylguanidine in the paper "In a similar vein, nuclear magnetic resonance shielding values were investigated using the widely...

16 July 2024 6,040 4 View

The Uniqueness of Human Language?

Unlike humans, it is believed that birds do not have a symbolic language system that can be reduced to words (Berwick et al. 2012). But they do have a nervous system that can generate sequences...

14 July 2024 9,852 0 View

What does absolute and relative ir and raman activity means .and what the measurement formula?

I have ftir from gaussian output file.where ir aand raman are given. However in many research articles value of ir and raman are classified into two portion. Please anyone knows suggest me to how...

11 July 2024 2,498 1 View

Khulood Obaid

following

Ricardo Almeida

Hi,

When you increase the number of neurons, you increase the complexity of the search space. The NN may then over-estimate the complexity of the function it is trying to learn, causing overfiting and consequently bad generalization. You could check that ploting training/validation error curves. If you see a through the plot that the NN fails to converge, it is due to the increased number of local minimuns, which could be tackled using reinitialization and adaptive learning rate mechanisms.

Usually simpler is better, should you try a NN with one hidden layer and few neurons and you may get faster and accurate results as well.

Regards.

Ali Namadchian

Dear Ricardo

In fact in my training I used regularization (Bayesian regularization) method for both one layer and multi layer networks.regularization methods avoid overfitting. So I do not think that overfitting is the problem.

Hi Ali,

Have you tried to plot the estimated function shape generated by both NN topologies to see how they look?

I'm not sure that Bayesian Regularisation guarantee no overfitting, although it may help. The best way to check it is plotting training/validation curves, then you can easily see if it has either overfitted or not converged. Since your function is not too complex and you are using Bayesian Reg, it's likely that the NN with 30 neurons has not converged. This could be solved by adding more examples for training.

Another point is that the accuracy may not have to do with the number of layers but the number of neurons. As you add neurons, which meas more parameters, the learning process becomes more difficult. I believe you can achieve results as good as your 2 layer NN, if not better, with a single layer NN with few neurons, let's say 3~8.

If you had a more complex problem, then you probably would do better with a more complex NN.

If you check the training/validation plot, please share the results with us!

Best wishes.

Dear. Ricardo

I think Regularization will guarantee no over-fitting, by the way, I have attached the estimated function shape with two structure,

1- one hidden layer with 30 hidden neurons

2- two hidden layer with each 5 hidden neurons

as it can be seen the two hidden layer gives a far more better approximation of the function.

also I have attached the normalized training data-set.

I simply used the Matlab neural network toolbox, and train the network with trainbr (Bayesian Regularization) function.