I think you should try various configurations manually. I don't think a NN with more than one hidden layer is needed (a single hidden layer has proved sufficient in many applications), but you still have to try different numbers of neurons in the hidden layer by:
1- for different values of n (number of neurons in the hidden layer) do:
2- train the net several times and average the validation errors
3- finally, pick the n that gives the best validation score (see the sketch below).
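As a minimal sketch of that loop (assuming scikit-learn's MLPClassifier, made-up candidate sizes, and your own X_train/y_train and X_val/y_val splits):

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

def pick_hidden_size(X_train, y_train, X_val, y_val,
                     candidates=(2, 4, 8, 16, 32), repeats=5):
    best_n, best_score = None, -np.inf
    for n in candidates:
        scores = []
        for seed in range(repeats):
            # retrain several times with different random initializations
            net = MLPClassifier(hidden_layer_sizes=(n,),
                                max_iter=1000, random_state=seed)
            net.fit(X_train, y_train)
            scores.append(net.score(X_val, y_val))
        avg = np.mean(scores)  # average validation accuracy for this n
        if avg > best_score:
            best_n, best_score = n, avg
    return best_n, best_score
```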
1- Trial and error (with the risk of over-fitting):
Determine an initial value using rule-of-thumb methods:
Rules of thumb for a starting number of hidden neurons:
The number of hidden neurons should be between the size of the input layer and the size of the output layer.
The number of hidden neurons should be 2/3 the size of the input layer, plus the size of the output layer.
The number of hidden neurons should be less than twice the size of the input layer.
A rule of thumb that estimates the number of hidden neurons from the size of the training set:
Nh = Ns / (alpha * (Ni + No))
Ni = number of input neurons.
No = number of output neurons.
Ns = number of samples in training data set.
alpha = an arbitrary scaling factor, usually 2-10.
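For concreteness, a small sketch that evaluates the three rules above plus the Nh formula; the example values of Ni, No, Ns and alpha below are made up:

```python
def hidden_neuron_heuristics(Ni, No, Ns, alpha=2):
    return {
        "between_input_and_output": sorted((No, Ni)),             # rule 1: a range
        "two_thirds_input_plus_output": round(2 / 3 * Ni + No),   # rule 2
        "less_than_twice_input": 2 * Ni,                          # rule 3: stay below this
        "Nh_formula": Ns / (alpha * (Ni + No)),                   # Nh = Ns / (alpha * (Ni + No))
    }

print(hidden_neuron_heuristics(Ni=10, No=1, Ns=1000, alpha=2))
# Nh_formula gives about 45 hidden neurons for 1000 samples with alpha = 2
```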
2- N-Fold Cross-Validation
To avoid over-fitting, use cross-validation: either explicitly penalize overly complex models, or test the model's ability to generalize by evaluating its performance on data not used for training, which is assumed to approximate the typical unseen data the model will encounter.
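A minimal sketch of this approach, assuming scikit-learn and a hypothetical data set X, y, comparing candidate hidden-layer sizes by their mean k-fold score:

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier

def cv_pick_hidden_size(X, y, candidates=(2, 4, 8, 16, 32), folds=5):
    results = {}
    for n in candidates:
        net = MLPClassifier(hidden_layer_sizes=(n,), max_iter=1000, random_state=0)
        # mean accuracy over the held-out fold of each of the k splits
        results[n] = np.mean(cross_val_score(net, X, y, cv=folds))
    return max(results, key=results.get), results
```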
3- Using a hybrid meta-heuristic algorithm to train feed-forward neural networks
Our ADMET Modeler does it by semi-exhaustive search (www.simulations-plus.com). It uses ensembles of MLPs, each with a single hidden layer, so that part is fixed. What varies are:
1) The number of hidden neurons (N)
2) The number of inputs (I)
The user specifies the starting and ending values of both N and I, as well as the steps dN and dI for going from start to end. Hence, one gets:
N0, N0+dN, N0+2*dN, ..., N0+m*dN neurons
I0, I0+dI, I0+2*dI, ..., I0+k*dI inputs
These parameters form an (m+1) x (k+1) matrix of ANN architectures. Architectures with too many weights relative to the size of the training data are removed - these sit in the lower right corner of the matrix. All the remaining ANN ensembles are then trained one by one (we can afford it because training is very fast) and the best one is chosen.
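For illustration only (this is not the actual ADMET Modeler code), a rough sketch of such a semi-exhaustive search over the (N, I) grid; the weight budget of at least two training samples per weight and the first-I-columns input selection are assumptions:

```python
import numpy as np
from itertools import product
from sklearn.neural_network import MLPRegressor

def semi_exhaustive_search(X_train, y_train, X_val, y_val,
                           N0, dN, m, I0, dI, k, samples_per_weight=2):
    Ns = len(X_train)
    best_arch, best_score = None, -np.inf
    for N, I in product(range(N0, N0 + m * dN + 1, dN),
                        range(I0, I0 + k * dI + 1, dI)):
        n_weights = I * N + 2 * N + 1   # weights + biases of a 1-hidden-layer MLP
        if n_weights * samples_per_weight > Ns:
            continue                    # too many weights for the data (lower right corner): skip
        cols = list(range(I))           # placeholder input selection: first I columns
        net = MLPRegressor(hidden_layer_sizes=(N,), max_iter=2000, random_state=0)
        net.fit(X_train[:, cols], y_train)
        score = net.score(X_val[:, cols], y_val)
        if score > best_score:
            best_arch, best_score = (N, I), score
    return best_arch, best_score
```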