How do I select the optimum values for the number of batches, number of epochs, number of hidden layers, and number of steps for classification using a deep learning algorithm?
This depends largely on the type of model you want to build; by examining the various training graphs you can decide on these hyperparameters. It is well explained in this single link.
There is no general rule for it. To select the optimum hyperparameters and network architecture, several different networks should initially be trained on a small portion of the data. Then compare the accuracy of all the networks; the network with the highest accuracy has the best architecture. You should then apply the selected architecture to the whole data set. The network can be further tuned with dropout regularization. Regarding the number of epochs, the best approach is to assign a large number of epochs (e.g., 1000) and then use early-stopping regularization. This technique prevents over-fitting by stopping the training procedure once the model's performance on the validation subset has not improved for a certain number of epochs.
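A minimal sketch of that early-stopping idea, assuming TensorFlow/Keras (the thread does not name a framework); the data, network, and patience value below are placeholders, not anything from the original answer:

```python
# Early-stopping sketch: large epoch budget, training halts when validation loss stalls.
import numpy as np
from tensorflow import keras

# Toy data standing in for a real classification set.
X = np.random.rand(1000, 20)
y = np.random.randint(0, 2, size=1000)

model = keras.Sequential([
    keras.Input(shape=(20,)),
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Stop once val_loss has not improved for `patience` epochs and keep the best weights.
early_stop = keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=20, restore_best_weights=True
)
history = model.fit(
    X, y,
    epochs=1000,            # large budget; early stopping decides the actual count
    validation_split=0.2,
    callbacks=[early_stop],
    verbose=0,
)
print("Stopped after", len(history.history["val_loss"]), "epochs")
```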
Since stochastic gradient descent is the default optimization technique, ideally you would like to make your batch size as large as you can, given that the only reason we split the data into batches is computational memory.
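One quick way to see that trade-off is to time an epoch at several batch sizes; this is only an illustrative sketch, and the Keras model, synthetic data, and batch sizes are my own assumptions:

```python
# Sketch: timing one epoch at different batch sizes (Keras assumed; data is synthetic).
import time
import numpy as np
from tensorflow import keras

X = np.random.rand(10000, 20)
y = np.random.randint(0, 2, size=10000)

def build_model():
    model = keras.Sequential([
        keras.Input(shape=(20,)),
        keras.layers.Dense(64, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="sgd", loss="binary_crossentropy", metrics=["accuracy"])
    return model

for batch_size in [32, 128, 512, 2048]:
    model = build_model()
    start = time.time()
    model.fit(X, y, epochs=1, batch_size=batch_size, verbose=0)
    print(f"batch_size={batch_size}: {time.time() - start:.2f} s per epoch")
```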
Grid search used to be the go-to technique for selecting hyperparameters. However, it has been shown that random search over your domain space is considerably more efficient: http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf
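In that spirit, here is a small random-search sketch; the scikit-learn MLPClassifier, the parameter ranges, and the budget of 25 configurations are illustrative assumptions, not part of the original answer:

```python
# Random search over a hypothetical hyperparameter space (scikit-learn assumed).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import RandomizedSearchCV
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

# Ranges to sample from; these are placeholders for a real search space.
param_distributions = {
    "hidden_layer_sizes": [(32,), (64,), (64, 32), (128, 64), (64, 64, 32)],
    "alpha": np.logspace(-5, -1, 50),            # L2 regularization strength
    "learning_rate_init": np.logspace(-4, -1, 50),
    "batch_size": [32, 64, 128, 256],
}

search = RandomizedSearchCV(
    MLPClassifier(max_iter=300, early_stopping=True, random_state=0),
    param_distributions,
    n_iter=25,      # number of random configurations to try
    cv=3,
    n_jobs=-1,
    random_state=0,
)
search.fit(X, y)
print("Best parameters:", search.best_params_)
print("Best CV accuracy:", round(search.best_score_, 3))
```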
You can also study the effect of each of the hyperparameters through a Design of Experiments. Article: Design of Experiments and Response Surface Methodology to Tu...
In my experience, when you have a small number of features, many layers with few units usually work best, and when you have many features, few layers with many units. Obviously this depends on the problem and the data you have, but it may give you a head start.
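A rough illustration of those two styles, again assuming Keras; the layer counts and unit sizes are arbitrary placeholders chosen only to make the contrast visible:

```python
# Sketch of the two architecture styles described above (Keras assumed; sizes are illustrative).
from tensorflow import keras

def deep_narrow_model(n_features):
    """Many layers with few units -- suggested for a small number of features."""
    return keras.Sequential([
        keras.Input(shape=(n_features,)),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])

def shallow_wide_model(n_features):
    """Few layers with many units -- suggested for many features."""
    return keras.Sequential([
        keras.Input(shape=(n_features,)),
        keras.layers.Dense(256, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])

deep_narrow_model(10).summary()
shallow_wide_model(500).summary()
```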
Thank you all for responding and sharing useful links. I also ran an optimization on each of the aforementioned parameters by adding a simple loop and testing different values for each one. For example, in the attached figure, I calculated the error rate for different numbers of epochs and finally selected the value that resulted in the minimum error rate.
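For readers who want to try the same thing, here is a rough reconstruction of that kind of loop; the Keras model, synthetic data, and epoch values are placeholders and not the actual setup behind the attached figure:

```python
# Sketch of a simple epoch sweep: train at several epoch counts, keep the one with lowest error.
import numpy as np
from tensorflow import keras
from sklearn.model_selection import train_test_split

X = np.random.rand(2000, 20)
y = np.random.randint(0, 2, size=2000)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

def build_model():
    model = keras.Sequential([
        keras.Input(shape=(20,)),
        keras.layers.Dense(64, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model

error_rates = {}
for n_epochs in [10, 25, 50, 100, 200]:
    model = build_model()
    model.fit(X_train, y_train, epochs=n_epochs, batch_size=64, verbose=0)
    _, acc = model.evaluate(X_val, y_val, verbose=0)
    error_rates[n_epochs] = 1.0 - acc     # error rate on the validation set

best_epochs = min(error_rates, key=error_rates.get)
print("Error rates:", error_rates)
print("Epoch count with minimum error:", best_epochs)
```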