I would say the hyper-parameters are among the most important parts of the model: for example, the number of hidden layers, the number of units per hidden layer, and the regularization strength. These parameters decide whether the model overfits or underfits a specific data set.
An extreme example: if we use a single hidden layer with one hidden unit and set the regularization parameter to 10 million, the model will learn nothing useful.
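To make that extreme case concrete, here is a minimal pure-Python sketch. It uses a one-weight ridge regression as a stand-in for a tiny, heavily regularized network (an assumption for illustration, not a real deep net): a huge L2 penalty drives the learned weight to essentially zero, so the model predicts almost nothing.

```python
# Toy stand-in for a one-unit, heavily regularized model: fit y ~ w * x
# with an L2 penalty lam on the single weight w.
def fit_ridge(xs, ys, lam):
    # Closed-form minimizer of sum((w*x - y)^2) + lam * w^2
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)

xs, ys = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]   # perfectly linear data, y = 2x

w_plain = fit_ridge(xs, ys, lam=0.0)    # recovers w = 2.0
w_huge  = fit_ridge(xs, ys, lam=1e7)    # w ~ 0: the model outputs almost nothing

print(w_plain, w_huge)
```

With `lam=0` the true slope is recovered exactly; with `lam=1e7` the weight collapses toward zero and every prediction is near zero, i.e. severe underfitting.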
What I believe is that the central challenge in training deep architectures is dealing with the strong dependencies that exist during training between the parameters of different layers (or levels). To address this difficulty we must do two things simultaneously:
- adapt the lower layers so that they provide adequate information for the final adjustment (end of training) of the layers above,
- adapt the layers above so that they make good use of the final adjustment (end of training) of the lower layers.
But this is not easy: the parameters still have to be chosen, and this is only one way of looking at it.
The question is unclear to me: do you want to know the importance of hyperparameters in general, or the role of each hyperparameter? More or less all of them are important, but their impact also depends on the loss or objective function you use. For instance, in image denoising, choosing the right receptive field is very important, but when dealing with action recognition you can start with almost any value and change it trivially as you design the layers. An optimizer such as Adam works better in image denoising and single-image super-resolution, but in most action-recognition cases the SGD or AdaDelta optimizers dominate. There are a lot of other parameters that depend on the network you design, and tuning each of them is necessary in order to achieve optimal results. I would recommend reading the Deep Learning book by Ian Goodfellow et al.; the parameters are explained in detail, be it the use of the dropout ratio or early stopping, and in an easily readable manner.
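To make the optimizer point concrete, here is a minimal pure-Python sketch comparing plain SGD with a hand-rolled Adam update on a toy 1-D quadratic loss. The loss, starting point, and learning rates are illustrative assumptions, not a real vision task; the sketch only shows the update mechanics, not which optimizer is better.

```python
import math

def grad(w):
    # Gradient of the toy loss f(w) = (w - 3)^2, minimized at w = 3
    return 2.0 * (w - 3.0)

def sgd(w, lr=0.1, steps=2000):
    # Plain gradient descent: step against the gradient at a fixed rate
    for _ in range(steps):
        w -= lr * grad(w)
    return w

def adam(w, lr=0.1, steps=2000, b1=0.9, b2=0.999, eps=1e-8):
    # Adam: per-parameter step scaled by running moment estimates
    m = v = 0.0
    for t in range(1, steps + 1):
        g = grad(w)
        m = b1 * m + (1 - b1) * g          # first-moment (mean) estimate
        v = b2 * v + (1 - b2) * g * g      # second-moment (uncentered variance)
        m_hat = m / (1 - b1 ** t)          # bias correction
        v_hat = v / (1 - b2 ** t)
        w -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return w

w_sgd, w_adam = sgd(0.0), adam(0.0)
print(w_sgd, w_adam)   # both end near the minimum at 3.0
```

On a loss this simple both optimizers land near the minimum; the differences the answer describes only show up on the curved, noisy loss surfaces of real networks.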
Hyperparameters are an essential part of any deep network and help you optimize its quality. Changing them helps you get the desired results, or a solution to your problem.
Typical deep learning libraries and frameworks instantiate algorithms with default hyperparameters, so it is always recommended to run a hyperparameter search to find the optimal combination. This is important because fitting an algorithm to different datasets requires different parameter values in order to converge fully and get as close as possible to the global minimum.
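A minimal sketch of such a search, using pure Python and a hypothetical one-weight ridge model with a hand-picked grid (real searches would sweep many hyperparameters of a full network, e.g. with grid or random search): try each candidate value, score it on held-out validation data, and keep the best.

```python
# Hypothetical toy hyper-parameter search: choose the L2 strength lam
# that gives the lowest error on a held-out validation set.
def fit_ridge(xs, ys, lam):
    # Closed-form minimizer of sum((w*x - y)^2) + lam * w^2
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)

def val_error(w, xs, ys):
    # Sum of squared errors of predictions w * x on held-out data
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys))

# Slightly noisy training data (the true relation is y = 2x) and clean
# validation data the search never fits on.
x_tr, y_tr = [1.0, 2.0, 3.0, 4.0], [2.2, 4.1, 6.3, 8.2]
x_va, y_va = [5.0, 6.0], [10.0, 12.0]

grid = [0.0, 0.1, 1.0, 3.0, 10.0]
best_lam = min(grid,
               key=lambda lam: val_error(fit_ridge(x_tr, y_tr, lam), x_va, y_va))
print(best_lam)
```

Here neither the default of no regularization nor a heavy penalty wins: a moderate value in the middle of the grid generalizes best, which is exactly why searching rather than trusting defaults pays off.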