Choosing activation functions for an MLP depends on a couple of things, as discussed below.
1. The activation function for the output layer depends on whether you are performing classification or regression. For binary classification (i.e. two-class problems), the logistic-sigmoid function is used with binomial cross-entropy as the cost function. For multiclass classification (i.e. problems with more than two classes), the softmax function is used with multinomial cross-entropy as the cost function. For regression problems (i.e. real-valued outputs), the linear/identity function is used (see the first sketch after this list).
2. For hidden-layer units, the options depend on the depth of your model:
(a) For shallow models (1 or 2 hidden layers), the logistic-sigmoid, tangent-sigmoid or rectified linear function can be used; choosing the most appropriate one among these is usually a matter of experimentation.
(b) For deep models (i.e. more than 2 hidden layers), the rectified linear (ReLU) function is more appropriate, since it alleviates the vanishing-gradient problem (see the second sketch after this list).
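As a rough sketch of the output-layer/loss pairings from point 1, here in PyTorch (the feature and class counts are illustrative placeholders, not recommendations):

    import torch.nn as nn

    n_features = 16  # illustrative input dimension

    # Binary classification: one logit per example. BCEWithLogitsLoss fuses
    # the logistic-sigmoid with binary cross-entropy for numerical stability.
    binary_head = nn.Linear(n_features, 1)
    binary_loss = nn.BCEWithLogitsLoss()

    # Multiclass classification: one logit per class. CrossEntropyLoss applies
    # log-softmax internally, i.e. softmax + multinomial cross-entropy.
    n_classes = 5
    multiclass_head = nn.Linear(n_features, n_classes)
    multiclass_loss = nn.CrossEntropyLoss()

    # Regression: linear/identity output with a squared-error cost.
    regression_head = nn.Linear(n_features, 1)
    regression_loss = nn.MSELoss()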
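And a sketch of point 2(b): a deeper MLP with ReLU after every hidden layer (the layer widths are arbitrary, chosen only for illustration):

    import torch.nn as nn

    n_features = 16  # illustrative input width

    # Deep MLP per 2(b): ReLU in every hidden layer to limit vanishing
    # gradients; the final layer is a linear/identity head (e.g. regression).
    deep_mlp = nn.Sequential(
        nn.Linear(n_features, 64), nn.ReLU(),
        nn.Linear(64, 64), nn.ReLU(),
        nn.Linear(64, 64), nn.ReLU(),
        nn.Linear(64, 1),
    )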
I've used an MLP (ANN) for prediction. I gave up on two hidden layers, as I couldn't reach better results than with one hidden layer. I was told that in most cases a linear activation in the hidden layer works best. I checked many combinations of different parameters, including different activation functions: saturating linear (satlins in MATLAB), logistic, and hyperbolic tangent (tansig). In the end, the best prediction accuracy was achieved with a linear activation function in the hidden layer and tansig in the output layer. So it was many, many experiments in my case.
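For what it's worth, a minimal sketch of that kind of activation sweep, written in PyTorch rather than MATLAB; the data, layer widths, and training settings below are placeholders, not the poster's setup:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    torch.manual_seed(0)

    # Placeholder regression data; substitute your own dataset. Targets are
    # scaled into (-1, 1) so a tanh output layer can actually reach them.
    X = torch.randn(200, 8)
    y = torch.tanh(torch.randn(200, 1))

    # Candidate activations, loosely mirroring MATLAB's purelin/satlins,
    # logsig, and tansig transfer functions.
    hidden_acts = {"linear": nn.Identity, "logistic": nn.Sigmoid, "tanh": nn.Tanh}
    output_acts = {"linear": nn.Identity, "tanh": nn.Tanh}

    for h_name, h_cls in hidden_acts.items():
        for o_name, o_cls in output_acts.items():
            model = nn.Sequential(nn.Linear(8, 10), h_cls(),
                                  nn.Linear(10, 1), o_cls())
            opt = torch.optim.Adam(model.parameters(), lr=1e-2)
            for _ in range(200):  # short, illustrative training loop
                opt.zero_grad()
                loss = F.mse_loss(model(X), y)
                loss.backward()
                opt.step()
            print(f"hidden={h_name:8s} output={o_name:8s} final MSE={loss.item():.4f}")

Note that a tanh output can only produce values in (-1, 1), so it only makes sense when the targets have been scaled into that range; presumably that was the case here, since MATLAB's network training typically applies mapminmax scaling to inputs and targets by default.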