I would like to ask the neural nets community a question. There are many activation functions, such as Sigmoid, Tanh, and ReLU, used to fire a neuron, introduce non-linearity, and allow the weights in a network's layers to be updated effectively. Between Tanh and ReLU, which performs better in text classification tasks, and why?
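
For concreteness, here is a minimal sketch (my assumption: PyTorch, with a simple bag-of-embeddings classifier; the class name, dimensions, and vocabulary size are hypothetical) of the kind of model where I would swap the activation and compare the two:

```python
import torch
import torch.nn as nn

class TextClassifier(nn.Module):
    """Bag-of-embeddings text classifier with a configurable activation."""
    def __init__(self, vocab_size, embed_dim, hidden_dim, num_classes, activation="relu"):
        super().__init__()
        self.embedding = nn.EmbeddingBag(vocab_size, embed_dim)
        self.hidden = nn.Linear(embed_dim, hidden_dim)
        # The activation under comparison: Tanh vs ReLU
        self.act = nn.Tanh() if activation == "tanh" else nn.ReLU()
        self.out = nn.Linear(hidden_dim, num_classes)

    def forward(self, token_ids, offsets):
        x = self.embedding(token_ids, offsets)  # average word embeddings per document
        x = self.act(self.hidden(x))            # non-linearity applied here
        return self.out(x)                      # raw class logits

# Identical architectures, differing only in the activation:
model_relu = TextClassifier(20000, 100, 64, 4, activation="relu")
model_tanh = TextClassifier(20000, 100, 64, 4, activation="tanh")
```

Assuming both models are trained the same way on the same data, is there a principled reason to expect one activation to come out ahead on text classification?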