In a multilayer perceptron we generally use a sigmoid activation function in the hidden layer and a linear activation function in the output layer. What happens when a sigmoid function is used in the output layer, too?
If you use a sigmoid function in the output layer, you can train and use your multilayer perceptron to perform regression instead of just classification: the output layer produces continuous values (within the sigmoid's range) instead of binary ones. In the context of classification this can also be useful if you want a measure of the confidence of your classification.
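As a minimal sketch of the regression use (the scaling scheme and all names here are illustrative assumptions, not part of the answer above): since a logistic sigmoid output lives in (0,1), regression targets have to be rescaled into that range before training, and predictions rescaled back afterwards:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical regression targets in their natural units.
y = np.array([12.0, 35.0, 7.5, 28.0])

# Map targets into [0.1, 0.9] -- a margin inside (0, 1), since a
# sigmoid can only approach 0 and 1 asymptotically.
lo, hi = 0.1, 0.9
y_scaled = lo + (hi - lo) * (y - y.min()) / (y.max() - y.min())

# At prediction time, invert the scaling to recover the original units.
net_out = sigmoid(np.array([0.5, -1.0]))  # example sigmoid outputs
y_pred = y.min() + (net_out - lo) / (hi - lo) * (y.max() - y.min())
print(y_pred)
```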
A sigmoid activation function is bounded below and above. For instance, the logistic sigmoid function has range (0,1) and the hyperbolic tangent has range (-1,1). We often use a sigmoid activation function in the output layer when we are dealing with a classification problem rather than a regression problem, that is, when the output target is categorical.
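Concretely, the two functions are

$$\sigma(x) = \frac{1}{1+e^{-x}} \in (0,1), \qquad \tanh(x) = \frac{e^{x}-e^{-x}}{e^{x}+e^{-x}} \in (-1,1).$$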
It is common practice to use one output unit per class, and if we use a logistic sigmoid activation function for the output layer, the result is often interpreted as the posterior probability of the class given the input, p(c|x). Thus you can see the multilayer perceptron as a discriminative model, like logistic regression. My experience is that this approach is powerful for classification purposes, but it sometimes results in an ill-calibrated model. To avoid this problem you can also look at the softmax activation function.
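The softmax generalizes the logistic sigmoid to $K$ mutually exclusive classes and forces the outputs to sum to one, so they can be read jointly as a posterior distribution. With $a_k$ the pre-activation of the $k$-th output unit,

$$p(c_k \mid x) = \frac{e^{a_k}}{\sum_{j=1}^{K} e^{a_j}},$$

which reduces to the logistic sigmoid for $K = 2$.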
I'm a bit confused. The output contains the estimated values, which are then compared to the targets; during training, the weights of the MLP are adjusted in order to minimize the difference between output and target. If the output is transformed by some function, then so should the targets be; in the end we carry out some nonlinear weighting of the goodness, or cost function. This seems to me a matter of convenience that depends on the problem.
The statement isn't correct: in multilayer perceptrons the units are sigmoid functions, e.g. in the two-layer perceptron that can represent the XOR function. So it's rather the other way around: since the sigmoid units realize a classification task, one should instead ask what a linear output function is actually useful for, since such cases seem much more specialized. And, of course, through a classification scheme it is possible to generate probability distributions on target spaces, though the efficiency is another issue.
As far as I remember, the sigmoid, for solving the XOR problem, is applied in the hidden layer, whereas the output should be treated in the same way as the target. Did I get something wrong?
Well, for solving the XOR problem, you need a hidden layer of two sigmoid units whose results are fed into another sigmoid unit, the output unit, which gives the answer. So all units are sigmoid. You could, of course, use any activation function, but for XOR, at least, you need to state the mapping, which amounts to giving a threshold, i.e. a sigmoid function.
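Here is a minimal sketch of such a network with hand-picked weights (the particular values are illustrative assumptions; training would find others), showing that all three units are sigmoids:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Inputs: the four XOR cases, one per row.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

# Hidden layer: two sigmoid units acting as soft OR and soft AND gates.
# Steep weights make the sigmoids behave almost like hard thresholds.
W_hidden = np.array([[20.0, 20.0],    # weights into the "OR" unit
                     [20.0, 20.0]])   # weights into the "AND" unit
b_hidden = np.array([-10.0, -30.0])   # OR fires for sum >= 1, AND for sum >= 2

# Output layer: one sigmoid unit computing "OR and not AND", i.e. XOR.
w_out = np.array([20.0, -20.0])
b_out = -10.0

H = sigmoid(X @ W_hidden.T + b_hidden)  # hidden activations
y = sigmoid(H @ w_out + b_out)          # output activations

print(np.round(y, 3))  # approximately [0, 1, 1, 0]
```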
Both the logistic sigmoid function and the hyperbolic tangent represent a balance between linear and non-linear behavior. However, the logistic sigmoid takes only positive values, and that is a disadvantage for the network: its outputs are not zero-centered, which requires shifting the thresholds of the activation functions to mitigate it.
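In fact the two functions are rescaled versions of one another; the hyperbolic tangent is just the logistic sigmoid stretched and shifted so that it becomes zero-centered:

$$\tanh(x) = 2\,\sigma(2x) - 1.$$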