My data is very close to random, and with the sign function I cannot get good results. I think the sign function loses some information that the NN could otherwise find.
I'd suggest that any network using sigmoid-like limiting functions at its nodes can be mapped to binary in an almost unlimited number of ways, using a single threshold (sign), a pair of thresholds (binary classification with hysteresis, for time-varying data), or a trinary output with an "unknown" or confusion state. With many of the NNs I worked with, I included "not-possible" patterns trained to a midpoint level. This sometimes helped produce better generalization in the final result.
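As a concrete illustration of those three mappings, here is a minimal Python sketch; the function names and threshold values are illustrative assumptions, not part of the original suggestion:

```python
import numpy as np

def threshold_binary(y, theta=0.0):
    """Single-threshold (sign) mapping: sigmoid-like output -> {-1, +1}."""
    return np.where(y >= theta, 1, -1)

def threshold_hysteresis(ys, lo=-0.25, hi=0.25, start=-1):
    """Pair of thresholds with hysteresis for time-varying outputs:
    the binary state only flips when the output crosses the far threshold."""
    state, out = start, []
    for y in ys:
        if state == -1 and y >= hi:
            state = 1
        elif state == 1 and y <= lo:
            state = -1
        out.append(state)
    return np.array(out)

def threshold_trinary(y, lo=-0.25, hi=0.25):
    """Trinary mapping with an 'unknown'/confusion state between thresholds."""
    return np.where(y >= hi, 1, np.where(y <= lo, -1, 0))
```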
Training data that is "very close to random"? I doubt you can make up for a problem that lies in the measurements by choosing a different discriminant function. If there is no information in the data, then there is little to learn.
It may be the data, as Monther and others have said. I'd agree with James on trying a sigmoid for binary classification problems, in which case you can use the backpropagation algorithm or another gradient-descent variant for training; use tanh() if you want +1/-1 outputs. If you try this, you'll find it's important to match the learning rate (which sets the update step size) to the number of neurons, and the ability to generalise will depend on the topology, e.g. the number of neurons in the hidden layer(s), as Glen also mentioned (for example, you can have too many hidden neurons relative to the number of classes you expect). There are heuristics for optimising many of these choices. I discussed a few of them in one (rather old) paper, http://eprints.nottingham.ac.uk/1901/, which is also on RG. It focusses on convergence but is also relevant to generalisability.
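To make the tanh/backpropagation suggestion concrete, here is a minimal numpy sketch with ±1 targets; the toy data, hidden-layer size, learning rate, and epoch count are all illustrative assumptions, not recommendations from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data (illustrative): two Gaussian blobs, targets in {-1, +1} to suit tanh.
X = np.vstack([rng.normal(-1.0, 1.0, (50, 2)), rng.normal(1.0, 1.0, (50, 2))])
t = np.concatenate([-np.ones(50), np.ones(50)])
N = len(X)

n_hidden = 5   # topology choice: too many hidden units can hurt generalisation
lr = 0.1       # learning rate: controls the update step size

W1 = rng.normal(0.0, 0.5, (2, n_hidden)); b1 = np.zeros(n_hidden)
W2 = rng.normal(0.0, 0.5, n_hidden);      b2 = 0.0

for epoch in range(1000):
    # Forward pass: tanh at hidden and output layers, so outputs lie in (-1, +1).
    h = np.tanh(X @ W1 + b1)
    y = np.tanh(h @ W2 + b2)

    # Backpropagation of the mean squared error against the +/-1 targets.
    dy = (y - t) * (1.0 - y**2) / N          # output delta (tanh derivative)
    dh = np.outer(dy, W2) * (1.0 - h**2)     # hidden-layer delta

    W2 -= lr * (h.T @ dy);  b2 -= lr * dy.sum()
    W1 -= lr * (X.T @ dh);  b1 -= lr * dh.sum(axis=0)

# Apply the hard sign only at decision time, after training on smooth outputs.
pred = np.sign(np.tanh(np.tanh(X @ W1 + b1) @ W2 + b2))
print("training accuracy:", (pred == t).mean())
```

Note how the hard sign appears only when reading off the final decision; training on the smooth tanh output is what preserves the gradient information that a sign nonlinearity would throw away.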