Is there a link between threshold and classifier training error?

More Safi Ullah's questions See All

Is there an English Translation of the Carl Moller text: ZUR VERGLEICHENDEN ANATOMIE DER SILURIDEN?

I recently came across an anatomy text by Carl Moller that was published in 1915 but it is in German or Dutch neither of which I can understand. I would like to know if there is an English...

10 August 2024 4,347 1 View

How to send my account link and how to make my account public?

Nothing

01 August 2024 755 1 View

Transfection in HEK293T cells?

Dear All, I am trying to transfect a pCDNA3.1 vector containing my gene of interest. The purpose is to figure out the localization of the protein of interest. I have fused the protein with GFP on...

31 July 2024 9,892 4 View

Are you looking for research collaboration ?

we have few papers ready for submission, and we need one co-author for each article who can pay article fee. Interested authors may text here or contact me on my following email id [email protected]

29 July 2024 6,626 0 View

What publications should I target as a psychology masters student in the UK?

I am writing a paper as a part of my course. I am new in London and was wondering that what publications should look upto?

21 July 2024 3,538 1 View

I have two problems: 1) the enzyme is not immobilizing efficiently into the MOF material.. 2) the MOF itself has peak on 400nm by using p-NPA test.?

I am working on carbonic anhydrase immobilization into MOFs. I am facing problems with low enzyme loading.. The other issue is that when using p-NPA activity test to detect the activity of the...

20 July 2024 1,440 3 View

Why my gel electrophoresis have shadow bands? Please see the attached picture for the gel electrophoresis ?

Sometimes I see the shadow like bands and its not true band. I want to know that what's the reason for it. I am using 2% gel for running genotyping samples I have uploaded the gel picture in both...

19 July 2024 148 6 View

What analysis to use for an dependent variable with repeated measures and a independent variable only measured once?

Hi all, I am trying to use mixed effect model to analyze my data, which including a baseline measurement for my exposure (A), and repeated measurements for the outcome (B). I do have some...

17 July 2024 8,682 3 View

I need to know the required time for VDF heating with water for 50-80°C for PVDF Polymerization in ?

i need to know the required time

15 July 2024 4,796 2 View

How Can I Resolve a Persistent Unwanted NMR Peak at 1.25 ppm?

Hello everyone, I am facing a consistent issue in my NMR spectra with an unwanted peak appearing at 1.25 ppm. This peak seems to vary with the amount of sample: it becomes more pronounced with...

15 July 2024 9,065 4 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Which Scopus Journal provides the most affordable fees?

"PUBLISHING IN A SCOPUS JOURNAL" Researchers are now at a cross road. The critical need to publish in a Scopus or ISI, etc journal is ever vital. Journal Publication fees must be submitted....

10 August 2024 8,621 1 View

Seeking Advice on Viability and Execution of Undergraduate Thesis Topic?

Hello everyone, I am currently developing a thesis proposal and would appreciate your input on its viability and how to effectively carry it out. My proposed topic is: "Does the perceived threat...

10 August 2024 8,992 0 View

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

Who will be moral responsible for the death of thousands of people in the event of an earthquake?

Who will bear moral responsibility for the deaths of thousands of people in the event of an earthquake? Weeks and months remain before the onset of strong earthquakes that bring death to...

08 August 2024 6,134 12 View

What are examples of AI for good projects a teacher can assign to students?

So I am organizing an AI seminar. What are possible AI projects in the AI for good spirit? something the students can do and have an impact?

08 August 2024 9,437 4 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How to design human-centered classroom in the age of A.I.?

08 August 2024 347 5 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

Giovanni Tessitore Popular answer

Take a look at the ROC curve (Receiver Operating Curve). This method enables ones to set the optimal threshold on the basis of the desidered FP rate and TP rate.

Marco Trincavelli

hi Safi, i think that to answer this question is better to have some more details. binarizing the output of your network you mean? which kind of activation function you use for the output neuron?

Safi Ullah

thank you Trincavelli, Yeah I want to binarize the out put. and the activation function is I am using tanh

Fidel Alfaro-Almagro

Why don't you just simply try different threholds and calculate a correlation between them and the MSE?

yeah, that is for usre an option. Howver if you would find an optimal threshold value which is very different from 0 I would investigate whether the training was succesful or something went wrong...

Cheers

Giovanni Tessitore

Nazanin Kermani

Threshold has direct impact on the error since based on the threshold the samples are assigned to one of the classes. In binary classification (assume two classes C1 and C2) if the output node takes values in the [-1,1] interval, which in your case, the hyperbolic tangent values fall in the [-1,1] interval. Then threshold/cutoff of B (e.g. 0.4) means if the output value of a predicted sample is bigger than B than it is labeled as class C1 otherwise class C2. So you can tune the threshold to get a reasonable performance both in terms of speciﬁcity and sensitivity. Mostly what you tune is this but in a more conceptual level you can consider the threshold not only for the output nodes but also for all nodes/neurons in the network.

Threshold has a great impact on the topology of your neural network, based on the threshold neurons get on or off which means that if the value of activation function of a neuron exceeds the threshold, it outputs a quantity otherwise it will be off. Therefor the number of active neurons and their connections can be manipulated by changing the threshold(theta).

Regarding the class imbalance issue if the samples in the smaller class are more important or more costly to generate or just sparse using the threshold you can put them in favor. For example when you put the threshold to '0' the probability of a sample classified as class A and B is equal but as you from the beginning the samples of class B are less abundant you can fix the threshold smaller/lager for class B. As I said you can run the classification with different threshold and see which threshold generate the most accurate classification results. But aware of over tunning and use cross validation methods before fixing your final parameters.

Sundaram Subramaniam

The error in the classification depends on the separation between the probability distribution functions in the n- dimensional feature space of the samples. The classification error occurs when there is overlap distribution functions between the two classes.

Aureli Soria-Frisch

ROC is the standard way of analyzing this. The binarization threshold (aka decision threshold) defines two types of performance measures: true positive rate (percentage of the samples when the actual decision should be 1, and the binarization threshold delivers indeed a 1 ), and false positive rate (percentage of the samples where actual decision 0, and binarization threshold delivers 1). If you iterate for different values of the binarization threshold you obtain different TPRs and FPRs, which you can display in a graph TPR vs FPR. This is the ROC curve.

See work by Fawcett - ROC Graphs: Notes and Practical Considerations for Researchers, or it paper in Pattern Recognition Letters, "An introduction to ROC analysis". Very nice works!!

Samaneh Abdoli

of course there is different answer for a Network with different threshold values.So the error will change. You must find a threshold value with the least error

Eric Koh

i assume it is multidimensional inputs. yes, choice of threshold has direct impact on the results / error. what are other consideration factors for the end results? i had used ROC to evaluate and optimise my model so it can work well for accuracy performance and low false positive alarm rate. it is tendious but it is systematic approach to adjust the threshold. Hope it helps.

Hadi sadoghi yazdi

training of net with imbalance data of two classes must be done with robust training against imbalance. You continue solving this problem with training net with imbalance data set

Tanvi Banerjee

I just wanted to mention that you need to be careful during training since your dataset seems biased towards class A. Usually training algorithms assume both classes are equally represented, you may want to fix a prior probability of a data vector being in Class A or B..

Evaldas Vaiciukynas

Nafiseh, in my humble opinion, cost of log likelihood ratio (Cllr) is much more powerful and consistent summary of confusion matrices (obtained from all possible thresholds AND class imbalances), than that Index of Balanced Accuracy (IBA), which these Spanish scientists keep on proposing in recent years (since 2007, I believe). =]. In this light, IBA looks like a toy solution, while Cllr was also generalized to multiclass case to obtain multiclass cross-entropy (Cmxe). More info on this here: http://arantxa.ii.uam.es/~jms/seminarios_doctorado/abstracts2007-2008/20070226NBrummer.pdf

José Salvador Sánchez

Evaldas, I believe that you have not understood the meaning and purpose of the IBA measure when you are saying that this is a toy solution. Probably you should read our papers once again in order to gain insight on this measure, which has been proposed with the aim of overcoming the bias towards the majority class in an imbalance problem.

James Walter Taylor

I'm wondering about the intent behind your question, I don't read your question as directly asking about thresholds in the ROC sense of a fixed linear discriminant function and a performance trade-off, but in the sense of determining the impact of say, a non-symmetric threshold used to binarize the output, and the use and effect of that threshold on training and convergence. Correct?

If the latter, correct classifications would not produce any training feedback of the network, so in the limit, where the threshold is set to one limit or the other of your sigmoidal limiting function, no training will be produced for one class or the other. In effect you introduce a cost function, emphasizing one class over the other. In that situation, MSE at a fixed number of iterations, I think, would have more to do with the complexity of the classes in your feature space than anything else.