In case of balanced classes, what is the best metric to evaluate a supervised binary classifier that predicts if a tweet will be relevant or not to a user: MCC (Matthews correlation coefficient) or F1-Score (F-Measure)?
It depends on the problem domain. There exist several off-the-shelf metrics, e.g. accuracy, precision, recall, and so on. Each of these metrics captures a different aspect of performance, so the best metric cannot be generalized; it depends on what the point of interest is in your problem domain. That said, the most commonly used metric for evaluating a supervised binary classifier with balanced classes is accuracy.
If your classes are balanced, why not use the most intuitive measure, accuracy? If you are not sure that your classifier chose the best decision boundary, you should go for AUC, which is equivalent to the probability that your classifier assigns a higher score to a relevant tweet than to an irrelevant one. Be careful with other metrics, because they often do not have a clear meaning, and as soon as you get into the regime of unbalanced classes, most metrics become misleading; the F-Measure in particular should not be used at all (proof in the attached paper).
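The ranking interpretation of AUC mentioned above can be verified directly. Here is a minimal sketch, using made-up scores and labels, that computes AUC with scikit-learn and compares it to the fraction of (relevant, irrelevant) pairs that the classifier ranks correctly:

```python
# Sketch: AUC equals the probability that a randomly chosen relevant
# tweet receives a higher score than a randomly chosen irrelevant one.
# The labels and scores below are illustrative, not real data.
from itertools import product

from sklearn.metrics import roc_auc_score

y_true = [1, 1, 1, 0, 0, 0]               # 1 = relevant, 0 = irrelevant
scores = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]  # hypothetical classifier scores

auc = roc_auc_score(y_true, scores)

# Pairwise check: count (relevant, irrelevant) pairs ranked correctly,
# with ties counted as half a correct pair.
pos = [s for s, y in zip(scores, y_true) if y == 1]
neg = [s for s, y in zip(scores, y_true) if y == 0]
pairs = list(product(pos, neg))
correct = sum(p > n for p, n in pairs) + 0.5 * sum(p == n for p, n in pairs)
pairwise = correct / len(pairs)

print(auc, pairwise)  # both print the same value
```

The two quantities coincide by definition; this is why AUC is insensitive to the choice of a single decision threshold.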
In the case of a balanced dataset, precision, recall, and F1 are good measures. If your dataset is unbalanced, you are better off using the ROC curve or macro/micro-averaged precision.
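To make the comparison in this thread concrete, here is a minimal sketch that computes accuracy, precision, recall, F1, and MCC on a small, balanced set of made-up predictions (3 relevant tweets, 3 irrelevant):

```python
# Sketch: the metrics discussed in this thread, computed with
# scikit-learn on a balanced toy example (illustrative data only).
from sklearn.metrics import (accuracy_score, f1_score, matthews_corrcoef,
                             precision_score, recall_score)

y_true = [1, 1, 1, 0, 0, 0]  # balanced: 3 relevant, 3 irrelevant tweets
y_pred = [1, 1, 0, 0, 0, 1]  # hypothetical classifier output

acc = accuracy_score(y_true, y_pred)       # (TP + TN) / total
prec = precision_score(y_true, y_pred)     # TP / (TP + FP)
rec = recall_score(y_true, y_pred)         # TP / (TP + FN)
f1 = f1_score(y_true, y_pred)              # harmonic mean of prec and rec
mcc = matthews_corrcoef(y_true, y_pred)    # correlation over all 4 cells

print(f"accuracy={acc:.3f} precision={prec:.3f} "
      f"recall={rec:.3f} F1={f1:.3f} MCC={mcc:.3f}")
```

Note that MCC uses all four confusion-matrix cells (including true negatives), while F1 ignores true negatives; with balanced classes the two usually tell a similar story, but they are not interchangeable.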
This paper provides a detailed explanation, with numerical examples, of many classification assessment methods (classification measures) such as accuracy, sensitivity, specificity, the ROC curve, the Precision-Recall curve, the AUC score, and many other metrics. The paper covers the ROC curve, PR curve, and Detection Error Trade-off (DET) curve in detail, and also explains several measures that are suitable for imbalanced data.