Which metric (precision, recall, F1, or accuracy) is best for evaluating a machine learning / deep learning model on imbalanced data? And how should the results be explained and presented in a research paper using accuracy, precision, recall, and F1?
In my opinion, all the elements you mention for evaluating the performance of DL models are very good for judging the robustness of your model; together they are known as the classification report. We can also plot the confusion matrix to visualize the confusion between the classes.
I hope that is clear for you. @Ibrahim mohamed Gad
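A minimal sketch of the classification report and confusion matrix mentioned above, assuming scikit-learn is available; the toy labels are made up for illustration:

```python
from sklearn.metrics import classification_report, confusion_matrix

# Illustrative imbalanced toy labels (class 1 is the minority).
y_true = [0, 0, 0, 0, 1, 1, 0, 1, 0, 0]
y_pred = [0, 0, 0, 1, 1, 0, 0, 1, 0, 0]

# Per-class precision, recall, F1, and support in one table.
print(classification_report(y_true, y_pred, digits=3))

# Rows are true classes, columns are predicted classes.
print(confusion_matrix(y_true, y_pred))
```

The classification report gives the per-class breakdown a paper would typically tabulate, while the confusion matrix shows exactly which classes get confused.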
See the article “Classification Assessment Methods: a detailed tutorial”.
There is no such thing as an absolute measure of performance; at the end of the day, it all depends on your application. For instance, if you know the costs of TP, FP, TN, and FN, then the expected cost should be your preferred metric.
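The expected-cost idea can be sketched in a few lines; the per-outcome cost values below are hypothetical assumptions, chosen to make false negatives expensive:

```python
def expected_cost(tp, fp, tn, fn,
                  c_tp=0.0, c_fp=1.0, c_tn=0.0, c_fn=5.0):
    """Average cost per prediction, given outcome counts and per-outcome costs.

    The default costs are illustrative: correct predictions cost nothing,
    a false positive costs 1, a false negative costs 5.
    """
    total = tp + fp + tn + fn
    return (tp * c_tp + fp * c_fp + tn * c_tn + fn * c_fn) / total

# (30 * 1 + 10 * 5) / 1000 predictions:
print(expected_cost(tp=20, fp=30, tn=940, fn=10))  # → 0.08
```

With application-specific costs plugged in, two models with identical accuracy can have very different expected costs, which is the point of the comment above.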
Consider performing a pre-processing step to "balance" the dataset so that bias in the network's performance is eliminated, or at least reduced. This can be done, for example, by removing some examples from the over-represented (majority) class. I would argue it is better to train the network on a small, balanced dataset than on a large, unbalanced one.
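A minimal random-undersampling sketch of this pre-processing step, using only the standard library; the class sizes below are illustrative assumptions:

```python
import random

def undersample(samples, labels, seed=0):
    """Keep every minority-class example and an equal-size random
    subset of each larger class, returning shuffled (x, y) pairs."""
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)
    n_min = min(len(xs) for xs in by_class.values())
    rng = random.Random(seed)
    out = []
    for y, xs in by_class.items():
        for x in rng.sample(xs, n_min):  # random subset of size n_min
            out.append((x, y))
    rng.shuffle(out)
    return out

# 90 majority-class vs 10 minority-class examples -> 10 of each kept.
balanced = undersample(list(range(100)), [0] * 90 + [1] * 10)
print(len(balanced))  # → 20
```

The trade-off named above is visible here: the balanced set is much smaller, so undersampling discards information from the majority class in exchange for removing the bias.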
You can't rely on the accuracy measure when you have imbalanced data, because it can be deceiving.
But you should still report a high accuracy to show that your model is working well overall.
Then come the most important measures: precision, recall, and F1 score.
Your model is making a lot of false positive predictions; that's why your precision is not high (that's not a good thing).
Your model is not making a lot of false negative predictions; that's why your recall is higher (that's the good thing).
The F1 score is the harmonic mean of precision and recall (keep in mind it is not an ordinary average: the more general F-beta score can weight precision or recall more heavily depending on the beta value).
Your F1 score looks decent because it also reflects your recall (which is already good).
For me, though, your precision is not good, and that can't be overlooked.
My recently published paper, “Applying Separately Cost-sensitive Learning and Fisher's Discriminant Analysis to Address the Class Imbalance Problem: A Case Study Involving a Virtual Gas Pipeline SCADA System” (https://www.sciencedirect.com/science/article/pii/S1874548220300214), may be helpful.