Which metrics would you recommend for evaluating models in a Multiclass Imbalanced classification problem?

More Hussein Aldor's questions See All

How Social Media Affects Your Mental Health ?

How Social Media Affects Your Mental Health

04 August 2024 6,961 3 View

Do You Know Untreated TMJ Affects Your Brain And Entire Body?

04 August 2024 6,282 1 View

What are the most PROBLEMS of HYBRID nanofluid in oil industry ?

explain your idea

23 July 2024 1,197 0 View

Who wants opportunities for scientific cooperation?

Dear Colleagues, I hope this message finds you well. My name is Noor Al-Huda K. Hussein,and I am a researcher specializing in deep learning applications in genetic data analysis. I am currently...

18 July 2024 5,562 0 View

Who wants opportunities for scientific cooperation?

Dear Colleagues, I hope this message finds you well. My name is Noor Al-Huda K. Hussein, and I am a researcher specializing in deep learning applications in genetic data analysis. I am currently...

16 July 2024 3,981 6 View

Wireless insite3d convert path in to image ????

hello dear i need to learn this program any one know ??? i want to convert path propagation to Image reconstruction 2d how is that done ? lik this image??

14 July 2024 1,811 0 View

Wireless insite3d convert path in to image ????

how to convert the area with 4 node and convert it to reconstruction image in wireless insite?like the fig

14 July 2024 4,435 0 View

What are the recommended rapid response scopus indexed journals in computer science?

Kindly provide their URLs. Thank you

30 June 2024 8,475 0 View

What Are the Main Reasons for Publications?

23 June 2024 3,199 1 View

How to Calculate the Sample Size in an Analytical Study?

23 June 2024 3,375 3 View

How combine yolo with Faster R-CNN?

I want a model that is balanced with accuracy or speed, faster rcnn has high accuracy while yolo have fast speed. i am thinking to combine them to get a hybrid model to achieve both speed and accuracy

02 August 2024 3,104 0 View

Is a reliability test necessary in my survey on translations?

Dear all, I gave 116 respondents 18 translated sentences and asked them to indicate their levels of acceptance of these translations on a five-point scale. Some translations result from strategies...

24 July 2024 8,245 5 View

How do microorganisms interact with their environment and what can be the interaction between micro organisms and macro organisms?

23 July 2024 8,467 4 View

How do we pick data for determination of Validation Acceptance Criteria?

Hello, colleagues! There is commenting open for new upcoming edition of USP 1033. Validation target acceptance criteria is now different from what it used to be and it doesn't include Cpm....

23 July 2024 7,292 3 View

Which research tool for expert validation for our study?

I am a 3rd year Computer Science student currently writing our Bachelor's thesis about finding diverse k-shortest paths in pedestrian networks. We have chosen 3 local areas as our proposed...

15 July 2024 4,289 0 View

What is trustworthiness in qualitative research and how can you improve reliability accuracy and validity?

12 July 2024 9,035 6 View

How is artificial intelligence being utilized to enhance the diagnosis and treatment of sleep apnea?

AI has the potential to improve the management of sleep apnea by personalizing treatment, enhancing diagnostic accuracy, and advancing our understanding of the condition.

03 July 2024 9,393 2 View

List of journals impact factors?

Dear colleagues, Is it possible to send me the list of journals impact factor for the year 2024 (classification is for the year 2023)? excel format if it is possible. Thank you in...

29 June 2024 2,102 3 View

After training an XGBoost model using K-fold cross-validation, Can I use SHAP to interpret this dataset?

In other words, I did not use the trained XGBoost model to make predictions on the test set and then use SHAP for interpretation. The reasons are as follows: Even with the best and most...

26 June 2024 7,332 1 View

Experts on orchid viruses for an article for home gardeners?

I am writing an article for my blog regarding orchid viruses aimed at home gardeners. It will include details of home testing kits their use and accuracy. I also want to include details about...

26 June 2024 7,422 1 View

Victor Leme Beltran

My suggestion would be, if possible, for you to group the classes that have a good balance between them, into one major class, in order to create a class division that is more balanced in a overall. In this case, you would execute one first model for distingushing between these major classes, that are more balanced between them and afterwards, for each algorithm you would execute a algorithm for distinguishing between the classes that are inside the major class. Although it may seem unintuitive to have 2 models executing in series as the probability of a correct answer would be the result of the multiplication of the accuracy from both, in previous experience of mine, i found this to be more effective other than just trying to directly creating one major model for all classes, specially, because, by doing this we should reduce the bias in the model.

As an example, i had a similar case, of imbalance for a text classification problem, where i applied the strategy above, and used for it, a sequential neural network combined with cross validation for distinguishing between the classes. It proved in this case to be much more effective than trying to take the entire data and creating a mega model at once.

As an observation, it is well possible to use KNN as a method for performing this classification as well.

For metrics, what i've previously used are accuracy and AUC. These have been proved to be good for my projects, but there are other metrics that may as well be used.

Qamar Ul Islam

Dear Hussein Aldor

The ROC Curve or ROC Analysis is the most often used ranking metric. ROC is an abbreviation that stands for Receiver Operating Characteristic and refers to a branch of research that examines binary classifiers based on their ability to differentiate classes.

Kind Regards

Sakinat Oluwabukonla Folorunso

I recommend recall!/ sentivity metric. Since class imbalance problem allied with class disjucts and overlap gives suboptimal classification performance. This metric will allow you to see the rate of detection for all class. ROC is too optimistic a metric to be used

Asier Rabasco Meneghetti

It might be useful to consider the macro f1-score. That way you can obtain a score that gives all classes the same weighting while in the end being a harmonic mean of such macro precision and recall.

It might also be interesting then, given the number of classes, to either consolidate some of them together in a way that makes sense for the problem you are facing (maybe some insights from literature review) or with clustering over the classes to see if they naturally join together given a set of features you are using for classification, or from a confusion matrix to see which classes get mixed up together the most.

Then creating different models for each groups of classes to further distinguish between them.

Arvind Kumar

G-mean = sqrt(sensitivity*specificity) is one simple metric for measuring performance of imbalanced classification problems.

Dr.Siddanagouda Somanagouda Patil

As I feel the spectral classification would be the solution to your problem. The major and minor classes would be distinguished as subclass. Superclass and Subclass approaches with Bayesian/Naive Bayesian classifier can lead to obtaining the dominant and non-dominant classes. Chameleon algorithms reorganize the subclass and class within the class.