The simplest way to evaluate your classifier is to train the SVM on 67% of your data and test it on the remaining 33%. Alternatively, if you have two data sets, train the SVM on the first and test it on the second.
In that case the first data set is used to train the SVM, and the second data set, which is not perfect (e.g. it contains noise), is used to test the trained SVM.
To measure performance, you can use accuracy, precision, recall, the F1-score (or F-measure) and Cohen's kappa.
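As an illustration, here is a minimal sketch of that split and those metrics, assuming scikit-learn; the synthetic data set is only a placeholder for your own X and y.

# A minimal sketch of a 67/33 train/test split for an SVM, assuming scikit-learn.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, cohen_kappa_score)

# Synthetic data stands in for your own X, y.
X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# 67% of the data trains the SVM, the remaining 33% tests it.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.33, random_state=0)

clf = SVC(kernel="rbf").fit(X_train, y_train)
y_pred = clf.predict(X_test)

print("accuracy :", accuracy_score(y_test, y_pred))
print("precision:", precision_score(y_test, y_pred))
print("recall   :", recall_score(y_test, y_pred))
print("F1-score :", f1_score(y_test, y_pred))
print("kappa    :", cohen_kappa_score(y_test, y_pred))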
There are generally two accepted validation methods: percentage splitting and cross-validation. If you have enough data, you can split it into a training part and a test part of about 70% and 30% of the data respectively. You should then run the learning algorithm 10 or more times and report the average of the metrics (precision, recall, F-measure, AUC). For smaller data sets, k-fold cross-validation is better (e.g. k = 5 or 10). Cross-validation is also advised if you want to assess the robustness of the learner.
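A sketch of both options, again assuming scikit-learn (the synthetic X and y are placeholders for your own data):

# Option 1: 70/30 split, repeated 10 times, report the average F-measure.
# Option 2: 10-fold cross-validation on the whole data set.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.svm import SVC
from sklearn.metrics import f1_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

scores = []
for seed in range(10):
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.30, random_state=seed)
    y_pred = SVC().fit(X_tr, y_tr).predict(X_te)
    scores.append(f1_score(y_te, y_pred))
print("mean F-measure over 10 random splits:", np.mean(scores))

cv_scores = cross_val_score(SVC(), X, y, cv=10, scoring="f1")
print("mean F-measure over 10 folds        :", cv_scores.mean())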
As Kouser said: to evaluate performance, you calculate the percentage of correctly classified observations of class 1 (the sensitivity) and of class 2 (the specificity), then plot sensitivity versus 1 - specificity over all thresholds, which gives the ROC curve.
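Assuming scikit-learn and matplotlib, that plot can be sketched roughly as follows (the synthetic data is only a stand-in for your own):

import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import roc_curve

X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.33, random_state=0)

# A continuous score is needed so that roc_curve can sweep over all thresholds.
scores = SVC().fit(X_tr, y_tr).decision_function(X_te)
fpr, tpr, _ = roc_curve(y_te, scores)   # fpr = 1 - specificity, tpr = sensitivity

plt.plot(fpr, tpr)
plt.xlabel("1 - specificity (false positive rate)")
plt.ylabel("sensitivity (true positive rate)")
plt.show()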
In order to evaluate the performance of your classifier (using hold-out or k-fold cross-validation), reliability can be assessed by computing the percentage of correctly classified events as well as by a complete confusion matrix, which summarizes how many instances of the different event classes were confused by the system. The rows of a confusion matrix show the number of instances in each actual event class (defined by the ground truth), while the columns show the number of instances in each predicted event class (given by the classifier's output).
Generally, classification performance can be measured by the F-score = 2 × Se × P / (Se + P), where P = TP / (TP + FP) is the precision (the probability that a classification of that event type is correct), and Se = TP / (TP + FN) and Sp = TN / (TN + FP) are the sensitivity and specificity respectively. From these you can build the ROC curve and then compute the AUC (area under the curve).
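As a small illustration of these formulas, assuming scikit-learn and purely made-up labels and scores:

from sklearn.metrics import confusion_matrix, roc_auc_score

y_true  = [0, 0, 1, 1, 1, 0, 1, 0]          # ground truth (rows of the matrix)
y_pred  = [0, 1, 1, 1, 0, 0, 1, 0]          # classifier output (columns)
y_score = [0.1, 0.6, 0.8, 0.9, 0.4, 0.2, 0.7, 0.3]  # continuous scores for the AUC

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()

P  = tp / (tp + fp)          # precision
Se = tp / (tp + fn)          # sensitivity (recall)
Sp = tn / (tn + fp)          # specificity
F  = 2 * Se * P / (Se + P)   # F-score

print("P =", P, "Se =", Se, "Sp =", Sp, "F-score =", F)
print("AUC =", roc_auc_score(y_true, y_score))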
You can find everything you need to know here:
[1] S. Rogers and M. Girolami, A First Course in Machine Learning. Machine Learning & Pattern Recognition Series, Cambridge, UK: Chapman & Hall/CRC, 2012.
[2] M. Sokolova and G. Lapalme, “A systematic analysis of performance measures for classification tasks,” Information Processing & Management, vol. 45, no. 4, pp. 427–437, Jul. 2009.
Let me know if you cannot get access to the documents mentioned above.
I use ROC and AUC as performance metrics for my classifiers.
Most classifiers can output a probability (or a continuous score). For simplicity, I'll assume a binary classifier (classes 0/1).
By default, statistical packages interpret a probability of 0.5 or greater as class 1, and anything smaller as class 0. Based on these labels, you can validate (using hold-out or k-fold cross-validation) the classifier performance derived from a confusion matrix (precision, recall, etc.).
However, if you set the threshold to 0.25, your confusion matrix will change, and the other metrics will change with it. Similarly, a threshold of 0.75 will give yet another set of results.
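For example, here is a small sketch of that effect, assuming scikit-learn; y_prob stands for hypothetical predicted probabilities of class 1:

import numpy as np
from sklearn.metrics import confusion_matrix

y_true = np.array([0, 0, 0, 1, 1, 1, 1, 0])
y_prob = np.array([0.2, 0.4, 0.6, 0.3, 0.7, 0.8, 0.9, 0.1])

# The same probabilities produce a different confusion matrix at each cut-off.
for threshold in (0.25, 0.5, 0.75):
    y_pred = (y_prob >= threshold).astype(int)
    print("threshold", threshold)
    print(confusion_matrix(y_true, y_pred))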
So, I find ROC and AUC to be better descriptors of classifier performance. Together, they tell me how well the classifier separates the two classes over the whole range of thresholds, rather than at one arbitrary cut-off.
If you want to compare the performance of your SVM with, say, a boosting algorithm or a random forest, you can simply compare the validation ROC curves and AUC values of the models.
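Such a comparison could be sketched as follows, assuming scikit-learn and a random forest as the second model (the synthetic data is a placeholder):

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# The roc_auc scorer uses each model's continuous scores (decision_function
# or predicted probabilities) to compute the cross-validated AUC.
svm_auc = cross_val_score(SVC(), X, y, cv=5, scoring="roc_auc")
rf_auc  = cross_val_score(RandomForestClassifier(random_state=0), X, y,
                          cv=5, scoring="roc_auc")

print("SVM           mean AUC:", svm_auc.mean())
print("Random forest mean AUC:", rf_auc.mean())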
I am a little confused: the ROC curve is mainly used in bioinformatics, while for text classification and character recognition we mainly use precision, recall, or their harmonic mean, the F-score. Can we use the ROC curve for all of these, or are there any reservations?
A method that many researchers follow for evaluation is to divide the data into 60%, 20%, and 20% for training, validation, and testing respectively.
Then compute the confusion matrix for the validation data as well as for the test data, and also compute precision and recall for the test data; this completes the evaluation.
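A minimal sketch of that 60/20/20 procedure, assuming scikit-learn (the synthetic data again stands in for your own):

from sklearn.datasets import make_classification
from sklearn.metrics import confusion_matrix, precision_score, recall_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# First split off 40% of the data, then split that portion half-and-half
# into validation and test sets (each 20% of the original data).
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.40, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_rest, y_rest, test_size=0.50, random_state=0)

clf = SVC().fit(X_train, y_train)
y_test_pred = clf.predict(X_test)

print("validation confusion matrix:\n", confusion_matrix(y_val, clf.predict(X_val)))
print("test confusion matrix:\n", confusion_matrix(y_test, y_test_pred))
print("test precision:", precision_score(y_test, y_test_pred))
print("test recall   :", recall_score(y_test, y_test_pred))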