Hi, there is no simple answer to your question. Random Forests are very good, but someone else might prefer a different method. Generally, it is a good idea to use several methods and then ensemble their results (e.g. by voting or averaging).
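As a minimal sketch of that idea, here is a soft-voting ensemble built with scikit-learn (the thread is about Weka, but the principle is tool-independent; the dataset and the three base classifiers are illustrative assumptions, not recommendations):

```python
# Combine several classifiers by soft voting (averaging predicted
# class probabilities); classifiers and dataset are illustrative.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

ensemble = VotingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=100, random_state=0)),
        ("lr", LogisticRegression(max_iter=1000)),
        ("svm", SVC(probability=True, random_state=0)),
    ],
    voting="soft",  # average probabilities instead of counting votes
)

scores = cross_val_score(ensemble, X, y, cv=5)
print("Cross-validated accuracy: %.3f +/- %.3f" % (scores.mean(), scores.std()))
```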
What Michael describes is known as the 'no free lunch theorem' of machine learning: simply put, there is no single best learning algorithm, only a best algorithm for a particular dataset.
I think the question should be refined first. The standard definition of accuracy in the statistics literature says that an estimate that is less likely to be proven wrong is more accurate. For example, if I say the temperature tomorrow will be between -40 and +50 degrees Celsius, that is accurate, but not useful or informative for making a decision. In contrast, a prediction of 10-11 degrees is very precise but can be wrong with high probability. In machine learning we prefer to talk about 'precision' and 'recall', based on true positives, true negatives, false positives, and false negatives, and about the 'F score' and 'G score' when a ground truth is available to compare against. This is the case for classification.
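For reference, precision = TP / (TP + FP), recall = TP / (TP + FN), and the F1 score is their harmonic mean. A tiny sketch computing them, using scikit-learn for brevity and made-up labels:

```python
# Precision, recall and F1 on invented binary labels.
# precision = TP / (TP + FP); recall = TP / (TP + FN)
# F1 = 2 * precision * recall / (precision + recall)
from sklearn.metrics import precision_recall_fscore_support

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="binary"
)
print("precision=%.2f recall=%.2f F1=%.2f" % (precision, recall, f1))
```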
The quality of an algorithm depends on the question and the data you have in hand. How big is the data? Is it longitudinal or not? Is it high-dimensional or low-dimensional? Does it consist of regularly shaped classes or not? What are the rate and structure of the missing values, and can you curate them? What type are the majority of the variables?
Based on the answers to these questions you would take different strategies for data curation, feature selection, and classification, and all of those decisions affect your model's performance.
It helps to know the basics of the classification algorithms and how to tune their parameters. For good accuracy, selecting a classifier is often less important than tuning its parameters. I recommend LIBSVM; you can find the manual, practical guide, and FAQ on the LIBSVM authors' website. The authors describe a "standard" procedure for newcomers that you can try, although it is not based on Weka (by the way, Weka can use LIBSVM for classification).
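For anyone who does not want to dig through the guide: that "standard" procedure amounts to scaling the features and then grid-searching C and gamma with cross-validation. A rough sketch of the same recipe via scikit-learn's SVC (which wraps libsvm); the grid ranges follow the guide's suggested form, and the dataset is a placeholder:

```python
# LIBSVM practical-guide recipe: scale features, then run a
# cross-validated grid search over C and gamma for an RBF kernel.
# Grid ranges and dataset below are illustrative placeholders.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

pipe = Pipeline([("scale", StandardScaler()), ("svm", SVC(kernel="rbf"))])
grid = GridSearchCV(
    pipe,
    param_grid={
        "svm__C": [2 ** k for k in range(-5, 16, 2)],
        "svm__gamma": [2 ** k for k in range(-15, 4, 2)],
    },
    cv=5,
)
grid.fit(X, y)
print("best params:", grid.best_params_)
print("best CV accuracy: %.3f" % grid.best_score_)
```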
Hi Vaishali, if you have a high-dimensional dataset then SVM is a good choice. Even so, there is no single best classification algorithm, but if you combine two or more algorithms you can often get better results.
If you are searching for the best algorithm, don't limit yourself to Weka; you should not restrict yourself to one piece of software. There are many good algorithms that are not supported in Weka. You should use an ensemble approach of algorithms. Wishing you all the best.
ZeroR is the baseline to beat; however, the Kappa statistic should be examined to understand the bias in the data, much like entropy. Random forests are resilient to overfitting, and drawing a random subset of predictors at each split gives better accuracy for most outcomes.
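To make the ZeroR/Kappa point concrete: a majority-class baseline can score decent accuracy while its Kappa stays near zero. A small sketch with scikit-learn stand-ins (DummyClassifier plays the role of Weka's ZeroR); the dataset and split are illustrative:

```python
# DummyClassifier(strategy="most_frequent") is the scikit-learn
# equivalent of Weka's ZeroR; Cohen's kappa shows how much of a
# model's accuracy goes beyond what class frequencies alone give.
from sklearn.datasets import load_breast_cancer
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, cohen_kappa_score
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for name, clf in [
    ("ZeroR baseline", DummyClassifier(strategy="most_frequent")),
    ("Random forest", RandomForestClassifier(n_estimators=100, random_state=0)),
]:
    y_hat = clf.fit(X_tr, y_tr).predict(X_te)
    print("%s: accuracy=%.3f kappa=%.3f"
          % (name, accuracy_score(y_te, y_hat), cohen_kappa_score(y_te, y_hat)))
```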
Selecting a suitable machine learning algorithm is problem-dependent. For example, the Naive Bayes classifier, which does not require complex parameter settings, is commonly used in text classification problems, while a logistic regression classifier may give better solutions in cost-sensitive learning problems.
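As a small illustration of why Naive Bayes is such a common first choice for text: a bag-of-words pipeline needs essentially no parameter tuning. The tiny corpus below is invented:

```python
# Minimal text classification with Multinomial Naive Bayes;
# the five training sentences and their labels are made-up examples.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

docs = [
    "cheap pills buy now",
    "meeting agenda for tomorrow",
    "win a free prize today",
    "project report attached",
    "free offer limited time",
]
labels = ["spam", "ham", "spam", "ham", "spam"]

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(docs, labels)
print(model.predict(["free meeting prize"]))
```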
I don't think this question can be answered in this manner. The choice of an algorithm depends on the dataset: it basically depends on the distribution and type of the data you are handling. To the best of my knowledge, the only option is to try several and evaluate them.
I have used Weka intensively in my research. In my case, Random Forest (using 100 base learners), AdaBoost.M1 (using J4.8 and 100 base learners), SVM (using SMO with a polynomial kernel of degree p = 3), and SVM (using an RBF kernel with suitably tuned C and gamma parameters) attained the best results. However, it all depends on the problem you are tackling, and your results may differ. I would suggest trying these classifiers, as well as an artificial neural network (ANN), to check your performance.
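For comparison outside Weka, here is a rough scikit-learn translation of that lineup evaluated by cross-validation. The mapping is approximate (J4.8 is stood in for by a generic decision tree, SMO with p = 3 by a degree-3 polynomial SVM, and an MLP plays the ANN role), and the dataset is a placeholder for your own:

```python
# Rough scikit-learn stand-ins for the Weka classifiers named above,
# compared by 5-fold cross-validation. The mapping is approximate and
# the dataset is just a placeholder.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

candidates = {
    "Random forest (100 trees)": RandomForestClassifier(
        n_estimators=100, random_state=0),
    # 'estimator' is named 'base_estimator' in scikit-learn < 1.2
    "AdaBoost (100 trees)": AdaBoostClassifier(
        estimator=DecisionTreeClassifier(), n_estimators=100, random_state=0),
    "SVM (poly, degree 3)": make_pipeline(
        StandardScaler(), SVC(kernel="poly", degree=3)),
    "SVM (RBF)": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    "ANN (MLP)": make_pipeline(
        StandardScaler(), MLPClassifier(max_iter=1000, random_state=0)),
}

for name, clf in candidates.items():
    scores = cross_val_score(clf, X, y, cv=5)
    print("%-26s %.3f +/- %.3f" % (name, scores.mean(), scores.std()))
```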
Check the following paper:
A Combination of Feature Extraction Methods with an Ensemble of Different Classifiers for Protein Structural Class Prediction Problem