The tolerance rough set model is well suited to dealing with missing values in data sets. But how can I use the tolerance rough set model to classify data with a conventional classifier such as KNN?
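One possible bridge (a sketch, not an established recipe): use the tolerance relation of incomplete information systems, under which a missing value is tolerant of any value, as the similarity that KNN ranks neighbours by. All names below are illustrative, and for continuous attributes the equality test would normally be relaxed to |a(x) - a(y)| <= eps.

```python
import numpy as np

def tolerance_similarity(x, y):
    """Fraction of attributes on which x and y are 'tolerant':
    equal, or at least one of the two values missing (NaN).
    Missing values match anything, as in tolerance relations
    for incomplete information systems."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    tolerant = np.isnan(x) | np.isnan(y) | (x == y)
    return tolerant.mean()

def knn_predict(X_train, y_train, query, k=3):
    """Classify `query` by majority vote among the k training
    objects with the highest tolerance similarity."""
    sims = np.array([tolerance_similarity(query, row) for row in X_train])
    top = np.argsort(-sims)[:k]
    labels, counts = np.unique(y_train[top], return_counts=True)
    return labels[np.argmax(counts)]

# Toy example: NaN marks a missing attribute value.
X = np.array([[1, 0, np.nan], [1, 0, 2], [0, 1, 2]])
y = np.array([0, 0, 1])
print(knn_predict(X, y, np.array([1, np.nan, 2]), k=1))
```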
In addition to imputation methods such as KNN, regression, and MLP for missing data, other approaches can be applied. Classifiers such as AdaBoost ensembles and Bayesian networks can handle missing values without imputation.
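As a concrete illustration of the imputation route, scikit-learn's KNNImputer fills each missing entry from the nearest neighbours before any conventional classifier is trained; a minimal sketch (the toy data is mine, not from the thread):

```python
import numpy as np
from sklearn.impute import KNNImputer
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline

X = np.array([[1.0, 2.0], [np.nan, 3.0], [7.0, 6.0], [8.0, np.nan]])
y = np.array([0, 0, 1, 1])

# Impute each NaN from its 2 nearest neighbours on the observed
# features, then classify with ordinary KNN on the completed data.
clf = make_pipeline(KNNImputer(n_neighbors=2),
                    KNeighborsClassifier(n_neighbors=1))
clf.fit(X, y)
print(clf.predict([[np.nan, 5.0]]))
```

For the no-imputation route, scikit-learn's HistGradientBoostingClassifier (a boosted ensemble) accepts NaN entries natively, so missing values never have to be filled in at all.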
I would suggest computing the expected value of each feature from the training data; then, whenever you encounter a missing value, you can substitute the precomputed expectation for that feature.
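If I read the suggestion right, that amounts to storing per-feature expectations at training time and looking them up at prediction time; a minimal numpy sketch of that reading:

```python
import numpy as np

X_train = np.array([[1.0, np.nan, 3.0],
                    [2.0, 5.0,  np.nan],
                    [3.0, 7.0,  9.0]])

# Expected value of each feature, estimated from observed entries only.
expectations = np.nanmean(X_train, axis=0)

def fill_expected(x):
    """Replace each missing entry with the stored expectation."""
    x = np.asarray(x, float).copy()
    miss = np.isnan(x)
    x[miss] = expectations[miss]
    return x

print(fill_expected([np.nan, 6.0, np.nan]))  # -> [2. 6. 6.]
```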
One thing you might consider is the computational complexity and overhead of the different techniques if you wish to execute the classifiers in real time, particularly when the application is accessed over the internet (for example, see how a simple non-AI system for deciding whether Tamiflu should be prescribed was overwhelmed in 2009: http://www.express.co.uk/news/uk/116041/Flu-website-is-overwhelmed-in-minutes). Sometimes it is worth asking whether a relatively crude measure, such as substituting the mean (or median/mode), will be "good enough" in view of its low computational overhead.
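In scikit-learn terms, that crude-but-cheap substitution is a SimpleImputer: the statistic is computed once at fit time, and each prediction then costs only a constant-time table lookup per missing value. A sketch with made-up data:

```python
import numpy as np
from sklearn.impute import SimpleImputer

X = np.array([[1.0, np.nan], [2.0, 4.0], [9.0, 8.0], [np.nan, 9.0]])

# All three strategies reduce prediction-time handling of a missing
# value to looking up one precomputed number per column.
for strategy in ("mean", "median", "most_frequent"):
    imp = SimpleImputer(strategy=strategy)
    print(strategy, imp.fit_transform(X)[0])
```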
The metric I proposed gives similarity in the range [0, 1), which means that if the value of a specific feature is missing, it will not affect the final distance significantly.
I have just published a paper that proposes a new metric that is invariant to the dimensionality of the feature vector; you may find the paper at the link:
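Without reproducing the paper's definition, a similarity with the stated properties (values on the same scale whatever the dimensionality, and missing features simply not contributing) could be sketched as below; the per-feature form, range normalisation, and treatment of fully incomparable pairs are my assumptions, not the paper's actual metric:

```python
import numpy as np

def partial_similarity(x, y, ranges):
    """Average per-feature similarity over features observed in BOTH
    vectors. Averaging (rather than summing) keeps the result on the
    same scale for any dimensionality, and a missing feature simply
    drops out instead of distorting the final distance."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    known = ~(np.isnan(x) | np.isnan(y))
    if not known.any():
        return 0.0  # nothing comparable: assume no similarity
    per_feature = 1.0 - np.abs(x[known] - y[known]) / ranges[known]
    return per_feature.mean()

ranges = np.array([10.0, 10.0, 10.0])  # assumed known feature ranges
print(partial_similarity([1, np.nan, 3], [2, 5, np.nan], ranges))  # 0.9
```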
In the context of support vector machines, it is also possible to change the problem formulation itself in the case of missing values; see e.g.
Pelckmans K., De Brabanter J., Suykens J.A.K., De Moor B., "Handling Missing Values in Support Vector Machine Classifiers", Neural Networks, vol. 18, 2005, pp. 684-692.
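That paper modifies the SVM training problem itself, which is beyond a short sketch. As a loosely related illustration of changing the formulation rather than imputing (not the cited method), one can replace the RBF kernel with its expectation under a Gaussian model of the missing entries, for which a closed form exists; the means and variances below are assumed inputs:

```python
import numpy as np

def expected_rbf(x, y, mu, var, sigma2=1.0):
    """E[exp(-||x - y||^2 / (2*sigma2))] when each missing entry (NaN)
    is modelled as an independent Gaussian N(mu_i, var_i).  The kernel
    factorises over dimensions, and the per-dimension Gaussian integral
    gives sqrt(sigma2/d) * exp(-(mx - my)^2 / (2*d)), where d is sigma2
    plus the variance contributed by the missing entries."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    mx = np.where(np.isnan(x), mu, x)     # mean of each entry of x
    my = np.where(np.isnan(y), mu, y)
    vx = np.where(np.isnan(x), var, 0.0)  # variance only if missing
    vy = np.where(np.isnan(y), var, 0.0)
    d = sigma2 + vx + vy
    terms = np.sqrt(sigma2 / d) * np.exp(-(mx - my) ** 2 / (2 * d))
    return terms.prod()

mu = np.array([0.0, 0.0])
var = np.array([1.0, 1.0])
print(expected_rbf([1.0, np.nan], [1.0, 0.5], mu, var))
```

This Gram matrix can then be passed to any kernel classifier (e.g. SVC with a precomputed kernel), so the classifier never sees imputed values at all.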