For a one-class classification problem, an optimal set of features can be selected on the basis of statistical properties of the features (e.g. correlation analysis) or with a simple optimisation technique such as dynamic programming or a genetic algorithm (GA).
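As a minimal sketch of the correlation-analysis idea (my own illustration; the helper name and threshold are arbitrary), highly correlated features can be filtered so that only one representative of each correlated group is kept:

import numpy as np

def drop_correlated_features(X, threshold=0.95):
    # X: (n_samples, n_features) target-class samples.
    # Keep a feature only if its absolute Pearson correlation with every
    # already-kept feature stays below the threshold.
    corr = np.abs(np.corrcoef(X, rowvar=False))
    keep = []
    for j in range(X.shape[1]):
        if all(corr[j, k] < threshold for k in keep):
            keep.append(j)
    return keep  # indices of retained features

# Example: feature 3 is a near-duplicate of feature 0 and gets dropped
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
X[:, 3] = X[:, 0] + 0.01 * rng.normal(size=200)
print(drop_correlated_features(X))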
A classical approach in one-class classification is to reduce the intra-class distance of your objects in the M-dimensional feature space. An appropriate measure is, e.g., DBSCAN clustering, among others. Furthermore, you can apply a feature selection procedure (see below) and then check whether your cluster radius shrinks, and so on; a small sketch of this check follows the paper reference below.
Paper:
Sensorless drive diagnosis using automated feature extraction, significance ranking and reduction.
Authors: Christian Bayer, Olaf Enge-Rosenblatt, Martyna Bator, Uwe Mönks
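As a rough, self-contained sketch of the "check whether the cluster radius shrinks" step (my own simplification using a centroid-based radius, not the DBSCAN-based measure or the procedure from the cited paper):

import numpy as np

def cluster_radius(X):
    # Intra-class spread: mean distance of target-class samples to their centroid.
    centroid = X.mean(axis=0)
    return np.linalg.norm(X - centroid, axis=1).mean()

def radius_without_feature(X, j):
    return cluster_radius(np.delete(X, j, axis=1))

rng = np.random.default_rng(1)
X = rng.normal(size=(300, 8))
X[:, 5] *= 10.0                       # a spread-out, artefact-laden feature
base = cluster_radius(X)
shrink = [(base - radius_without_feature(X, j), j) for j in range(X.shape[1])]
print(max(shrink))                    # dropping feature 5 shrinks the radius the most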
If you want to rank your features, you may use a filter method of feature selection. These include F-ratio, T-score and mRMR, among others. Of these, I have found the mRMR method to be the best for ranking features, as it uses a mutual information criterion. For details, see the paper by Hanchuan Peng, Fuhui Long, and Chris Ding, "Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 27, No. 8, pp. 1226-1238, 2005.
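A minimal greedy mRMR sketch, assuming scikit-learn's mutual-information estimators and a labelled data set (the function name and the use of mutual_info_regression for the redundancy term are my own choices, loosely following the max-relevance / min-redundancy idea of Peng et al.):

import numpy as np
from sklearn.feature_selection import mutual_info_classif, mutual_info_regression

def mrmr_rank(X, y, n_select):
    # Relevance: I(feature; class label). Redundancy: mean I(feature; already-selected features).
    relevance = mutual_info_classif(X, y, random_state=0)
    selected, remaining = [], list(range(X.shape[1]))
    while remaining and len(selected) < n_select:
        scores = []
        for j in remaining:
            if selected:
                redundancy = np.mean([
                    mutual_info_regression(X[:, [k]], X[:, j], random_state=0)[0]
                    for k in selected])
            else:
                redundancy = 0.0
            scores.append(relevance[j] - redundancy)
        best = remaining[int(np.argmax(scores))]
        selected.append(best)
        remaining.remove(best)
    return selected  # feature indices in ranked order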
Applying it to a one-class data set, however, is difficult. If you have a few samples of the other class, you can treat it as a binary classification problem and apply the method directly. If that is not possible, an evolutionary method may be employed to rank the features, with the aim that inclusion of a feature leads to the smallest increase in the radius of the minimum bounding hypersphere that encloses all the samples.
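To illustrate the radius criterion only (a greedy stand-in, not a full evolutionary search, and using the largest centroid distance as a crude proxy for the minimum enclosing hypersphere):

import numpy as np

def hypersphere_radius(X):
    # Crude proxy for the minimum enclosing hypersphere: largest distance
    # from the centroid to any one-class sample.
    centroid = X.mean(axis=0)
    return np.linalg.norm(X - centroid, axis=1).max()

def rank_by_radius_increase(X):
    # Add, at each step, the feature whose inclusion increases the enclosing
    # radius of the one-class samples the least.
    remaining, ranked = list(range(X.shape[1])), []
    while remaining:
        radii = [hypersphere_radius(X[:, ranked + [j]]) for j in remaining]
        best = remaining[int(np.argmin(radii))]
        ranked.append(best)
        remaining.remove(best)
    return ranked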
I would like to add a general comment on feature selection which I have stated in several blogs.
There are many feature selection methods available, such as LDA, Fisher's discriminant with the Rayleigh coefficient, intra-class-distance minimisers, etc. What usually does not work is PCA. Why? PCA assumes a Gaussian process, measures the variances, and sorts the eigenvalues (which are proportional to the variances) in descending order. The assumption is that the leading eigenvalues (EWs) contain most of the information, and therefore the main components are used for data reduction. So far so good. But applying this approach for feature reduction is risky, because it assumes that each feature is itself stable (invariant) AND that the feature's variance carries all the information needed for classification.
This assumption is wrong!
Real-world features contain artefacts (noise, etc.), and so PCA can place the highest-variance, noise-related directions into its main components. Hence you generate "new" features that are not stable.
Only if you can prove that your features are completely artefact-free and generated by a Gaussian process might PCA work; in all other cases PCA is, as I said, very risky.
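A small synthetic illustration of this risk (my own example): when one feature carries high-variance noise, the first principal component aligns with that noise rather than with the low-variance direction that actually separates the classes.

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
n = 500
labels = rng.integers(0, 2, n)
f_informative = labels + 0.1 * rng.normal(size=n)   # small variance, discriminative
f_noise = 10.0 * rng.normal(size=n)                 # huge variance, pure artefact
X = np.column_stack([f_informative, f_noise])

pca = PCA(n_components=1).fit(X)
print(pca.components_)                 # first PC points almost entirely along the noise feature
print(pca.explained_variance_ratio_)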
Classical statistical methods to rank features for classification include greedy forward variable selection, mutual-information-based ranking, backward elimination, Metropolis scanning / MCMC, penalised logistic regression, etc.
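For instance, greedy forward selection can be sketched as a wrapper around any classifier and cross-validated score (the logistic-regression scorer here is just an illustrative choice):

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def greedy_forward_selection(X, y, n_select, cv=5):
    # At each step add the feature that yields the best cross-validated score.
    selected, remaining = [], list(range(X.shape[1]))
    while remaining and len(selected) < n_select:
        scores = []
        for j in remaining:
            clf = LogisticRegression(max_iter=1000)
            scores.append(cross_val_score(clf, X[:, selected + [j]], y, cv=cv).mean())
        best = remaining[int(np.argmax(scores))]
        selected.append(best)
        remaining.remove(best)
    return selected  # features in the order they were added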
Apart from other optimisation techniques, DPSO (Discrete Particle Swarm Optimisation) has also been used recently by some researchers, who report good results.
Hi. For ranking features in order of importance (rather than compressing them into a new, lower-dimensional vector as PCA does), you can try sensitivity analysis. I do SA easily in NeuroSolutions 5.05.
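I cannot reproduce the NeuroSolutions implementation here, but a related perturbation-based ranking (permutation importance, which shuffles one input at a time and measures the score drop) gives the same kind of importance ordering; the model and data below are only an example:

from sklearn.datasets import make_classification
from sklearn.inspection import permutation_importance
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=400, n_features=8, n_informative=3, random_state=0)
model = MLPClassifier(hidden_layer_sizes=(16,), max_iter=2000, random_state=0).fit(X, y)

# Shuffle one feature at a time and measure how much the score drops.
result = permutation_importance(model, X, y, n_repeats=20, random_state=0)
print(result.importances_mean.argsort()[::-1])   # most to least influential feature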
SVDD is specifically designed for describing the boundary of one-class data. Different kernel functions can be used, giving differently shaped boundaries around the data for classification.
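As an illustration (scikit-learn ships OneClassSVM, the nu-SVM formulation, which with an RBF kernel is closely related to SVDD; the data and parameters are arbitrary), different kernels draw different boundaries around the target class:

import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 2))          # target-class samples only
X_test = np.array([[0.0, 0.0], [4.0, 4.0]])  # an inlier-like and an outlier-like point

for kernel in ("rbf", "poly", "sigmoid"):
    clf = OneClassSVM(kernel=kernel, nu=0.05, gamma="scale").fit(X_train)
    print(kernel, clf.predict(X_test))       # +1 = inside the boundary, -1 = outside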