A minimum of two support vectors is required for each decision hyperplane in the model. This follows from the observation that the margin at each decision boundary must be defined on each side of the dividing hyperplane by the closest data points, which are the support vectors.
You can use a support vector machine (SVM) when your data has exactly two classes. An SVM classifies data by finding the best hyperplane that separates all data points of one class from those of the other class. The best hyperplane for an SVM means the one with the largest margin between the two classes.
The support vectors are the data points that are closest to the separating hyperplane; these points lie on the boundary of the slab (the margin region that contains no interior data points).
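As a minimal illustration of this (my own sketch, assuming scikit-learn is available; the toy dataset is invented), you can fit a linear SVM and read the support vectors off the fitted model:

```python
# A minimal sketch (scikit-learn assumed, toy data invented): fit a linear
# SVM on two separable classes and read off the support vectors.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [1.0, 0.5], [0.5, 1.0],
              [3.0, 3.0], [4.0, 3.5], [3.5, 4.0]])
y = np.array([-1, -1, -1, 1, 1, 1])

# A large C approximates the hard-margin (largest-margin) fit described above.
clf = SVC(kernel="linear", C=1e6).fit(X, y)

print(clf.support_vectors_)  # the points on the boundary of the slab
print(clf.n_support_)        # support vectors per class: at least one on each side
```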
So, back to your question: how do you find the minimum number of support vectors for a machine-learning classification problem?
The data for training is a set of points (vectors) $T_j$ along with their categories $C_j$. For some dimension $D$, the $T_j \in \mathbb{R}^D$, and the $C_j = \pm 1$. The equation of a hyperplane is
$$f(T) = T'\beta + b = 0,$$
where $\beta \in \mathbb{R}^D$ and $b$ is a real number.
The following problem defines the best separating hyperplane (i.e., the decision boundary). Find $\beta$ and $b$ that minimize $\|\beta\|$ such that for all data points $(T_j, C_j)$,
$$C_j f(T_j) \ge 1.$$
So, the support vectors are the $T_j$ on the boundary, those for which $C_j f(T_j) = 1$.
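Here is a hedged sketch of that optimization in Python, using scipy.optimize.minimize as a stand-in for a dedicated quadratic-programming solver (the toy data, tolerance, and solver choice are my assumptions, not part of the answer above):

```python
# Hedged sketch: solve the hard-margin primal with a general-purpose solver
# (SLSQP); a dedicated QP solver would be the usual choice in practice.
import numpy as np
from scipy.optimize import minimize

T = np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 3.0], [4.0, 4.0]])  # toy points
C = np.array([-1, -1, 1, 1])                                     # labels +/-1

def objective(w):             # w = (beta_1, ..., beta_D, b)
    beta = w[:-1]
    return 0.5 * beta @ beta  # minimizing ||beta||^2/2, equivalent to ||beta||

constraints = [
    {"type": "ineq", "fun": lambda w, j=j: C[j] * (T[j] @ w[:-1] + w[-1]) - 1.0}
    for j in range(len(T))    # C_j * f(T_j) >= 1 for every training point
]

res = minimize(objective, x0=np.zeros(T.shape[1] + 1), constraints=constraints)
beta, b = res.x[:-1], res.x[-1]

# Support vectors are the points where the constraint is active: C_j f(T_j) = 1.
scores = C * (T @ beta + b)
print(T[np.isclose(scores, 1.0, atol=1e-3)])
```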
For mathematical convenience, the problem is usually given as the equivalent problem of minimizing $\beta'\beta/2$; this is a quadratic programming problem. The optimal solution $(\hat{\beta}, \hat{b})$ enables classification of a vector $z$ as follows:
$$\mathrm{class}(z) = \operatorname{sign}(z'\hat{\beta} + \hat{b}) = \operatorname{sign}(\hat{f}(z)).$$
$\hat{f}(z)$ is the classification score; its sign gives the class, and its magnitude is proportional to the distance from $z$ to the decision boundary.
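A short sketch of this classification rule (scikit-learn assumed, toy data invented): decision_function returns the score $\hat{f}(z)$, its sign is the predicted class, and dividing by $\|\hat{\beta}\|$ converts the score into a geometric distance.

```python
# Sketch (scikit-learn assumed): decision_function returns the score
# f_hat(z) = z' beta_hat + b_hat; its sign is the predicted class.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 3.0], [4.0, 4.0]])
y = np.array([-1, -1, 1, 1])
clf = SVC(kernel="linear", C=1e6).fit(X, y)

z = np.array([[2.5, 2.5]])
score = clf.decision_function(z)          # classification score f_hat(z)
print(np.sign(score))                     # class(z) = sign(f_hat(z))
print(score / np.linalg.norm(clf.coef_))  # signed geometric distance to boundary
```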
This link is very helpful for understanding this kind of optimization problem: https://stats.stackexchange.com/questions/313660/what-are-the-support-vectors-in-a-support-vector-machine
The SVM classification method applies the concept of margin hyperplanes, which can be pictured as surfaces that maximize the boundaries between the different classes of data in order to create subspaces with homogeneous observations.
To find the optimal hyperplane, the margin, i.e., twice the distance between the hyperplane and the nearest training data points (called support vectors), is maximized.
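For a fitted linear SVM this margin can be computed directly: it equals $2/\|\beta\|$. A small check, again assuming scikit-learn and a made-up toy dataset:

```python
# Sketch (scikit-learn assumed, toy data invented): for a linear SVM the
# margin width equals 2 / ||beta||, twice the distance to the support vectors.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 3.0], [4.0, 4.0]])
y = np.array([-1, -1, 1, 1])
clf = SVC(kernel="linear", C=1e6).fit(X, y)

beta = clf.coef_.ravel()
print(2.0 / np.linalg.norm(beta))  # margin width; 2*sqrt(2) for this toy data
```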
Suppose we have a $d$-dimensional set of data points with labels $-1$ and $+1$ that we want to classify using an SVM. For classification, we need two margin hyperplanes at equal distance from the optimal hyperplane that separate the data points. Since we are in $d$ dimensions, each margin hyperplane can be constructed from a minimum of $d$ support vectors ($d$ affinely independent points determine a hyperplane in $\mathbb{R}^d$).
So, for classification, we will need a minimum of $2d$ support vectors; a quick empirical check follows below.
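You can at least count the support vectors an actual fit produces. This sketch (scikit-learn assumed, synthetic Gaussian data invented for illustration) prints the per-class counts for $d$-dimensional data, so you can compare them against the $2d$ figure yourself:

```python
# A quick empirical check (scikit-learn assumed, Gaussian toy data invented):
# count the support vectors a hard-margin-like fit actually uses in d dimensions.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
d = 5
X = np.vstack([rng.normal(-3.0, 1.0, size=(50, d)),   # class -1
               rng.normal(+3.0, 1.0, size=(50, d))])  # class +1
y = np.array([-1] * 50 + [1] * 50)

clf = SVC(kernel="linear", C=1e6).fit(X, y)
print(f"d = {d}: {clf.n_support_} support vectors per class")
```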
You can grid-search over the unknown variables, such as the minimum number of support vectors in the classifier, and keep whichever setting results in the highest classification accuracy.
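A hedged sketch of that idea with scikit-learn's GridSearchCV (the parameter grid and dataset are illustrative assumptions): search the hyperparameters, then inspect the accuracy and support-vector count of the winning model.

```python
# Sketch of the grid-search idea (scikit-learn assumed; grid and data invented):
# search the hyperparameters, then inspect the accuracy and support-vector
# count of the best estimator.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, n_features=4, random_state=0)

grid = GridSearchCV(
    SVC(),
    param_grid={"C": [0.1, 1, 10, 100], "kernel": ["linear", "rbf"]},
    cv=5,
)
grid.fit(X, y)

print(grid.best_params_, grid.best_score_)  # setting with the highest CV accuracy
print(grid.best_estimator_.n_support_)      # support vectors per class there
```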
As mentioned in the first reply, the minimum number of support vectors is obviously 2, and the resulting optimal hyperplane is the perpendicular bisector of the segment joining the two examples in feature space.
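This two-point case is easy to verify numerically (a sketch under the assumption that scikit-learn with a large C approximates the hard-margin fit):

```python
# Sketch (scikit-learn assumed): with exactly two training points, the fitted
# hyperplane is the perpendicular bisector of the segment joining them.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0, 0.0], [4.0, 2.0]])
y = np.array([-1, 1])
clf = SVC(kernel="linear", C=1e6).fit(X, y)

beta, b = clf.coef_.ravel(), clf.intercept_[0]
print(beta / np.linalg.norm(beta))  # normal direction: parallel to X[1] - X[0]
print(X.mean(axis=0) @ beta + b)    # ~0: the midpoint lies on the hyperplane
```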
Now if your question is "How do I calculate theoretically the minimum number of support vectors given the data that is available to me", the answer is: you can't. You must run your SVM software (which means making a lot of choices - kernel, regularization constant, hyperparameters of the kernel, ...) and see what happens.
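In that spirit, a small sketch of the empirical approach (dataset, kernels, and C values are arbitrary choices of mine): rerun the SVM under different settings and simply observe how many support vectors result.

```python
# Sketch of the empirical approach (all settings invented for illustration):
# rerun the SVM under different choices and observe the support-vector count.
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=6, random_state=1)

for kernel in ["linear", "poly", "rbf"]:
    for C in [0.1, 10.0]:
        clf = SVC(kernel=kernel, C=C).fit(X, y)
        print(f"kernel={kernel:6s} C={C:5.1f} -> "
              f"{clf.n_support_.sum()} support vectors")
```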