This is one of the many reasons why I love kernel models!
Kernel models are exactly the same as linear ones, except they first transform the data. Now, the math shows that we're transforming into an even bigger space, so if your inputs have 1,000 features (dimensions), the kernel space could have 100,000 dimensions or even infinitely many.
But sweep away that math, and all we are really doing is calculating the distance between each pair of data points. The kernel machine then uses those distances as input.
So if you have 10 data points, you have 45 distances between them. It doesn't matter whether your data has 5 dimensions or 1,000; you still have only 45 distances.
The trade-off, though, is that the number of distances increases dramatically as the number of points increases. I'm sure you know that the number of distances between n points is a triangle number: n*(n-1)/2.
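A minimal sketch of that point (using NumPy and scikit-learn's `rbf_kernel`, which this answer doesn't mention; the data and gamma value are made up for illustration): the kernel (Gram) matrix depends only on the number of points, not on the number of input features.

```python
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel

rng = np.random.default_rng(0)

# 10 points in a 1,000-dimensional input space
X = rng.normal(size=(10, 1000))

# The kernel (Gram) matrix is 10 x 10 regardless of the 1,000 features.
K = rbf_kernel(X, gamma=0.001)
print(K.shape)              # (10, 10)

# Only the upper triangle holds distinct pairwise values: n*(n-1)/2 = 45
n = X.shape[0]
print(n * (n - 1) // 2)     # 45
```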
An MLP tries to reliably estimate a huge number of parameters (a number that also depends on the number of inputs, i.e. features) in order to build a 'decision model'.
In contrast, SVMs with kernels do not operate on the source feature space explicitly. Instances from the training set appear only as arguments of the kernel function, so the dimensionality of the feature space always stays 'behind the scenes'. Moreover, the use of kernels often increases the dimensionality (virtually) up to an infinite-dimensional space (in the case of the RBF kernel).
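A rough sketch of the contrast (the dimensionality, hidden-layer size, and gamma below are hypothetical, chosen only to make the scaling visible): the RBF kernel touches only the two input vectors and never materializes its implicit feature space, while a one-hidden-layer MLP's weight count grows directly with the input dimensionality.

```python
import numpy as np

def rbf(x, z, gamma):
    # RBF kernel: corresponds to an (implicitly) infinite-dimensional
    # feature space, yet it only ever reads the two input vectors.
    return np.exp(-gamma * np.sum((x - z) ** 2))

d = 100_000                         # nominal input dimensionality
x, z = np.random.rand(d), np.random.rand(d)
print(rbf(x, z, gamma=1.0 / d))     # a single similarity value

# For comparison, a one-hidden-layer MLP with h hidden units needs roughly
# d*h + h weights in its first layer alone -- it scales with d directly.
h = 256
print(d * h + h)                    # 25,600,256 first-layer parameters
```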
The reason is that SVM optimization works on the norms of the vectors and not on the dimensions directly. Hence, if there are many zero feature values, the Euclidean length, i.e. the norm, is much smaller than the number of dimensions would suggest. For texts, this is clearly true, since most of the words in a vocabulary don't occur in any specific text. In other applications, e.g. genetic data, however, this is not true, because there is a non-zero value in most of the dimensions, and then the SVM also suffers from the high dimensionality.
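To make the sparsity point concrete, here is a small illustration (the vocabulary size, document length, and binary term weights are assumptions, not from the answer above): a high-dimensional but sparse "text" vector has a far smaller Euclidean norm than a dense vector of the same dimensionality.

```python
import numpy as np

vocab_size = 50_000            # dimensionality of the bag-of-words space
words_in_doc = 200             # distinct words actually present in one text

# A "text-like" vector: almost all entries are zero.
x_text = np.zeros(vocab_size)
x_text[:words_in_doc] = 1.0    # hypothetical binary term weights

# A "dense" vector (e.g., genetic data): non-zero almost everywhere.
x_dense = np.ones(vocab_size)

print(np.linalg.norm(x_text))   # ~14.1, driven by the 200 non-zeros
print(np.linalg.norm(x_dense))  # ~223.6, grows with the full dimensionality
```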
In addition to the answers above, my intuition says that, compared with a low-dimensional space, a high-dimensional space offers many more possible decision boundaries that separate the classes, so the amount of effort needed in optimization is much less (because there are multiple possible sets of parameter values), and this works as an advantage for the SVM.