In binary classification, how does linear SVM decide the weight/coefficients of a feature?

More Gabby Xiong's questions See All

How to analysis Mossbauer Spectroscopy data?

Scenario: I got two data files (α-Fe and Fe-containing samples), and the file names contain the max velocity value. Confusion: 1. How should I plot the X-axis? Is +/- velocity with points...

07 May 2024 3,244 0 View

The failure for thiolated -Aptamer binding onto Gold surface?

Hello. I have try to immobilize the thoilated aptamer onto the gold surface such as Gold coated silicon wafer and the gold coated screen printed electrode. However, it is failed, after aptamer...

29 April 2024 4,054 1 View

Does the diversity of gut flora disappear in nematodes hatched from decontaminated eggs?

Several studies have now reported the relationship between the N2 nematode host and its gut microbiome. If nematodes are contaminated by bacteria during the cultivation and are chemically...

08 April 2024 4,130 0 View

The profile of E coli F1F0 size-exclusion chromatography ?

Does anyone has the profile of E coli. ATPase complex (F1Fo) size-exclusion chromatography (using a Superose 6 column), I mean what the profile looks like, how many peaks? Are they all well separated?

28 December 2023 1,278 0 View

Where to buy the vermiculie from Stanta Olalla? And the ones named Ojen?

I wish someone working in the clay material field could help me about the question. Recently we are running out of the vermiculite from St. Olalla (Spain), I tried to looking for websites where...

18 October 2023 1,439 0 View

Does AOAC standard method 981.12 apply for only solid samples or is it only for liquid samples?

How would you determine the pH content of a biofilm specifically chitin, Is AOAC standard method 981.12 applicable for it's determination of pH. If not what AOAC method can be used for it. What...

02 October 2023 1,859 1 View

Is their a melting point for chitin in general?

Our research about chitin and chitosan regarding film formation, basically we are trying to find an exact literature or answer about chitin's melting point.

25 September 2023 4,967 2 View

Why Huang's CPFEM UMAT code can't run on abaqus + oneapi?

I can run Huang's CPFEM UMAT well on abaqus + inter parallar studio xe. But when I run the same model and UMAT on abaqus + oneapi, error occurs without any possible error messages. Since the...

22 September 2023 9,524 2 View

Are there any cell bodies in the IPL of the olfactory bulb?

Most descriptions of the olfactory bulb (particularly in mice) say that the axons of the projection neurons (mitral cells and tufted cells) run through the IPL, but I cant seem to find any mention...

24 July 2023 5,055 0 View

Donut shape staining in the cell (subcellular localization)?

Hi, anyone knows what's the subcellular localization for that gray staining? Thanks.

13 March 2023 9,803 0 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

Need help with my research project on open source SIEM and machine learning?

Hello everyone, I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from...

04 August 2024 3,196 2 View

What are the limitations and challenges of using machine learning for predicting concrete compressive strength in practical applications?

Machine learning (ML) has shown great potential in predicting the compressive strength of concrete, an important property for structural engineering. However, its practical application comes with...

03 August 2024 2,546 2 View

How to choose the journal?

Hello I want a suitable journal in the field of remote sensing and machine learning to be judged quickly. Thank you for your guidance Thanks

01 August 2024 1,799 4 View

A Question about Phd thesis?

Hello everyone What is your opinion about the introduction of an expert decision support system in which the rules are extracted from existing data without human intervention, instead of being...

31 July 2024 5,785 4 View

The use of data from PubChem for commercial purposes?

Hi, I'm curious to know if data on chemical compounds from PubChem, such as water solubility properties, can be used to train a machine learning model for commercial purposes. Will this infringe...

30 July 2024 8,707 1 View

How can we improve transfer learning techniques to make models generalize better across different tasks and domains with limited labeled data?

Machine Learning

24 July 2024 2,487 3 View

How can AI technology to enhance the agricultural productivity?

Farmers no longer have to apply water, fertilizers, and pesticides uniformly across entire fields. Instead, they can use the minimum quantities required and target very specific areas, or even...

22 July 2024 8,296 3 View

How to Select the most suitable machine learning algorithm depending on the characteristics of the given dataset ?

I'm working on a project that involves analyzing a new dataset, and I'm at the stage of selecting the most appropriate machine learning algorithm. The dataset consists of both numerical and...

22 July 2024 6,097 7 View

The best source for amplification of ADAM17 prodomain?

hi every one I am making vector construction (for fusion proteins) and in this moment I wanna to amplification of ADAM17 prodomain with PCR. to yet, I couldn't amplified the ADAM17 prodomain with...

21 July 2024 8,660 1 View

Stanley Ebhohimhen Abhadiomhen

In multivariate case, I would want to believe such feature appearing way more in the positive side may appear as noise in the negative side. Another case is that, it is redundant feature which is not a determinant for the target variable, in both situation.

Sergio Cofre-Martel

Hey Gabby.

Unlike many ML and DL algorithms, SVM works with support vectors, and thus the weights are either \alpha or 0 for the features. If the weight of a feature is \alpha, it is said to be a support vector, and if 0 the model disregards the feature entirely.

In order to have a linear SVM with overlapping classes, a slack parameter must be given to the model (I believe the common notation is with \epsilon). The advantage of the slack parameter is that it allows misclassification. If so, the two most likely scenarios that I can think of are:

1) the conflicting feature will end up with a weight of 0 (i.e., not a SV) and therefore it does not affect your model.

2) If the optimization results in that feature being a SV, then it will most likely end in the class where its labels are repeated most. It will also depend on the classes of the features around it since, in the end, the SV of a linear model is just the "equilibrium" of the "center of mass" between the classes' SVs. So, as Stanley mentioned above, the feature will be just "noise" in your classification.

In the case of redundant or highly correlated features in a multidimensional dataset, it's likely that only one of the redundant features will be a SV and the rest will be disregarded. Unless these points are very close to the decision boundary.

I might be missing a couple of things here but I hope this helps.

Cheers

Lakchchayam Divya Khare

Gabby Xiong

If we talk about linear SVM , the result is a hyperplane and that help separates the classes . Weights represent this hyperplane by giving you the coordinates of a vector which is orthogonal to the hyperplane. These are the coefficients given by hyperplane.

Regards

Hi, thank you all so much for your insightful answers!

This is actually a follow-up question, I followed the answer from Sergio Cofre-Martel, and looked into the support vectors chosen by the trained linear SVC model. I found there are a lot of cases that are counter intuitive to me, I was wondering if you can provide me with more insights?

In my project, I have binary classification, and the features are with binary values, so I treat a sample contains feature f if I have f=1 in the sample, and vice versa. I looked at the feature occurrence among the chosen support vectors for the top 100 positively weighted features (I.e. top 100 largest positive weights in the model), and I found 45 out of the top 100 features are actually used more among the support vectors for the negative class than among those for the positive class. I was originally expecting more features to have the sign of their weight to be correlated with their usage among support vectors, as the model has high accuracy. I'm find this to be counter-intuitive and not sure what could be the reasons behind it, I would be really grateful if you can provide me some insights on this!

This also makes me think that it is not possible to predict whether a feature will be assigned with a positive / negative / zero weight in SVM, even when we know its usage among training samples, or usage among support vectors? Or can we?

Thank you fo your time in advance!

Best Regards

Hey Gabbi,

I'm glad that my answer was of help to you.

I'm not sure if I fully understand the concerns that you have. From what I can gather, after you trained your binary SVM classifier, you extracted or printed out the support vectors obtained by the model. Correct?

Depending on how you extracted this information I can see 2 possible scenarios:

1 - You extracted the lagrangian weight associated with the dual optimization (beta in the article linked below). It is my understanding that this value must be greater than zero for SVs and zero for non-SV. This doesn't seem to be your case since it appears that you have negative values (?).

2 - You extracted the actual vectors chosen as SV, i.e., now you basically have a subset of your original training input features (X). In this case, you should not worry about the balance between SV from one class or the other. It is not unlikely that one class might have many SVs and the other just a few, even with ratios like 1:10.

I don't know if this is of any help or not. If I have misunderstood your question please let me know.

All the best,

Sergio

https://www.analyticsvidhya.com/blog/2020/10/the-mathematics-behind-svm/