In most of my studies I found that epsilon = 0 gives better results than other epsilon values. Since epsilon = 0 means all vectors become slack variables that need to be minimized (SVR), why not use epsilon = 0?
Parameter ε controls the width of the ε-insensitive zone used to fit the training data. The value of ε can affect the number of support vectors used to construct the regression function: the larger ε is, the fewer support vectors are selected. On the other hand, larger ε-values result in flatter estimates.
"The value of epsilon determines the level of accuracy of the approximated function. It relies entirely on the target values in the training set. If epsilon is larger than the range of the target values we cannot expect a good result. If epsilon is zero, we can expect overfitting. Epsilon must therefore be chosen to reflect the data in some way. Choosing epsilon to be a certain accuracy does of course only guarantee that accuracy on the training set; often to achieve a certain accuracy overall, we need to choose a slightly smaller epsilon."
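The effect described above can be checked empirically. The sketch below uses scikit-learn's `SVR` on a synthetic toy dataset (the data and parameter values are illustrative assumptions, not from the discussion) to show how the support-vector count shrinks as epsilon grows:

```python
# Illustrative sketch: larger epsilon -> wider insensitive tube -> fewer
# support vectors. Dataset and hyperparameters are arbitrary choices.
import numpy as np
from sklearn.svm import SVR

rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 5, 80)).reshape(-1, 1)
y = np.sin(X).ravel() + 0.1 * rng.randn(80)

for eps in [0.0, 0.1, 0.5]:
    model = SVR(kernel="rbf", C=1.0, epsilon=eps).fit(X, y)
    # points outside (or on the boundary of) the tube become support vectors
    print(f"epsilon={eps}: {len(model.support_)} support vectors")
```

With epsilon = 0 essentially every training point lies outside the (zero-width) tube and becomes a support vector, which is why the quoted passage associates that setting with overfitting.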
eps = 0 leads to median estimation, see http://arxiv.org/pdf/1102.2101v1
eps > 0 also leads, in most cases, to median estimation, and usually to fewer support vectors, see http://papers.nips.cc/paper/3466-sparsity-of-svms-that-use-the-epsilon-insensitive-loss
Over- and underfitting are controlled by the regularization parameter "C" (or lambda), as well as by the kernel width if a Gaussian RBF kernel is used.
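In practice this means C, the RBF width (gamma in scikit-learn), and epsilon are usually tuned jointly rather than fixing epsilon = 0 up front. A minimal sketch, assuming scikit-learn and an arbitrary illustrative grid:

```python
# Illustrative sketch: cross-validated grid search over C, gamma and epsilon.
# The synthetic data and grid values are placeholders, not recommendations.
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR

rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, (120, 1))
y = np.sinc(X).ravel() + 0.1 * rng.randn(120)

search = GridSearchCV(
    SVR(kernel="rbf"),
    param_grid={
        "C": [0.1, 1, 10],          # regularization strength
        "gamma": [0.1, 1, 10],      # RBF kernel width
        "epsilon": [0.0, 0.05, 0.1],
    },
    cv=5,
)
search.fit(X, y)
print(search.best_params_)
```

If epsilon = 0 keeps winning under cross-validation, that is at least evidence against overfitting on this particular dataset, since model selection is based on held-out folds.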
Hamed, if the best fit is achieved with epsilon = 0, that most likely means you need to change the kernel, since the SVM probably cannot find a linear relationship in the feature space you are testing.