Over-fitting is typically viewed from the perspective of both accuracy and model complexity.
To mitigate over-fitting, the usual practical approaches are k-fold cross-validation and a training/validation/test split.
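For concreteness, a minimal sketch of what I mean by the practical approach (scikit-learn, with a placeholder classifier and synthetic data; the specific estimator is not the point):

```python
# Minimal sketch: 5-fold cross-validation plus a held-out test set.
# The classifier and dataset are placeholders.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score, train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_trainval, X_test, y_trainval, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

clf = LogisticRegression(max_iter=1000)
cv_scores = cross_val_score(clf, X_trainval, y_trainval, cv=5)
clf.fit(X_trainval, y_trainval)

print("5-fold CV accuracy: %.3f +/- %.3f" % (cv_scores.mean(), cv_scores.std()))
print("Held-out test accuracy: %.3f" % clf.score(X_test, y_test))
```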
Question: theoretically, can we leverage statistical learning theory (COLT) to derive bounds on how confident we can be that learning has succeeded,
and on how well classification will work on unseen examples?
For example, we often consider the minimum number of samples needed to learn (an upper bound, i.e. a sufficient number of samples), derived from the VC dimension.
Agreed, this is an overestimate, and |H| or the VC dimension may be unknown in practice.
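For reference, the kind of sample-complexity bounds I mean (one common form for a consistent learner over a finite hypothesis space H, and the VC-dimension version; the exact constants vary by source):

$$ m \ge \frac{1}{\epsilon}\left(\ln|H| + \ln\frac{1}{\delta}\right), \qquad
m \ge \frac{1}{\epsilon}\left(4\log_2\frac{2}{\delta} + 8\,\mathrm{VC}(H)\log_2\frac{13}{\epsilon}\right) $$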
The other perspective is this: given the number of samples m and delta (the probability of failure),
we derive an error bound on unseen examples. Can this error bound be theoretically interpreted as an estimate of over-fitting?
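As a sketch of that second perspective (assuming the Hoeffding-style bound for a finite hypothesis space and one common form of the VC generalization bound; the bound on the gap between true and training error is what I would read as the 'over-fitting' estimate):

```python
import math

def gap_bound_finite_H(m, delta, H_size):
    # Hoeffding + union bound over a finite hypothesis space:
    # with probability >= 1 - delta, |true error - training error| <= this value
    # simultaneously for all h in H.
    return math.sqrt((math.log(H_size) + math.log(2.0 / delta)) / (2.0 * m))

def gap_bound_vc(m, delta, vc_dim):
    # One common form of the VC generalization bound (constants vary by source):
    # with probability >= 1 - delta,
    # true error <= training error + sqrt((d*(ln(2m/d) + 1) + ln(4/delta)) / m)
    d = vc_dim
    return math.sqrt((d * (math.log(2.0 * m / d) + 1.0) + math.log(4.0 / delta)) / m)

m, delta = 10000, 0.05
print(gap_bound_finite_H(m, delta, H_size=10**6))  # ~0.03
print(gap_bound_vc(m, delta, vc_dim=20))           # ~0.13
```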
Appreciate any pointers.