Given a dataset, is there a systematic way to determine a maximum classification accuracy, beyond which no algorithm or classifier can improve?
In the idealized case with a given closed dataset and a classification problem (fixed number of classes), the maximum accuracy would be 100%: every sample is sorted into the correct class. You can't do better than that. But...
...the problems arise when you introduce new data. The generalization capability of your approach, and possible overfitting, can only be tested on data the model has not seen. You should split your dataset before training (train, validation, test sets), but every time you rerun your models on the same held-out data you reuse it and risk overfitting to it. A minimal sketch of such a split is shown below.
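One possible way to do this split with scikit-learn is sketched below; the variable names, split ratios, and toy data are only illustrative assumptions, not something given in the question.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Assumed toy data: X are features, y are class labels.
rng = np.random.default_rng(seed=0)
X = rng.normal(size=(1000, 20))
y = rng.integers(0, 3, size=1000)

# First carve out a test set that is never touched during model development.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Then split the remainder into train and validation sets.
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.25, stratify=y_rest, random_state=0
)

# Model selection should use only (X_train, y_train) and (X_val, y_val);
# (X_test, y_test) is evaluated once, at the very end.
```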
Hi Michal Rapczynski, I shouldn't have used the words "theoretical maximum". Rather, I meant the practical maximum in the case of noisy true labels. How would we tackle this situation?
By "the target data is noisy", do you mean that the dataset does not have a correct class label for every sample? In that case, the minimal error would be very hard to quantify. It would depend on the size of the dataset and the fraction of wrong labels, and it would differ for each model and training run if you shuffle your dataset before training. The sketch below illustrates how a given label-noise rate alone already caps the accuracy you can measure.
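A rough illustrative simulation, under the assumption that labels are flipped uniformly at random with some known rate (the noise rate and class count here are made up for the example): even a perfect classifier, scored against the noisy labels, cannot exceed roughly 1 minus the noise rate.

```python
import numpy as np

# Assumption for illustration: each label is flipped to a different class
# with probability noise_rate, uniformly at random.
rng = np.random.default_rng(seed=0)
n_samples, n_classes, noise_rate = 100_000, 3, 0.10

true_labels = rng.integers(0, n_classes, size=n_samples)

# Corrupt a random subset of labels to simulate noisy annotations.
noisy = true_labels.copy()
flip = rng.random(n_samples) < noise_rate
shifts = rng.integers(1, n_classes, size=flip.sum())
noisy[flip] = (noisy[flip] + shifts) % n_classes

# An oracle that always predicts the true class is still scored against
# the noisy labels, so its measured accuracy is capped.
measured_accuracy = np.mean(true_labels == noisy)
print(f"measured accuracy of a perfect classifier: {measured_accuracy:.3f}")
# roughly 1 - noise_rate, i.e. about 0.90 here
```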
For a linear model, there might be a way to solve it mathematically. With a non-linear model, you would probably have to test all possible permutations, which could take a very long time (days to years depending on the model and data).