What is the minimum number of instances needed to run deep learning algorithms, especially conventional NNs and RNNs?

Jayaram M.A

There are no hard and fast rules. It is still largely unclear which kinds of problems should be tackled with deep learning algorithms, and how deep a network needs to be is a matter of empiricism.

Stefan Lattner

Usually, it's "the more the better": with fewer instances you need smaller models in order not to overfit, and the overall accuracy will be lower. But networks usually learn even from only a few instances. In any case, expect the classification accuracy to improve when you increase your dataset (and model) size.

Farshid Rayhan

How about 200,000 instances for a binary-class dataset? Is that enough?

What counts as enough depends on your input data; the best approach is simply to try it. For comparison, the baseline MNIST handwritten digit dataset, in its common split, has 50,000 training instances for 10 classes (taken from the 60,000-image training set, with the rest held out for validation), and this is definitely enough. Anything above 10,000 is at least not "very small". If the data within each class is very similar, you need fewer instances; if it varies a lot, you need more. Just try it. 200,000 seems enough for most problems.
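One way to "just try it" is to plot a learning curve: train on growing subsets of the data and watch the accuracy on a held-out set. The sketch below is only an illustration under loud assumptions: the data is synthetic (two overlapping Gaussian blobs), the model is a nearest-centroid classifier standing in for a neural network, and every function name is hypothetical. It uses only the Python standard library.

```python
import random


def make_data(n, rng):
    """Synthetic binary data (an assumption): two overlapping 2-D Gaussian blobs."""
    data = []
    for _ in range(n):
        label = rng.randint(0, 1)
        center = 1.0 if label == 1 else -1.0
        point = (rng.gauss(center, 1.5), rng.gauss(center, 1.5))
        data.append((point, label))
    return data


def fit_centroids(train):
    """Stand-in 'model': the mean point of each class seen in the training set."""
    sums = {0: [0.0, 0.0, 0], 1: [0.0, 0.0, 0]}
    for (x, y), label in train:
        entry = sums[label]
        entry[0] += x
        entry[1] += y
        entry[2] += 1
    # Skip a class that happens to be absent from a very small subset.
    return {c: (s[0] / s[2], s[1] / s[2]) for c, s in sums.items() if s[2] > 0}


def accuracy(centroids, data):
    """Fraction of points whose nearest centroid matches their label."""
    correct = 0
    for (x, y), label in data:
        predicted = min(
            centroids,
            key=lambda c: (x - centroids[c][0]) ** 2 + (y - centroids[c][1]) ** 2,
        )
        correct += predicted == label
    return correct / len(data)


def learning_curve(sizes, seed=0):
    """Train on the first n pooled instances for each n and score on held-out data."""
    rng = random.Random(seed)
    train_pool = make_data(max(sizes), rng)
    validation = make_data(2000, rng)
    curve = []
    for n in sizes:
        model = fit_centroids(train_pool[:n])
        curve.append((n, accuracy(model, validation)))
    return curve


if __name__ == "__main__":
    for n, acc in learning_curve([10, 100, 1000, 10000]):
        print(f"{n:>6} training instances -> validation accuracy {acc:.3f}")
```

If the held-out accuracy is still climbing at the largest size you can afford, more data is likely to help; if it has flattened, the dataset is probably large enough for that model.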

If 50,000 instances for 10 classes are enough, then 200,000 for 2 classes is also OK?

It's hard to tell without knowing what data it is - but usually 200,000 is enough, yes.

Thanks a lot.

Carlos Sampedro Pérez

Hi Farshid,

I strongly recommend this lecture from the course "Learning From Data" provided by Caltech:

https://www.youtube.com/watch?v=Dc0sr0kdBVI

In this lecture, the concepts of the Vapnik–Chervonenkis (VC) inequality and the VC dimension are explained in detail. The VC inequality is the mathematical result that bounds how far the generalization (out-of-sample) error can be from the training error as a function of the number of training samples (among other variables).

Regards,

Carlos.
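For reference, this is the form of the VC inequality and the resulting generalization bound as I recall them from the textbook that accompanies that course. The symbols are not from this thread and are assumed here: N is the number of training samples, E_in and E_out are the in-sample and out-of-sample errors of the learned hypothesis g, m_H is the growth function of the hypothesis set H, d_vc is its VC dimension, and delta is the allowed failure probability.

```latex
% VC inequality: probability that training and true error differ by more than \epsilon
P\left[\, \left| E_{\text{in}}(g) - E_{\text{out}}(g) \right| > \epsilon \,\right]
  \;\le\; 4\, m_{\mathcal{H}}(2N)\, e^{-\frac{1}{8}\epsilon^{2} N}

% Equivalent generalization bound, holding with probability at least 1 - \delta
E_{\text{out}}(g) \;\le\; E_{\text{in}}(g)
  + \sqrt{\frac{8}{N}\,\ln\frac{4\, m_{\mathcal{H}}(2N)}{\delta}}

% The growth function is polynomial in N when the VC dimension is finite
m_{\mathcal{H}}(N) \;\le\; N^{d_{\text{vc}}} + 1
```

Since the growth function is only polynomial in N, the square-root term shrinks as N grows, which is the formal version of "more data, better generalization". The bound is usually very loose in practice, so it is better read as a guide to how the required N scales with model complexity than as an exact sample count.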

Thanks a lot

Ram Sarkar

It is generally said that a deep learning model needs more data than conventional pattern classifiers such as SVMs and ANNs. This is because deep learning does not use handcrafted features, so the model learns directly from the patterns in the raw images. Therefore, with these auto-learning models, if there is a large number of image classes to classify, we need to provide a large number of samples per class. Otherwise the model will not perform well when the test images show extreme variations. We have used deep learning for handwritten digit recognition, and for that we trained and tested with sets of different sizes (e.g., 100, 200, 500, and 1000 samples per digit class). Hope this helps.
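An experiment like this needs training subsets with a fixed number of samples per class. The sketch below shows one possible way to build them; it is not the procedure used in the work described above. The function name and the label list are hypothetical, and only the Python standard library is used.

```python
import random
from collections import defaultdict


def subsample_per_class(labels, n_per_class, seed=0):
    """Return sorted indices containing exactly n_per_class examples of each class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    chosen = []
    for label, indices in by_class.items():
        if len(indices) < n_per_class:
            raise ValueError(f"class {label!r} has only {len(indices)} samples")
        # Draw without replacement so no example appears twice.
        chosen.extend(rng.sample(indices, n_per_class))
    return sorted(chosen)


# Hypothetical label list: 10 digit classes with 1,000 samples each.
labels = [digit for digit in range(10) for _ in range(1000)]

if __name__ == "__main__":
    for n in (100, 200, 500, 1000):
        subset = subsample_per_class(labels, n)
        print(n, "per class ->", len(subset), "training samples")
```

Training the same model on each subset and comparing test accuracy then shows how quickly performance improves with the number of samples per class.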

Very helpful, thanks.

Essaid El Bachari

Thanks a lot for the information.