While working with smaller datasets, I am finding it difficult to get good accuracy on rare classes with deep learning architectures, even with augmentation. What other approaches could be tried to improve the model?
The problem is that a neural network implicitly learns the prior (class) distribution of the data if you train it to approximate the posterior distribution, which is what cross-entropy or MSE losses effectively do.
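To spell out that reasoning (a standard Bayes-rule identity, not something specific to this answer): the posterior the network approximates factors as

$$p(y \mid x) \propto p(x \mid y)\, p(y),$$

so the class prior $p(y)$ is baked into the predictions, and a rare class gets systematically low scores no matter how informative the input is.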
So the "proper" way would be to use a different loss function for training (which is however probably difficult to implement). You can try a logarithmic loss (e.g. mean_squared_logarithmic_error in Keras), but I am not sure whether it helps.
Alternatively, you can try to increase the number of samples of your small class to make the dataset more balanced. The best way would be to record new samples of this class. If that is not possible, you can increase the number of samples through augmentation or simple repetition. You mentioned that you already tried augmentation, but did you augment the data to the point that the classes were nearly balanced?
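A minimal sketch of the "simple repetition" option, assuming the data sits in NumPy arrays `X` and `y` (both names hypothetical): each class is resampled with replacement until it matches the size of the largest class.

```python
import numpy as np

def oversample_to_balance(X, y, seed=0):
    """Repeat minority-class samples until every class is as large as the biggest one."""
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    idx = []
    for c in classes:
        class_idx = np.where(y == c)[0]
        # Sample with replacement so every class ends up with `target` examples.
        idx.append(rng.choice(class_idx, size=target, replace=True))
    idx = np.concatenate(idx)
    rng.shuffle(idx)
    return X[idx], y[idx]
```

The rebalanced arrays can then be passed to training as usual; applying augmentation on top of the repeated samples keeps them from being exact duplicates.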
Or you can search Google for "imbalanced classification"; perhaps something useful will come up.
Thank you very much, Michael, for your suggestions. I will definitely try the logarithmic loss. I did not balance the classes, as augmentation was performed on all classes equally; I will try that as well.