When you increase the dataset, do both the training set and the validation set grow, or only the training set? This looks like overfitting: the model mainly focuses on the training set, and I think that might be a useful point to investigate in your research. For example, if completing the task requires several kinds of information, say A, B, and C, but you only provide A, the performance will be lower than expected. Hope this is useful!
Jiarong Chen Thanks for your answer! Only the training set increases in this case. I ran two experiments: one (E1) with a sampled subset of the training set, and the other (E2) with the full, unsampled training set. The validation set is the same in both experiments. The result shows that E2 has a lower training loss, but its validation loss is almost the same as E1's.
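The setup above can be sketched roughly as follows. Everything here is a hypothetical stand-in (synthetic data, a logistic-regression model, and arbitrary sample sizes), not the actual experiment; it only illustrates the protocol of training on a sampled subset (E1) versus the full training set (E2) while holding the validation set fixed:

```python
# Hypothetical sketch of the E1/E2 comparison with synthetic data.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

rng = np.random.default_rng(0)

# Synthetic data standing in for the real dataset.
X = rng.normal(size=(2000, 10))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=2000) > 0).astype(int)

# Fixed validation split, shared by both experiments.
X_train, X_val = X[:1500], X[1500:]
y_train, y_val = y[:1500], y[1500:]

# E1: a sampled subset of the training set.
idx = rng.choice(len(X_train), size=500, replace=False)
m1 = LogisticRegression().fit(X_train[idx], y_train[idx])

# E2: the full (unsampled) training set.
m2 = LogisticRegression().fit(X_train, y_train)

# Compare training loss and validation loss for each experiment.
for name, model, Xt, yt in [("E1", m1, X_train[idx], y_train[idx]),
                            ("E2", m2, X_train, y_train)]:
    train_loss = log_loss(yt, model.predict_proba(Xt))
    val_loss = log_loss(y_val, model.predict_proba(X_val))
    print(f"{name}: train loss {train_loss:.3f}, val loss {val_loss:.3f}")
```

When the extra samples in E2 are drawn from the same distribution as E1, this kind of comparison typically shows E2's validation loss close to E1's even when its training loss differs, matching the result described above.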
Got it! It seems that E1+E2 is not better than E1 alone, so the extra data in E2 might be useless. You could try training on E2 by itself and discuss how the results differ. Is there a domain gap between these datasets?
Jiarong Chen Thanks again! I will try your advice. There is no domain gap between the datasets. My guess is that the data distribution of E2 is similar to that of E1, so increasing the training set does not improve the model's performance, even though the training loss is lower.