It's very common to use multiple losses. People usually multiply each individual loss by a trade-off factor and sum them up, like the example below (a generator loss in WGAN-GP):
g_loss = -diff + lambda1 * gradient_penalty + lambda2 * mse_loss
g_loss.backward()
So the problem arises: how do I appraise the effect of each loss, so that I can tune the trade-off factor lambda? In the WGAN-GP case above, the last term mse_loss is a custom loss I added to the total loss. How should I adjust the factor lambda2 so that mse_loss takes effect but does not become excessively dominant?
Of course, a hyper-parameter search could solve this problem, but I'm looking for a more elegant solution: I want to appraise the effect of each loss directly and quantitatively, and set the factor according to that appraisal.
At first glance, my intuition was something like this:
OK, I would plot the curve of each loss over training. After comparing the magnitudes of the losses, I would assign a larger factor λ to the smaller one to promote it.
But on deeper thought, I realized this is wrong and makes no sense. It is the gradient of the loss that really matters, and basic calculus tells me that the value of a function f(x) says nothing about its derivative df(x)/dx. Therefore, a loss with a larger magnitude does not promise a larger gradient back-propagated to the network, and so does not promise a larger effect.
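A toy example makes this concrete (purely illustrative numbers, nothing to do with my actual model): the first loss has a value around 1000 but a tiny gradient, while the second has a value of 1 but a gradient norm four orders of magnitude larger.

```python
import torch

x = torch.ones(10, requires_grad=True)

loss_big_flat = 1000.0 + 1e-3 * x.sum()   # value ~ 1000.01, d/dx = 0.001 per element
loss_small_steep = 10.0 * x.sum() - 99.0  # value = 1.0,     d/dx = 10    per element

g1 = torch.autograd.grad(loss_big_flat, x)[0]
g2 = torch.autograd.grad(loss_small_steep, x)[0]

print(loss_big_flat.item(), g1.norm().item())     # ~1000.01, ~0.0032
print(loss_small_steep.item(), g2.norm().item())  # 1.0,      ~31.6
```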
I couldn't figure it out, so I've come to ask: is there any good way to appraise the effect of each loss directly and quantitatively? Do I have to print the gradient of each loss and analyse them?
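By "print the gradient of each loss" I mean something like the sketch below: back-propagating each term separately and comparing the gradient norms it induces on the generator's parameters. This is only a rough sketch; netG is a placeholder name for the generator, and diff, gradient_penalty, and mse_loss are assumed to be the scalar tensors from the snippet above.

```python
import torch

def grad_norm(term, params):
    # L2 norm of d(term)/d(params). retain_graph lets us query several
    # terms from the same forward pass; allow_unused handles terms that
    # do not touch every parameter.
    grads = torch.autograd.grad(term, params, retain_graph=True, allow_unused=True)
    flat = [g.reshape(-1) for g in grads if g is not None]
    return torch.cat(flat).norm() if flat else torch.tensor(0.0)

params = [p for p in netG.parameters() if p.requires_grad]
n_w = grad_norm(-diff, params)              # Wasserstein term
n_gp = grad_norm(gradient_penalty, params)  # gradient-penalty term
n_mse = grad_norm(mse_loss, params)         # my custom MSE term
print(f"|g_w|={n_w.item():.4f}  |g_gp|={n_gp.item():.4f}  |g_mse|={n_mse.item():.4f}")
```

If this is the right idea, lambda2 could then be set so that lambda2 * n_mse stays at some fixed fraction of n_w over training, but I'm not sure whether that is reliable in practice or whether there is a more standard approach.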