You don't want all the initial weights to be zero, because then you are not breaking any symmetries in the network structure: every hidden unit computes the same function and receives the same gradient. (Initial bias weights of zero are fine, though.)
You also don't want all the initial weights to be positive, since (on average) half the weights after training will be negative.
Assuming a logistic activation function and normalised input data, we also don't want the initial weights to be too large, since large weighted sums push the logistic function into its flat, saturated regions, where the derivatives are small and learning takes longer.
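To make the symmetry point concrete before looking at the usual fix, here is a minimal NumPy sketch (my own toy example, not from any particular package): a one-hidden-layer logistic network with two hidden units and all-zero weights. However long you train it, the two hidden units receive identical gradients and never differentiate.

```python
# Toy demonstration: with all-zero weights, every hidden unit sees the same
# gradient, so the two hidden units remain identical copies of each other.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))                         # toy normalised inputs
y = rng.integers(0, 2, size=(8, 1)).astype(float)   # toy binary targets

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

W1 = np.zeros((3, 2))   # input -> hidden weights, all zero
W2 = np.zeros((2, 1))   # hidden -> output weights, all zero

for _ in range(100):
    h = sigmoid(X @ W1)                    # hidden activations
    out = sigmoid(h @ W2)                  # output activation
    d_out = (out - y) * out * (1 - out)    # output delta
    d_h = (d_out @ W2.T) * h * (1 - h)     # hidden deltas
    W2 -= 0.5 * h.T @ d_out
    W1 -= 0.5 * X.T @ d_h

print(np.allclose(W1[:, 0], W1[:, 1]))     # True: the symmetry is never broken
```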
So the standard approach is to choose random uniform initial weights between -1 and 1. Some of the negative initial weights will become positive, and some of the positive ones will become negative, but we don't know which ones (unless we do some unsupervised learning as a pre-processing step), so we just guess.
Note that some packages allow more sophisticated heuristics, such as initial weights between -k and k, where k = sqrt(6 / (number of node inputs + number of node outputs)), for tanh activation nodes.
But the basic idea is the same: trained neural nets have negative weights, so we initialise with some negative weights.
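For concreteness, here is a short sketch of both schemes mentioned above. The layer sizes (fan_in, fan_out) and the random seed are hypothetical choices for illustration, not values from the original answer.

```python
# Two common uniform initialisation schemes for one weight matrix.
import numpy as np

rng = np.random.default_rng(42)
fan_in, fan_out = 64, 32   # hypothetical number of node inputs / outputs

# Simple approach: uniform weights in [-1, 1]
W_simple = rng.uniform(-1.0, 1.0, size=(fan_in, fan_out))

# Glorot/Xavier-style heuristic for tanh nodes: uniform in [-k, k]
k = np.sqrt(6.0 / (fan_in + fan_out))
W_glorot = rng.uniform(-k, k, size=(fan_in, fan_out))

print(W_simple.min(), W_simple.max())     # roughly -1 .. 1
print(k, W_glorot.min(), W_glorot.max())  # k = 0.25, so roughly -0.25 .. 0.25
```

Either way, roughly half the initial weights are negative, which is the point of the heuristic.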
I am quoting @Inanc Gumus because he explained this issue very nicely:
"
Imagine that someone has dropped you from a helicopter onto an unknown mountain top and you're trapped there. Everything is covered in fog. The only thing you know is that you should somehow get down to sea level. Which direction should you take to get down to the lowest possible point?
If you couldn't find a way down to sea level, the helicopter would pick you up again and drop you at the same mountain-top position. You would have to take the same directions again, because you're "initializing" yourself to the same starting position.
However, if the helicopter dropped you somewhere random on the mountain each time, you would take different directions and steps. So, there would be a better chance of reaching the lowest possible point.
This is what is meant by breaking the symmetry. The initialization is asymmetric (each run is different), so you can find different solutions to the same problem.
In this analogy, where you land corresponds to the weights. So, with different weights, there's a better chance of reaching the lowest (or a lower) point.
Also, it increases the entropy in the system, so the system can create more information to help you find lower points (local or global minima)."