Refer to the attached image (Source: https://medium.com/mlreview/understanding-lstm-and-its-diagrams-37e2f46f1714)

Why do we need two extra sigmoid layers (marked in red in the figure) when the output of the first one (marked in blue in the figure) could be passed to the input and output gates as well?

Doesn't this increase the computational burden, since each additional sigmoid requires its own dedicated weights and biases?
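To put a rough number on that burden, here is a minimal sketch that counts trainable parameters in both cases. It assumes the common LSTM formulation without peephole connections, where each sigmoid gate and the tanh candidate layer each have a weight matrix over the concatenated [h_{t-1}, x_t] plus a bias vector. The function names and the sizes (10 inputs, 20 hidden units) are hypothetical, chosen only for illustration.

```python
def gate_param_count(input_size, hidden_size):
    """Parameters of one layer acting on [h_{t-1}, x_t]: weights plus bias."""
    weights = hidden_size * (input_size + hidden_size)
    biases = hidden_size
    return weights + biases


def lstm_param_count(input_size, hidden_size, separate_gates=True):
    """Total parameters of one LSTM cell (assumed: no peepholes).

    separate_gates=True  -> three sigmoid layers (forget, input, output)
    separate_gates=False -> one sigmoid layer shared by all three gates
    The tanh candidate layer has its own parameters in both cases.
    """
    per_layer = gate_param_count(input_size, hidden_size)
    n_sigmoid_layers = 3 if separate_gates else 1
    return (n_sigmoid_layers + 1) * per_layer


# Hypothetical sizes, for illustration only.
standard = lstm_param_count(10, 20)                        # 2480
shared = lstm_param_count(10, 20, separate_gates=False)    # 1240
extra = standard - shared                                  # 1240

print(standard, shared, extra)
```

Under these assumptions, the two extra sigmoids double the parameter count of the cell. The sketch only counts parameters; it says nothing about what the network loses when the forget, input, and output gates are forced to take identical values.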
