I am trying to understand LSTM with KERAS library in python. I found some example in internet where they use different batch_size, return_sequence, batch_input_shape but can not understand clearly. I read the KERAS documentation but could not get those yet. Please help me to understand those.
another thing is, when I tried with small batch size the loss is smaller and performs better than higher batch size.. please explain why.
Thanks in advance.