Can anyone explain "batch_size", "batch_input_shape", return_sequence=True/False" in python during training LSTM with KERAS?

More S. A. Nahian's questions See All

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

How to increase simulation box size?

We intend to study the interaction between peptides and polymer (like PP, PE and PS) through MD simulations using Martini force fields ( Martini 2 for PP and Martini 3 for PE, PS). We have...

08 August 2024 4,842 0 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

I am trying to obtain microstructure for Mg-Zn-Sn alloy?

Any suggestions with respect to etchant composition and holding time?

27 July 2024 6,925 2 View

How to get Scopus Author Index ??

18 July 2024 4,080 3 View

Dear researchers. pl help how to plot jablonski energy level graph and magnetic hysteresis curve in origin?

through origin software

17 July 2024 4,991 0 View

Do software tools exist to assess the economic and technical practicality of introducing new food products, such as yogurt with modified starch?

This question explores the world of food innovation and asks if there are computer programs that can analyze the financial and technical feasibility of introducing new food products. For instance,...

13 July 2024 7,446 0 View

My nanoparticle has a lower fluorescence life time of 2 ns (usual life time between 3-10 ns). what are the inferences I can get from this?

what all details we will get from fluorescence life time data

10 July 2024 505 1 View

Alternative binders other than Nafion solution?

Hi Everyone, I plan to deposit a catalyst (TS-1@Co-PDA, in the core: TS-1 zeolite with a shell of Polydopamine designed with Cobalt) on a rotating ring-disk electrode (RRDE) to evaluate the...

29 June 2024 8,203 3 View

Journal Report Impact Factor 2024?

Has the new journal impact factor for 2024 been released? pls send me updates if anyone has it.

19 June 2024 6,333 1 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

Request Python code?

Request Python code from this article : Gender equity of authorship in pulmonary medicine over the past decade. THANKS!

08 August 2024 6,242 2 View

What are examples of AI for good projects a teacher can assign to students?

So I am organizing an AI seminar. What are possible AI projects in the AI for good spirit? something the students can do and have an impact?

08 August 2024 9,437 4 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How to design human-centered classroom in the age of A.I.?

08 August 2024 347 5 View

Why does everyone use vs code?

Visual Studio Code (VS Code) has become a popular choice among developers for several reasons: 1. **Free and Open Source**: VS Code is free to use and open source, making it accessible to...

07 August 2024 7,013 4 View

The Bigger You Are, the Harder You Fall (some lessons from Dinosaurs)?

Evolutionary fitness is based on an organism’s ability to adapt rapidly to changing environmental circumstances. Large-bodied mammals have been equipped with large brains (and hence a high...

06 August 2024 4,849 2 View

Konstantinos Kontakis Popular answer

From my experience with Keras and assuming you are familiar enough with neural networks:

batch_size denotes the subset size of your training sample (e.g. 100 out of 1000) which is going to be used in order to train the network during its learning process. Each batch trains network in a successive order, taking into account the updated weights coming from the appliance of the previous batch.

return_sequence indicates if a recurrent layer of the network should return its entire output sequence (i.e. a sequence of vectors of specific dimension) to the next layer of the network, or just its last only output which is a single vector of the same dimension. This value can be useful for networks conforming with an RNN architecture.

batch_input_shape defines that the sequential classification of the neural network can accept input data of the defined only batch size, restricting in that way the creation of any variable dimension vector. It is widely used in stacked LSTM networks.

Concerning your last inquiry, I suppose that the smaller batch size eased the overall computational cost -while at the same time- the classification accuracy of your dataset was improved due to the increased number of weights' updates. However, the performance and accuracy of your tested network also depends on various other parameters (like its dataset size, learning rate, momentum, etc), so don't take for granted that a smaller batch size will always wield better results compared to a higher one (a rather small size may 'generate' noise and lead to bad training).

Konstantinos Kontakis

S. A. Nahian

I really appreciate your help Mr. Konstantinos Kontakis. It helps.. :)

Thanks

Aldo Algorry

I have the same question, I a priori agreed with Konstatinos but I'm a bit confused seeing this piece of code from Keras Documentation http://keras.io/getting-started/sequential-model-guide/

data_dim = 16

timesteps = 8

nb_classes = 10

batch_size = 32

# expected input batch shape: (batch_size, timesteps, data_dim)

# note that we have to provide the full batch_input_shape since the network is stateful.

# the sample of index i in batch k is the follow-up for the sample i in batch k-1.

model = Sequential()

model.add(LSTM(32, return_sequences=True, stateful=True,

batch_input_shape=(batch_size, timesteps, data_dim)))

model.add(LSTM(32, return_sequences=True, stateful=True))

model.add(LSTM(32, stateful=True))

model.add(Dense(10, activation='softmax'))

If you see the comment " the sample of index i in batch k is the follow-up for the sample i in batch k-1.", it appear that a batch is the unit of information for each time step. But on the other hand the batch_input_shape is a 3D shape,

Someone can explain me wich is the meaning of this parameters; batch_size, timesteps, data_dim?