Hello,

I am working on a machine learning project that maps EMG / IMU (input) to predict joint angles (output).

However, as I split my data into training and testing sets, my X_train and Y_train need to have the same number of rows to map inputs to outputs. Since EMG was sampled at 2000 Hz whereas motion capture was sampled at 120 Hz, should I upsample my joint angle data or should I downsample the EMG data? My intuition is that it is 'better' to downsample because in real applications, you would not have 'fake' data added to the system. Is that a correct assumption or does upsampling vs. downsampling not matter in this situation?

For example, if I have 10 seconds of walking data, I would want my training shapes as:

Shape = (# rows, # columns)

Downsample scenario:

X_train.shape = (120Hz * 10sec, # of EMG sensors = 5) = (1200, 6)

Y_train.shape = (1200, 1)

OR

Upsample scenario:

X_train.shape = (2000Hz * 10 sec, # of EMG sensors = 5) = (20000, 5)

Y_train.shape = (20000, 1)

More David Stephen Hollinger's questions See All
Similar questions and discussions