Hello,
I am working on a machine learning project that maps EMG / IMU (input) to predict joint angles (output).
However, as I split my data into training and testing sets, my X_train and Y_train need to have the same number of rows to map inputs to outputs. Since EMG was sampled at 2000 Hz whereas motion capture was sampled at 120 Hz, should I upsample my joint angle data or should I downsample the EMG data? My intuition is that it is 'better' to downsample because in real applications, you would not have 'fake' data added to the system. Is that a correct assumption or does upsampling vs. downsampling not matter in this situation?
For example, if I have 10 seconds of walking data, I would want my training shapes as:
Shape = (# rows, # columns)
Downsample scenario:
X_train.shape = (120Hz * 10sec, # of EMG sensors = 5) = (1200, 6)
Y_train.shape = (1200, 1)
OR
Upsample scenario:
X_train.shape = (2000Hz * 10 sec, # of EMG sensors = 5) = (20000, 5)
Y_train.shape = (20000, 1)