I would like to clarify two queries during training of Hidden Markov model

Data set background

I have 2 list say St & Rt. Length of my list len(St) = 200 & len(Rt) = 100 Each element in list is numpy array of size 100*5 Each list contains vehicle driving data which perform some maneuvers each.

I have attached a picture below of my data set (i.e St[0] single element in list St which is an numpy nd array of size 100*5)

Requirement

I would like to perform training using Hidden Markov model(Baum Welch/ EM Algo) to get the parameters for each of my list which contains specific maneuver data.

Questions

Q 1 : How should I select or determine the number of States and number Gaussian Mixtures for my model ? Is there is any method to find appropriate numbers ?

Q 2 : As I have attached a picture of how my data set look like. Whether my Data should represent any particular or specific distributions before i provide as an input into the model ? (like normal, uniform) If so how can I confirm, what type of distribution my data have ?

I would be to receive any suggestion/sources based on my above question. I am a new bee in learning HMM with GMM, any help would be appreciated.

Ref Picture attached

Similar questions and discussions