I have a sound sample, and by applying window length 0.015 and time step 0.005, I have extracted 12 MFCC features for 171 frames directly from the sample using a software tool called PRAAT.
Now I have all 12 MFCC coefficients for each frame. I want to process them further, making a 39-dimensional matrix by adding energy features and delta-delta features and applying dynamic time warping. How do I deal with the coefficients and how do I make delta-delta coefficients?