The feature set for my project is pitch values in the form
features = (p1,n1,p2,n2,p3,n3.....pt,nt)
where p1 is the pitch value and n1 is the duration (number of times that pitch value is repeated)
I am currently using FDTW to compare two signals after remaking the inputs in the form and normalizing them by their mean and standard deviation.
(p1,p1,p1,p1..... ,,, p2,p2,p2,p2 .... and so on)
I have sampled each the hum and original tune to compare with and used the algorithm to determine similarity. Which makes the comparisons very slow. Currently the time complexity for a single hum to song comparison is
O(n) = n3log(n
FDTW has given me an accuracy of 60-90% based on the correctness of hummed tune.
I am unclear about how to use Machine Learning for my project. Is it even required and will it improve the time complexity or accuracy?