The feature set for my project is pitch values in the form

features = (p1,n1,p2,n2,p3,n3.....pt,nt)  

where p1 is the pitch value and n1 is the duration (number of times that pitch value is repeated)

I am currently using FDTW to compare two signals after remaking the inputs in the form and normalizing them by their mean and standard deviation.

(p1,p1,p1,p1..... ,,, p2,p2,p2,p2 .... and so on)

I have sampled each the hum and original tune to compare with and used the algorithm to determine similarity. Which makes the comparisons very slow. Currently the time complexity for a single hum to song comparison is

O(n) = n3log(n

FDTW has given me an accuracy of 60-90% based on the correctness of hummed tune.

I am unclear about how to use Machine Learning for my project. Is it even required and will it improve the time complexity or accuracy?

Similar questions and discussions