I am implementing the bag-of-words method on a dataset of 1 million images. To get 20,000 words (class centers), I use 2 million SIFT features. However, computing the center of each class is too slow.

If anyone has experience using KNN on a large dataset, I would appreciate it if you could share it.
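To make the bottleneck concrete, here is a minimal sketch of the step in question, assuming the centers come from plain brute-force k-means and that each SIFT descriptor has the standard 128 dimensions (neither is stated above). The function names are hypothetical and the data is a random toy stand-in, so this only illustrates where the time goes; it is not the actual pipeline.

```python
import random

def assign(points, centers):
    """Return, for each point, the index of its nearest center (squared L2)."""
    labels = []
    for p in points:
        best, best_d = 0, float("inf")
        for j, c in enumerate(centers):
            d = sum((a - b) ** 2 for a, b in zip(p, c))
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels

def update(points, labels, centers):
    """Recompute each center as the mean of the points assigned to it."""
    dim = len(points[0])
    sums = [[0.0] * dim for _ in centers]
    counts = [0] * len(centers)
    for p, label in zip(points, labels):
        counts[label] += 1
        for i, v in enumerate(p):
            sums[label][i] += v
    # Keep the old center when a cluster ends up empty.
    return [
        [s / counts[j] for s in sums[j]] if counts[j] else list(centers[j])
        for j in range(len(centers))
    ]

def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd iterations: assign, then update, repeated `iters` times."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        labels = assign(points, centers)
        centers = update(points, labels, centers)
    return centers, assign(points, centers)

def distance_ops_per_iteration(n_points, k, dim):
    """Scalar multiply-adds needed by one brute-force assignment pass."""
    return n_points * k * dim

if __name__ == "__main__":
    rng = random.Random(1)
    # Toy stand-in for SIFT descriptors: 500 random 128-dimensional vectors.
    data = [[rng.random() for _ in range(128)] for _ in range(500)]
    centers, labels = kmeans(data, k=8, iters=5)
    print(len(centers), len(labels))
    # Cost of one assignment pass at the sizes described in this post.
    print(distance_ops_per_iteration(2_000_000, 20_000, 128))
```

Under these assumptions, one assignment pass over 2 million features and 20,000 centers compares every feature with every center, which is about 5.12 × 10^12 multiply-adds per iteration. That nearest-center search is the part I am hoping to speed up.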
