I am implementing the bag-of-words method on a data set of 1 million images. To get 20,000 words (class centers), I use 2 million SIFT features. However, computing the center of each class is too slow.
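To make the cost concrete, here is a rough sketch of the setup. It assumes the centers come from plain k-means (Lloyd's algorithm) with brute-force nearest-center search, and that the SIFT descriptors are the standard 128-dimensional ones. All function names are illustrative only, not the actual implementation.

```python
import random

def assign(points, centers):
    """Brute-force assignment: index of the nearest center for each point."""
    labels = []
    for p in points:
        best, best_d = 0, float("inf")
        for j, c in enumerate(centers):
            # Squared Euclidean distance between point p and center c.
            d = sum((a - b) ** 2 for a, b in zip(p, c))
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels

def update(points, labels, centers):
    """Move each center to the mean of the points assigned to it."""
    dim = len(centers[0])
    sums = [[0.0] * dim for _ in centers]
    counts = [0] * len(centers)
    for p, label in zip(points, labels):
        counts[label] += 1
        for i, v in enumerate(p):
            sums[label][i] += v
    # A center with no assigned points is left where it was.
    return [
        [s / counts[j] for s in sums[j]] if counts[j] else list(centers[j])
        for j in range(len(centers))
    ]

def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd's k-means; initial centers are k randomly sampled points."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        labels = assign(points, centers)
        centers = update(points, labels, centers)
    return centers, assign(points, centers)

def ops_per_iteration(n_points, n_centers, dim):
    """Coordinate differences computed by one brute-force assignment pass."""
    return n_points * n_centers * dim

if __name__ == "__main__":
    # Assumed sizes: 2 million features, 20,000 centers, 128-d descriptors.
    print(ops_per_iteration(2_000_000, 20_000, 128))
```

With these assumed sizes, a single assignment pass computes about 5.12e12 coordinate differences, and k-means repeats that pass on every iteration. That is where the time goes in a brute-force version.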
If anyone has experience using KNN on a large data set, I would appreciate it if you could share it.