Count-min sketch is very influential and is wonderful invention by Graham Cormode and Muthukrishnan. It has many application, one of which is finding the most frequent items in data streams.
I don't understand how the identity of the items are recovered from the sketch. Since the technique uses hash functions, which are uni-direction. This mean that given an item X and a hash function H, we have H(X)==>y. Now there is now way that if you have "H" and "y" you can recover "X".
Since count-min sketch uses hash functions to map items to sketch. Thus after processing enough items, say millions of items, given the sketch and the hash functions, how they find the frequent items, if they don't know the items. Well the answer is trivial if you know the items.
So I really don't understand, how they manage to do so. I have been reading different papers but can't understand it. I will be thankful if you can explain it.
Article An improved data stream summary: The Count-Min Sketch and it...