I have recently become interested in big data and data mining, and have been reading up on them on my own. While searching the Internet, I came across mechanisms such as MapReduce jobs, distributed file systems, etc.
I want to know which major algorithms are used in this field to solve real-time problems.