Since more and more distributed machine learning systems such as Hadoop, Spark, Graph processing systems are emerging, it is necessary to know the advantages and disadvantages of those systems. Does anybody know a survey paper about this?
Hadoop, Spark, Graph processing systems are towards processing of large amount of data in distributed environments.you can find a good comparison about existing framworks in "A survey on platforms for big data analytics" paper