I need to finding duplicate items for discover the Association Rules in the Bigdata. Common algorithms for solving the Association Rules are Apriori, FP-Growth and others...but I don't use these algorithms. I want to improve the finding time of duplicate items, guide me please