The main issues in data mining are the performance, accuracy, robustness, scalability, and interpretability of the miner, and the trade-offs between them. For example, you must be careful not to improve performance at the cost of the miner's accuracy.
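To make the performance versus accuracy trade-off concrete, here is a minimal sketch. It assumes that "performance" means running time, and it uses a toy 1-nearest-neighbour classifier on synthetic data. The helper names (`make_points`, `predict_1nn`, `evaluate`) and the data are made up for illustration only. Shrinking the training set makes prediction faster, but the accuracy will typically suffer.

```python
import random
import time

def make_points(n, rng):
    """Generate n labelled 2-D points from two overlapping Gaussian clusters (synthetic data)."""
    points = []
    for _ in range(n):
        label = rng.randint(0, 1)
        centre = 0.0 if label == 0 else 1.5
        points.append(((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)), label))
    return points

def predict_1nn(train, x):
    """Return the label of the training point closest to x (squared Euclidean distance)."""
    nearest = min(train, key=lambda p: (p[0][0] - x[0]) ** 2 + (p[0][1] - x[1]) ** 2)
    return nearest[1]

def evaluate(train, test):
    """Return (accuracy, seconds spent predicting) for 1-NN on the test set."""
    start = time.perf_counter()
    correct = sum(predict_1nn(train, x) == y for x, y in test)
    elapsed = time.perf_counter() - start
    return correct / len(test), elapsed

rng = random.Random(0)  # fixed seed so runs are repeatable
train = make_points(2000, rng)
test = make_points(300, rng)

# Smaller training sets predict faster; compare the accuracy column as the size grows.
for size in (20, 200, 2000):
    accuracy, seconds = evaluate(train[:size], test)
    print(f"train size={size:5d}  accuracy={accuracy:.3f}  time={seconds:.4f}s")
```

The exact numbers depend on your machine and the data, so treat the output only as a way to see both measures side by side before you tune one of them.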
For the data repository, I suggest you visit this site