Data Mining is applied Machine Learning. The former focuses more on the practical aspects of deploying machine learning algorithms on large datasets, whereas the later focuses on modeling, prediction, classification and computational efficiency for large datasets. Also, please check the following question in RG with lots of good answers.
In machine learning ,we hear about supervised learning,unsupervised learning and semi- supervised learning.In data mining ,we learn few techniques such as classification,association,and clustering. As per my knowledge,while introducing these classification ,association and clustering techniques to students ,will link these concepts to supervised and unsupervised learning techniques.so in data mining few concepts are inherited from machine learning
You can look at CRISP-DM methodology (Cross-Industry Standard Process for Data Mining). Data mining is a few-stage process. Machine learning special algorithms and methods that are used in one step of this process - modeling (see Fig 3. - table of tasks during Data Mining here http://www.dataprix.net/en/reference-model-crisp-dm )
If you use weka you will have to convert data file into arff format. With R, csv file can be used directly. R has very good functions for visualization of associations rule. Documentation of the package "arules" is available.
With simple function interface, It is easier to use than weka GUI.
As a novel paradigm for exploring knowledge, Data Mining is the computational process of discovering information and knowledge in large data sets or big data. It can be applied by Statistics, Machine Learning, Databases systems, etc. As one of the critical tools for applying Data Mining, Machine learning evolved from the study of pattern recognition and computational learning theory in artificial intelligence. As a result, Machine Learning mostly mentions a wide range of tools for pattern analysis, whereas, Data Mining mostly refers to the whole process of information/knowledge discovery.