In data mining, a most basic concept we need to know is the difference between classification and clustering? Please give some web links and attach papers.
Both classification and clustering split data into groups. However, in classification the groups are specified before hand (supervised learning), while in clustering the groups (or clusters) are based on the similarities of data instances to each other (unsupervised learning). For more information, please see the attached link.