what is the use of applying clustering before classification?
Hi Jabbar,
I advise you to take a look at this URL:
https://www.quora.com/In-data-mining-before-any-classification-should-we-always-do-a-clustering-first-on-the-training-dataset
HTH.
Depends on the number of samples you have and their dimensions.
If dimensions>>number of samples, then it would be good to do PCA or other types of dimensionality reduction before classification
You may also check a previous discussion:
https://www.researchgate.net/post/Advantages_of_using_Integration_of_Hierarchical_Agglomerative_Clustering_and_C_45_Decision_Tree?_tpcectx=qa_overview_following&_trid=VQZU7ONY9CdKhgOsNYbfBtzw_
Hi, A.J.,
By now You got some reasonable answers; in particular, the link given to the webpage where there is much to read.
I will try to give the anwer in own words.
CLASSIFICATION is used when one knows allready classes into which the data
vectors should be classified. The classes are fixed, e.g. 3 kinds of health state.
How to make the classification? One needs for this an classification algorithm.
The algorithm may be derived in a variety of ways. Usually one obtaines
them using discriminant analysis.
CLUSTERING. If i have a new data set and do not know (much) about the classes
into which eventually the data set might be subdivided. Then I try to find
some 'natural' clusters of the data. Usually the user has to define K. the number
of clusters into which he wants subdivide his data. E.g., A plethora of K-means algorithms may be used for that purpose. The applied algorithm works
in an unsupervised way according to some mathematical priciple. The algorithm
has no knowledge about eventually classes the data points belong to,
The algorithm works (makes clustering) about his own principles, e.g. the most similar data vectors are linked together.
Finally, I obtain K clusters.
Now my job is to find out, if the obtained clusters are meaningful for me.
Myself, I am employing for this purpose some unsupervised graphical methods,
for example Kohonen's self-organizing maps. The nb of variables/individuals
does not matter much here.
Cluster analysis and Discriminant analysis include a wide range of topics.
and combined with data may provide exciting results of the elaboration.
Hope to hear on Your results
Good luck
A.B.
thanks one and all for your valuable suggestions
we want to organize conference -proceedings should be published in elsevier/science direct .Can any one send me the contact address of elsevier india
31 December 2017 9,684 3 View
shall i get the resource material on artificial intelligence and remote sensing
06 July 2017 4,419 3 View
iam acting as an editor for springer communiactions and computing. i need one active reserahcer from U.S or U.K to act as co editor. those who are interested can mail their CV to...
03 April 2017 3,496 0 View
what is the new direction to closed frequent itemset mining? shall I get advanced papers in closed frequent item set mining
01 February 2017 9,000 1 View
recently my paper has been accepted in bio medical research journal.its credentials are ISSN: 0970-938X (Print)
31 December 2016 9,836 0 View
does any one clarify my doubt that SCImago and science citation index are same?
08 September 2016 1,781 0 View
can any body send the list of some good conferences for computers -springer and elsevier in 2016/2017
08 September 2016 1,688 2 View
Dear all will any body suggest good journals in networks domain.Preferably unpaid SCI indexed journals.
07 August 2016 4,102 2 View
I Need Gure KDD data set for IDS.will any one send me the data set in ARFF format
07 August 2016 4,472 1 View
can anyone help in find out the steps in average one depende algorithm in data mining
07 August 2016 7,875 1 View
Dear colleagues, Is it possible to send me the list of journals impact factor for the year 2024 (classification is for the year 2023)? excel format if it is possible. Thank you in...
29 June 2024 2,102 3 View
How does the application of generative adversarial networks (GANs) for data augmentation impact the robustness and accuracy of image classification models?
09 June 2024 2,923 2 View
How can attention mechanisms be integrated with convolutional neural networks to enhance performance in image classification tasks?
09 June 2024 2,432 3 View
How Satellite Bands (Landsat/Sentinal) and indices (NDVI/NDBI) were composite together (Layer stacked) (In a single layer) before performing supervised classification (MLC/SVM/RF etc)? How it...
06 June 2024 2,207 1 View
Hi folks, I'm a computer scientist PhD student, and I'm working on implementing Multi-Task Learning architecture for a better generalization aims, it will be throughout a Deep Learning model. I...
21 May 2024 8,589 1 View
Hello everyone, I have a dataset of videos for action classification, where each video contains multiple actions. I need to annotate these videos with the name of each class and the start and end...
17 May 2024 5,293 2 View
Estimating hourly surface runoff for the last 27 years (1992-2018) using multiple information including rainfall, soil moisture, topography, land use/land cover, soil classification.
06 May 2024 5,629 0 View
I want to ask that, is there any method to forecast EV load data recorded at irregular time intervals? I have read some articles related to EV load data forecasting. In these articles authors has...
05 May 2024 4,346 0 View
Cropping pattern can explain may thing. crop diversity, soil and climate and many more. it is emergence to identify the right cropping pattern. But right cropping pattern identification is much...
24 April 2024 8,541 4 View
Ancient India believed in traditional system of schooling called Gurukul, however sociopolitical developments and modernization of culture gradually wiped away the tradition. This eventually...
20 April 2024 9,414 0 View