what is the use of applying clustering before classification?
Hi Jabbar,
I advise you to take a look at this URL:
https://www.quora.com/In-data-mining-before-any-classification-should-we-always-do-a-clustering-first-on-the-training-dataset
HTH.
Depends on the number of samples you have and their dimensions.
If dimensions>>number of samples, then it would be good to do PCA or other types of dimensionality reduction before classification
You may also check a previous discussion:
https://www.researchgate.net/post/Advantages_of_using_Integration_of_Hierarchical_Agglomerative_Clustering_and_C_45_Decision_Tree?_tpcectx=qa_overview_following&_trid=VQZU7ONY9CdKhgOsNYbfBtzw_
Hi, A.J.,
By now You got some reasonable answers; in particular, the link given to the webpage where there is much to read.
I will try to give the anwer in own words.
CLASSIFICATION is used when one knows allready classes into which the data
vectors should be classified. The classes are fixed, e.g. 3 kinds of health state.
How to make the classification? One needs for this an classification algorithm.
The algorithm may be derived in a variety of ways. Usually one obtaines
them using discriminant analysis.
CLUSTERING. If i have a new data set and do not know (much) about the classes
into which eventually the data set might be subdivided. Then I try to find
some 'natural' clusters of the data. Usually the user has to define K. the number
of clusters into which he wants subdivide his data. E.g., A plethora of K-means algorithms may be used for that purpose. The applied algorithm works
in an unsupervised way according to some mathematical priciple. The algorithm
has no knowledge about eventually classes the data points belong to,
The algorithm works (makes clustering) about his own principles, e.g. the most similar data vectors are linked together.
Finally, I obtain K clusters.
Now my job is to find out, if the obtained clusters are meaningful for me.
Myself, I am employing for this purpose some unsupervised graphical methods,
for example Kohonen's self-organizing maps. The nb of variables/individuals
does not matter much here.
Cluster analysis and Discriminant analysis include a wide range of topics.
and combined with data may provide exciting results of the elaboration.
Hope to hear on Your results
Good luck
A.B.
thanks one and all for your valuable suggestions
we want to organize conference -proceedings should be published in elsevier/science direct .Can any one send me the contact address of elsevier india
31 December 2017 9,600 3 View
shall i get the resource material on artificial intelligence and remote sensing
06 July 2017 4,307 3 View
03 April 2017 3,308 0 View
what is the new direction to closed frequent itemset mining? shall I get advanced papers in closed frequent item set mining
01 February 2017 8,904 1 View
31 December 2016 9,728 0 View
08 September 2016 1,682 0 View
can any body send the list of some good conferences for computers -springer and elsevier in 2016/2017
08 September 2016 1,581 2 View
Dear all will any body suggest good journals in networks domain.Preferably unpaid SCI indexed journals.
07 August 2016 4,008 2 View
can anyone help in find out the steps in average one depende algorithm in data mining
07 August 2016 7,775 1 View
I Need Gure KDD data set for IDS.will any one send me the data set in ARFF format
07 August 2016 4,367 1 View
What Characteristics makes CNN work better?
03 March 2021 1,458 4 View
i am try to classify the x-ray images. During classification , can i block unwanted images (except x-ray image).
03 March 2021 7,100 1 View
The range of forms and sizes of cavities in rock is very big. Is there a classification scheme for rock cavities regarding their size and shape?
02 March 2021 4,511 3 View
Do you have some recommendations approaches on Image processing and classification of plant diseases using TensorFlow/Keras?
02 March 2021 1,493 4 View
I want to differentiate between normal and abnormal movement. For this reason, I am looking forward to this type of dataset.
01 March 2021 9,485 1 View
I want to work on medical image(disease) detection and classification. Where can I found Image data sets?
24 February 2021 3,130 3 View
I am doing a project on classification of EEG signals of Alzheimer's disease and control groups. Previously, I extracted the alpha power of the EEG signals and found out the strengths. I want a...
18 February 2021 8,502 2 View
Hi, In EEG datasets, I have taken 200 samples X32 channels X 300 trials. For this particular scenario, I like to do binary classification .can you please guide me on whether shall I proceed in...
17 February 2021 5,132 8 View
I have a 3D Image of size 100x100x100. I have features (feature dimension 25) extracted for each pixel by considering a small pacth around it. So the input to 1D CNN is 100x100x100x25 reshaped to...
17 February 2021 941 4 View
I am working on a project to identify sensitive information from text and I am looking into the below categories: -adult content -health -politics -crime Can someone suggest an open source tool...
16 February 2021 4,844 4 View