Clustering and Classification?

Samer Sarsam

Yes, this can be done.

Kind regards

Dr. Samer Sarsam

Shafagat Mahmudova

Dear Jebari Chaker ,

Classification and clustering are two pattern-identification methods used in machine learning. Although the techniques are similar in some respects, they differ in one key way: classification assigns objects to predefined classes, while clustering identifies similarities between objects and groups them by the characteristics they share and that distinguish them from other groups. These groups are known as "clusters".

https://blog.bismart.com/en/classification-vs.-clustering-a-practical-explanation

Regards,

Shafagat

Jebari Chaker

Thank you Samer and Shafagat for your answers.

I am worried about the classification performance.

The classification will be based on the labels produced by the clustering step and, as we all know, clustering depends on the data distribution.

I mean, is there any difference between human labelling and labelling by clustering?

Regards

Usually, after the instances/records have been grouped by a clustering algorithm, the clusters need to be labeled manually. All instances in a given cluster then share a single label (class).

Once you have fully labeled data, you can train a classifier, e.g., an SVM, and evaluate its prediction quality using, for instance, stratified 10-fold cross-validation. You can then assess the SVM's performance with evaluation metrics such as accuracy, the Kappa statistic, the ROC curve, etc.
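The steps above can be sketched roughly as follows. This is only an illustration, assuming scikit-learn, synthetic data from `make_blobs` in place of real records, and made-up cluster names (`class_a`, `class_b`, `class_c`) standing in for the labels a person would choose after inspecting each cluster.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import accuracy_score, cohen_kappa_score
from sklearn.model_selection import StratifiedKFold, cross_val_predict
from sklearn.svm import SVC

# Synthetic stand-in for real, unlabeled data (assumption for this sketch).
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=1.0, random_state=0)

# Step 1: group the instances with a clustering algorithm.
cluster_ids = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)

# Step 2: a person inspects each cluster and names it (hypothetical names).
cluster_names = {0: "class_a", 1: "class_b", 2: "class_c"}
y = np.array([cluster_names[c] for c in cluster_ids])

# Step 3: evaluate an SVM with stratified 10-fold cross-validation.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
y_pred = cross_val_predict(SVC(kernel="rbf"), X, y, cv=cv)

accuracy = accuracy_score(y, y_pred)
kappa = cohen_kappa_score(y, y_pred)
print(f"accuracy={accuracy:.3f}  kappa={kappa:.3f}")
```

Note that these scores only measure how well the SVM reproduces the cluster-derived labels. They say nothing about whether the clusters themselves match the categories a human would assign.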

HTH.

So in the end, manual labelling is still there, and the process cannot be fully automatic, can it?

By the way I am planning to use LDA (topic modeling) to label my data.

Yes, you are right. The manual labeling needs to be done at a very early stage. The machine learning model then has to learn the human labeling strategy.

LDA is an unsupervised learning algorithm that can be used for topic modeling. You can extract a collection of topics from texts using LDA, then label these topics manually using domain knowledge.

Thank you so much.

I just want to make sure that there is no way to label data automatically.

Automatic labeling depends on manual labeling, which must be done first. We then train machine learning algorithms to learn our labeling so that they can label future data automatically.

So in the end, I can say that manual labelling is more trustworthy than automatic labelling, can't I?

Also, I can say that even manual labelling is not 100% accurate.

Manual labeling depends on how you use your knowledge to label your data. For instance, you can label your data as either 'cat' or 'dog' because you know that these categories are in your data.

If I use topic modelling to find labels and then assign those labels to the data, is that valid?

Yes, topic modeling techniques (e.g., the classical Latent Dirichlet Allocation (LDA) algorithm) allow you to extract text-related topics. Once you have these topics, you can understand them and assign appropriate labels manually to each topic.
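One possible way to carry the manually chosen topic labels over to the documents is to give each document the label of its dominant topic. The sketch below assumes scikit-learn, a made-up corpus, and placeholder topic names (`topic_0_name`, `topic_1_name`) that stand in for whatever labels a person would pick after reading the topics.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Tiny illustrative corpus (assumption; replace with your own texts).
docs = [
    "the cat chased the mouse around the house",
    "dogs and cats are popular household pets",
    "the stock market fell sharply on monday",
    "investors sold shares as the market dropped",
]

counts = CountVectorizer(stop_words="english").fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0)

# Each row is one document's distribution over the topics.
doc_topic = lda.fit_transform(counts)

# The dominant topic is the one with the highest weight for that document.
dominant = doc_topic.argmax(axis=1)

# Hypothetical names a person assigned after reading each topic's top words.
topic_names = {0: "topic_0_name", 1: "topic_1_name"}
labels = [topic_names[t] for t in dominant]

for text, label in zip(docs, labels):
    print(f"{label}: {text}")
```

Documents whose topic weights are nearly equal are the ones most worth checking by hand before the labels are used for training.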

So automatic labelling is not valid? Am I right?

Automatic labeling depends on the manual labeling (i.e., the training set) that the machine learning algorithms learn from (try several algorithms to find the one with the best predictive capability). So make sure your manual labeling is as accurate as possible.
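Trying several algorithms on the same labeled data might look like the sketch below. It assumes scikit-learn, synthetic data from `make_classification` in place of a real manually labeled training set, and an arbitrary choice of three candidate classifiers.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.svm import SVC

# Synthetic stand-in for a manually labeled training set (assumption).
X, y = make_classification(
    n_samples=300, n_features=10, n_informative=5, random_state=0
)

# Candidate algorithms; the selection here is illustrative only.
candidates = {
    "svm": SVC(kernel="rbf"),
    "logistic_regression": LogisticRegression(max_iter=1000),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# Same stratified 10-fold splits for every model, so the scores are comparable.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
scores = {
    name: cross_val_score(model, X, y, cv=cv, scoring="accuracy").mean()
    for name, model in candidates.items()
}

best = max(scores, key=scores.get)
for name, score in scores.items():
    print(f"{name}: {score:.3f}")
print("best:", best)
```

Whichever model scores highest, the comparison is only as reliable as the labels it was scored against.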

Thank you Samer.

My concern is:

What if the manual labelling is not accurate?

In that case, how can we judge the automatic labelling?

You are welcome, Chaker.

Wrong manual labeling leads to inaccurate predictions. In that case, there is no point in judging the automatic labeling (i.e., the predictions).