Hello researchers, can you please suggest some methods or tools to analyse the clustering of organic molecules in an MD simulation?

Fatimah Altuhaifa

Hello,

You can use hierarchical clustering, following these steps.

Perform Hierarchical Clustering: Apply the hierarchical clustering algorithm to your dataset of organic molecules near the protein, for example the centre-of-mass coordinates of each molecule in a trajectory frame. This algorithm builds a hierarchical structure of clusters by iteratively merging or splitting clusters based on their similarity.

Determine the Number of Clusters: Use a suitable method, such as inspecting the dendrogram or applying a criterion like the elbow method or the silhouette score, to determine the optimal number of clusters in the hierarchical structure. This will help you define distinct clusters for further analysis.

Assign Cluster Labels: Based on the identified number of clusters, assign cluster labels to each organic molecule in your dataset. Each molecule will be associated with the label of the cluster it belongs to.

Calculate Cluster Sizes: Count the number of organic molecules in each cluster to determine the cluster sizes. This step will provide you with the count or size of each individual cluster.

Probability Distribution: Plot a probability distribution graph to visualize the frequency or probability of different cluster sizes. The x-axis represents the cluster sizes, and the y-axis represents the probability or frequency of occurrence of each cluster size.

Normalize the Distribution: Normalize the probability distribution by dividing the frequency or count of each cluster size by the total number of clusters. This normalization step will give you the relative probability of each cluster size, facilitating comparison and interpretation.

Plot the Graph: Generate a graph that depicts the probability distribution of different cluster sizes. You can use a bar plot, histogram, or line plot to visualize the distribution.

Interpretation: Analyze the probability distribution graph to understand the different cluster sizes and their probabilities. Look for peaks or modes in the distribution that indicate the most common or dominant cluster sizes. Calculate statistical measures such as mean, median, and standard deviation to further characterize the cluster size distribution.
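The steps above can be sketched in Python roughly as follows. This is only a minimal illustration, not a definitive workflow: it assumes each molecule is reduced to one point (for instance its centre of mass), it cuts the tree at a distance cutoff instead of a fixed number of clusters, it ignores periodic boundary conditions, and the coordinates, the `cutoff` value, and the helper name `cluster_size_distribution` are made up for the example.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def cluster_size_distribution(coords, cutoff):
    """Cluster molecule centres; return labels, cluster sizes, and P(size)."""
    # Build the hierarchy (single linkage, Euclidean distance).
    tree = linkage(coords, method="single")
    # Cut the tree at the chosen distance to get flat cluster labels.
    labels = fcluster(tree, t=cutoff, criterion="distance")
    # fcluster labels start at 1, so drop the empty bin 0.
    sizes = np.bincount(labels)[1:]
    # Count how many clusters have each size, then normalise
    # by the total number of clusters.
    size_values, counts = np.unique(sizes, return_counts=True)
    probabilities = counts / counts.sum()
    distribution = dict(zip(size_values.tolist(), probabilities.tolist()))
    return labels, sizes, distribution


# Hypothetical centres for one frame: two trimers and two isolated molecules.
coords = np.array([
    [0.0, 0.0, 0.0], [0.3, 0.0, 0.0], [0.0, 0.3, 0.0],
    [5.0, 5.0, 5.0], [5.3, 5.0, 5.0], [5.0, 5.3, 5.0],
    [10.0, 0.0, 0.0],
    [0.0, 10.0, 0.0],
])

labels, sizes, distribution = cluster_size_distribution(coords, cutoff=0.5)
print("cluster sizes:", sorted(sizes.tolist()))
print("P(size):", distribution)
print("mean size:", sizes.mean(), "std:", sizes.std())
```

For a real trajectory you would repeat this for every frame and pool the cluster sizes before normalising; the resulting dictionary can then be drawn as a bar plot of probability against cluster size.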

There are other types of clustering, such as k-means clustering, but k-means is considered a traditional method. If you are doing these steps for a paper, it would be better to use a method other than k-means, or to run k-means alongside hierarchical clustering and Gaussian Mixture Models (GMM) and then compare the results of all three methods.

If you are doing this for homework, just use k-means.
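The comparison of k-means, hierarchical clustering, and GMM mentioned above could look something like the sketch below. It assumes scikit-learn is available and uses the silhouette score as one possible way to compare the methods; the synthetic points, the choice of two clusters, and the helper name `compare_methods` are illustrative assumptions only.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.metrics import adjusted_rand_score, silhouette_score
from sklearn.mixture import GaussianMixture


def compare_methods(points, n_clusters):
    """Run three clustering methods; return {name: (labels, silhouette)}."""
    models = {
        "kmeans": KMeans(n_clusters=n_clusters, n_init=10, random_state=0),
        "hierarchical": AgglomerativeClustering(n_clusters=n_clusters, linkage="ward"),
        "gmm": GaussianMixture(n_components=n_clusters, random_state=0),
    }
    results = {}
    for name, model in models.items():
        labels = model.fit_predict(points)
        results[name] = (labels, silhouette_score(points, labels))
    return results


# Synthetic stand-in for molecule centres: two well-separated aggregates.
rng = np.random.default_rng(0)
points = np.vstack([
    rng.normal(loc=0.0, scale=0.5, size=(20, 3)),
    rng.normal(loc=10.0, scale=0.5, size=(20, 3)),
])

results = compare_methods(points, n_clusters=2)
for name, (labels, score) in results.items():
    print(f"{name}: silhouette = {score:.3f}")

# Adjusted Rand index measures how well two labelings agree (1.0 = identical partition).
agreement = adjusted_rand_score(results["kmeans"][0], results["gmm"][0])
print("k-means vs GMM agreement:", agreement)
```

If the three methods give similar silhouette scores and near-identical partitions, the clustering is probably robust to the choice of method; large disagreements are worth reporting and investigating.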