Can someone please guide on cluster analysis to make identity statuses?

Hello dear Rabeeya Farooq ,

Cluster analysis is a statistical technique used to identify groups, or clusters, within a dataset based on similarities between observations. In the context of identity statuses using the Dimensions of Identity Development Scale (DIDS) by Koen Luyckx, cluster analysis can help identify distinct patterns or profiles of identity development among individuals.

Here are the steps to perform cluster analysis and then use K-means clustering to create identity statuses from DIDS scores:

1. Data Preparation: Collect data using the DIDS scale, which typically includes responses to various items related to identity development. Ensure that the data is cleaned and formatted properly before proceeding with the analysis.

2. Standardization: Before conducting cluster analysis, it's essential to standardize the variables (DIDS scores) to ensure that they are on the same scale. Standardization involves converting the raw scores into z-scores, which represent the number of standard deviations away from the mean. This step is crucial because it prevents variables with larger scales from dominating the clustering process.

3. Cluster Analysis: Once the data is standardized, you can perform cluster analysis using a method such as K-means clustering. K-means clustering aims to partition the observations into a pre-specified number of clusters (identity statuses) based on the similarity of their DIDS scores. The algorithm iteratively assigns each observation to the nearest cluster centroid (mean) and updates the centroids until convergence.

4. Determining the Number of Clusters**: Before applying K-means, it's essential to determine the optimal number of clusters. This can be done using techniques such as the elbow method, silhouette method, or hierarchical clustering. These methods help identify the number of clusters that best capture the underlying structure of the data.

5. Interpreting the Clusters: Once the clusters are generated, it's crucial to interpret them to understand the distinct identity statuses they represent. This involves examining the mean DIDS scores within each cluster and identifying the key characteristics or patterns associated with each status.

6. Validation and Interpretation: After identifying the clusters, it's essential to validate them using external criteria or theoretical frameworks related to identity development. This ensures that the clusters are meaningful and interpretable in the context of identity theory.

7. Reporting and Publication: Finally, document the results of the cluster analysis, including the method used, the number of clusters identified, and the characteristics of each cluster. Consider publishing your findings in peer-reviewed journals to contribute to the scientific understanding of identity development.

Good luck and Happy clustering !

Samawel JABALLI

Inès François

K-mean is used for numerical value only. I don't know the DIDS scale but if outputs are qualitative like a survey. You will need to convert it into numeric variables (perhaps percentages). If you obtain percentages, you won't need to use z scores. However, to avoid multicolineraity effect when you will run your clustering analysis, you will need to introduce n variables minus 1.

https://www.researchgate.net/post/For_a_K-mean_cluster_analysis_when_variables_are_percentages_have_we_to_use_n-1_variables

Rabeeya Farooq

Thank you for the input. It is a quantitative study where we need to do hierachal cluster analysis and then K means

can you tell me how the validation part is done?

Can anyone send me the detailed protocol of lentiviral titer determination and its invitro transduction into Hela cell line or MSCs?

I plot a graph between absolute value of log j (x-axis) and overpotential (y-axis) in origin by tafel extrapolation? is there need for any adjustment?

My Cdl results for HER in basic media (1M KOH) are not satisfactory. Is there any possible solution to get correct one?

How to transduce lentiviral vector into ThP1 macrophages and Mesenchymal stem cells?

Novel and future lubricants and additives for hybrid electric vehicles?

Can anyone please assist me in how to perform GITT both for batteries and supercapacitors?

While constructing a microbiome from soil isolated bacteria, which aspects are to be focused???

How can we write the system GMM model (mathematical equation)?

Which is the simplest method to obtain Silicon nanowires?

Hi I am a new in VASP. I have done some electronic calculations like DOS and band structure. Now i want to do calculate vibrational DOS of LI7P3S11?

How to learn more about SPSS and its Application?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

How are iso-frequency contours plotted?

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Why does my protein refolded to beta sheet during thermal denaturation analysis?