I currently have a cancer registry dataset containing demographic and clinical data collected over 10 years. It includes both continuous and discrete variables.
One goal is to find meaningful clusters within this dataset. These clusters would be characterized by demographic variables such as age, gender, ethnicity, occupation, etc. The study also aims to identify which variables most strongly characterize these clusters. The clinician collaborator is interested in assessing risk and/or conducting predictive analysis on this dataset.
So, how can we implement hierarchical clustering for this dataset, given that it mixes continuous and discrete variables?
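To make the question concrete, here is a minimal sketch of one possible starting point in Python. It assumes that a Gower-style distance (range-normalised differences for continuous variables, simple mismatch for discrete ones) is acceptable for this data, and that average linkage is a reasonable choice. The column names, the toy records, and the `gower_distance` helper are all hypothetical and are not taken from the registry.

```python
import numpy as np
import pandas as pd
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform


def gower_distance(df, numeric_cols, categorical_cols):
    """Pairwise Gower-style distance for mixed numeric/categorical columns (hypothetical helper)."""
    n = len(df)
    total = np.zeros((n, n))
    for col in numeric_cols:
        x = df[col].to_numpy(dtype=float)
        value_range = x.max() - x.min()
        if value_range > 0:
            # Range-normalised absolute difference, so each term lies in [0, 1].
            total += np.abs(x[:, None] - x[None, :]) / value_range
    for col in categorical_cols:
        x = df[col].to_numpy()
        # Simple mismatch: 0 if the categories are equal, 1 otherwise.
        total += (x[:, None] != x[None, :]).astype(float)
    # Average over all variables (equal weights assumed).
    return total / (len(numeric_cols) + len(categorical_cols))


# Hypothetical toy records standing in for the registry data.
records = pd.DataFrame({
    "age": [34, 36, 71, 68, 35, 70],
    "gender": ["F", "F", "M", "M", "F", "M"],
    "occupation": ["teacher", "teacher", "farmer", "farmer", "nurse", "farmer"],
})

dist = gower_distance(records, ["age"], ["gender", "occupation"])

# linkage expects a condensed distance vector, hence squareform.
Z = linkage(squareform(dist, checks=False), method="average")

# Cut the dendrogram into (at most) two clusters; the number 2 is an arbitrary choice here.
labels = fcluster(Z, t=2, criterion="maxclust")
print(labels)
```

Average linkage is used in this sketch because Ward linkage is defined for Euclidean distances, which a Gower-style distance is not. Comparing per-cluster summaries of each variable (for example with `records.groupby(labels)`) would be one simple way to start looking at which variables distinguish the clusters.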