I have a dataset containing 110 data points. The dataset is divided into three categories, say A, B, and C, and each category is further divided into 3 to 5 subcategories: A1, A2, A3, A4, A5; B1, B2, B3; and C1, C2, C3, C4. I split this dataset into two subsets. How can I tell whether each category and subcategory is well represented in the subsets, and whether the subsets represent the full dataset well?
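
To make the question concrete, here is a rough sketch of the kind of check I have in mind: compare the share of each subcategory in a subset with its share in the full dataset, and report the largest gap. The function names and the toy labels are made up for illustration (they are not my real data), and I am assuming each subset is non-empty and drawn from the full dataset. I do not know whether this is a statistically sound criterion, which is part of what I am asking.

```python
from collections import Counter


def proportions(labels):
    """Return {label: share of items carrying that label}."""
    counts = Counter(labels)
    total = len(labels)  # assumed to be > 0
    return {label: n / total for label, n in counts.items()}


def max_proportion_gap(full, subset):
    """Largest absolute difference in label share between full set and subset."""
    p_full = proportions(full)
    p_sub = proportions(subset)
    # A label missing from the subset counts as a share of 0.
    return max(abs(p_full[label] - p_sub.get(label, 0.0)) for label in p_full)


# Hypothetical toy labels, not the real 110-point dataset.
full = ["A1"] * 4 + ["A2"] * 4 + ["B1"] * 2
balanced = ["A1", "A1", "A2", "A2", "B1"]  # same shares as the full set
skewed = ["A1"] * 4 + ["A2"]               # over-represents A1, drops B1

print(max_proportion_gap(full, balanced))
print(max_proportion_gap(full, skewed))
```

A gap near 0 would mean the subset mirrors the full dataset's subcategory mix; a large gap, or a subcategory missing from a subset entirely, would suggest the split is not representative.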
Thanks