I have a dataset which contains 110 data points. This data set is divided in to three categories, say, A,B and C and each categories are sub-categorized into 3 to 5 categories: A1,A2,A3,A4,A5, B1,B2,B3, and C1,C2,C3, and C4. I split this dataset into two subsets. How would I know if each of the categories are well represented by the subsets and if the subsets represent the dataset well?

Thanks

More Reuben James Quintal Buenafe's questions See All
Similar questions and discussions