I have a dataset with nearly 1000 participants. I am trying to understand the factor structure of a 23-item COVID scale before using it in my further analyses. I then plan to conduct a CFA to confirm the structure obtained from the EFA (principal axis factoring with varimax rotation).
According to the literature, it is advisable to randomly split the dataset into two halves, run the EFA on the first half, and then run the CFA on the second half. That part is fine. However, I have a major problem. I randomly split the dataset in two (40%), ran the EFA on the first half, and obtained a structure with 3 factors. Then my data file accidentally closed, so I had to run the random split again on the main dataset (40%). When I ran the EFA on the first half again, I obtained a totally different structure with 4 factors. I have tried this again and again, and every time I randomly split (40%) the main dataset I obtain a different factor structure. I am very confused and don't know how to interpret this. I would appreciate your answers.
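To make the splitting step concrete, here is a minimal sketch of a seeded 40% random split. It is only an illustration in Python, not the software I actually used. The seed value, the round figure of 1000 rows, and the function name split_indices are all assumptions. Fixing the seed would at least let me recover the same split if the file closes again.

```python
import random

def split_indices(n_rows, efa_fraction=0.40, seed=2024):
    """Return (efa_rows, cfa_rows): a reproducible random split of row indices.

    efa_fraction and seed are illustrative assumptions, not values from my analysis.
    """
    rng = random.Random(seed)           # fixed seed -> same shuffle on every run
    order = list(range(n_rows))
    rng.shuffle(order)
    n_efa = round(n_rows * efa_fraction)
    # The first 40% of the shuffled rows go to the EFA, the rest to the CFA.
    return sorted(order[:n_efa]), sorted(order[n_efa:])

efa_idx, cfa_idx = split_indices(1000)
again_efa, again_cfa = split_indices(1000)

print(len(efa_idx), len(cfa_idx))   # 400 600
print(efa_idx == again_efa)         # True: rerunning gives the same subsample
```

With a fixed seed the same 40% subsample comes back on every run. That only makes one split repeatable; it does not explain why different random subsamples give different numbers of factors.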
Evren Morgul
PhD