Hi, recently I'm building classification models to classify between 3 classes (1,2,3). I have data for 3 species (A, B and C). I combined species A and B for training, validation and testing. The model shows a quite high accuracy (~90%). However, when I used that model to predict 3 classes in species C, the model perform badly, accuracy ~50%. I guess it's mismatch in data problem but I don't know what to do. I cannot use species C for training because I want my model to be generalized, which can predict 3 classes in many new more species in the future and it cannot be re-train every time it sees a new species. I also visualized each feature between species A+B and C and they're looked not so different. What should I do in this case? Thank you!

Similar questions and discussions