I have a land-use classification and want to validate it externally, i.e. with reference data that are independent of the training data.

Because one of my three classes is very rare, I cannot rely on a simple random sample for the validation: it would probably contain too few points from that class.

How do I choose the sample size for a stratified validation? Does the sample need to be proportional to the size of each stratum, or can I use the same sample size for all strata?

As my rare class is also my most important class, I do not want to under-represent it, but I am also afraid of introducing a bias without being aware of it.

I cannot find literature that gives a clear answer, or maybe I just got confused by the topic. Is there a fixed number that should be used? Or is there a required ratio between the total number of classified pixels and the number of pixels that have to be validated per class?

I would be grateful for any hints or literature recommendations!
