I'm using a genetic algorithm to solve a search problem.
Within a generation, I calculate the fitness value of the individuals by comparing the individuals with each other (Pairwise Comparison). The comparison is made by human users. Hence, this is a case of Interactive Genetic Algorithm. It means that there is not a real mathematical function: the objective is to increase people's level of appreciation or agreement.
The fitness value of each individual is calculated as the outcome of the pairwise comparison. The fitness value within each generation is normalised between 0-1. Plus, the whole 0-1 range is used. Meaning that in each generation there will be the best individual with fitness value at 1 and the worst individual with fitness value at 0. (It also means that the average fitness value of each generation will be always around 0.5.)
As usual, individuals with the higher fitness value are elected to migrate to the new generation. However, in the new generation, the comparison of the elected individuals with the new ones will yield to different fitness values.
Let's say that the fitness function is relative within a generation, hence not comparable across generations.
This makes it hard to check if the evolution is bringing to a better population.
Is there a methodology, or best practice, to check for the convergence genetic algorithm where the fitness function is valid relatively only within a single generation?