Given the lack of formal methods for evaluating and comparing cognitive architectures in agents, what alternatives are there? Are competitions such as the BotPrize appropriate? Or do we have to test the architectures empirically against humans (i.e., with classic psychological experiments)?