Kaggle and other platforms and communities have been holding deep learning/machine learnin/data mining online competitions to promote related research and commercialization. In such kind of competitions, it is often stateted that, e.g., third party dataset or the testing partition of a specific dataset, is not allowed for training.
However, is there any way to detect if a team has used forbidden data?