If OpenAI published the models before the competition, one could verify that they had not been tinkered with.
That assumes, at least, that there is a way for them to prove a model is still the same one. Since the weights are not open, the most basic approach is ruled out.
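
One possible workaround, sketched here purely as an illustration: publish a salted hash of the weights before the competition and reveal the salt and weights to an auditor afterwards. This assumes a trusted auditor eventually gets access to the weights, and it says nothing about whether the committed model is the one actually being served. The function names `commit` and `verify` and the stand-in byte strings are hypothetical, not anything OpenAI has described.

```python
import hashlib
import hmac
import secrets


def commit(weights: bytes, salt: bytes) -> str:
    """Return a SHA-256 digest that commits to the weights without revealing them."""
    return hashlib.sha256(salt + weights).hexdigest()


def verify(weights: bytes, salt: bytes, published_digest: str) -> bool:
    """Check that the revealed weights and salt match the digest published earlier."""
    return hmac.compare_digest(commit(weights, salt), published_digest)


# Stand-in bytes; a real model file would be read from disk instead (assumption).
weights_before = b"\x00\x01\x02\x03" * 1024
salt = secrets.token_bytes(32)  # kept secret until the reveal

# Published before the competition.
published = commit(weights_before, salt)

# After the competition, an auditor checks the revealed weights.
unchanged = weights_before
tampered = weights_before[:-1] + b"\xff"

print(verify(unchanged, salt, published))  # True
print(verify(tampered, salt, published))   # False
```

The salt keeps anyone from confirming guesses about the weights from the digest alone; the commitment only becomes checkable once both salt and weights are disclosed.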