Galileo has deduced the law of gravity (1/2 g t^2) by observing balls rolling on an inclined plane. However, without the occam razor, there is no reason to infirm the following law: until today the law of gravity is 1/2 g t^2, and tomorrow the law of gravity will be -1/2 g t^2. This law satisfies the criterium of Karl Popper. It is a scientific law. Without the occam razor we have to wait tomorow hoping that I was wrong and Galileo was right.
We have the same problem for machine learning. Given a set of data, and a powerful learning machine such as SVM or ANN, there are an infinity of solutions which fit the data. To make the problem well posed, I need to find the simplest that fit the data.