In sklearn.svm.SVC, the default value of the gamma parameter is 'scale', i.e. gamma = 1 / (n_features * X.var()). What is the rationale behind this default, and why does it work so well? On my dataset, at least, an extensive grid search over gamma could not beat it.
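To make the formula concrete, here is a minimal sketch of how the 'scale' value can be reproduced by hand and checked against the built-in default. The synthetic data from make_classification and the variable names (gamma_scale, clf_default, clf_manual) are assumptions for illustration only, not my actual dataset.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import SVC

# Synthetic stand-in data (an assumption; not the dataset from the question).
X, y = make_classification(n_samples=200, n_features=5, random_state=0)

# gamma='scale' as stated in the question: 1 / (n_features * X.var()),
# where X.var() is the variance over all entries of X.
n_features = X.shape[1]
gamma_scale = 1.0 / (n_features * X.var())

# Fit once with the default string and once with the hand-computed number.
clf_default = SVC(kernel="rbf", gamma="scale").fit(X, y)
clf_manual = SVC(kernel="rbf", gamma=gamma_scale).fit(X, y)

# If the formula is right, both models give the same decision values.
same = np.allclose(clf_default.decision_function(X),
                   clf_manual.decision_function(X))
print(f"gamma_scale = {gamma_scale:.6f}, identical decisions: {same}")
```

Because X.var() is taken over the whole matrix, rescaling every feature by a constant c changes gamma by a factor of 1 / c**2, so the default adapts to the overall scale of the data.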