02 February 2015

I've got ~1M points of {-1, 1}-labeled data in ~100 dimensions, with each dimension rescaled to the range [0, 1]. Can anyone recommend some quick-and-easy heuristics for setting the kernel bandwidth of an RBF SVM? I'm sure there must be some good heuristics out there based on typical between-point distances. Thanks much!

EDIT: Googling around turned up Jaakkola's heuristic, which sets the bandwidth from the median distance between points of opposite class. Other recommendations appreciated!
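For anyone wanting to try the median-distance idea, here is a minimal sketch, not a reference implementation. It assumes the data is a NumPy array `X` with labels `y` in {-1, 1}, estimates the median on a random subsample (computing all pairs for ~1M points is impractical), and assumes the kernel is written as exp(-gamma * ||x - x'||^2) with gamma = 1 / (2 * sigma^2). The function names and the `n_per_class` parameter are made up for illustration. Some write-ups of the heuristic use the distance from each point to its nearest opposite-class neighbor instead of all cross-class pairs, so treat this as one variant.

```python
import numpy as np

def median_opposite_class_distance(X, y, n_per_class=500, seed=0):
    """Median Euclidean distance between subsampled points of opposite class.

    Illustrative helper: n_per_class and seed are assumptions, not part of
    any published recipe.
    """
    rng = np.random.default_rng(seed)
    pos = X[y == 1]
    neg = X[y == -1]
    # Subsample each class without replacement to keep the distance matrix small.
    pos = pos[rng.choice(len(pos), size=min(n_per_class, len(pos)), replace=False)]
    neg = neg[rng.choice(len(neg), size=min(n_per_class, len(neg)), replace=False)]
    # Squared distances via ||a||^2 + ||b||^2 - 2 a.b for every (pos, neg) pair.
    sq = (pos ** 2).sum(axis=1)[:, None] + (neg ** 2).sum(axis=1)[None, :] - 2.0 * pos @ neg.T
    np.maximum(sq, 0.0, out=sq)  # clip tiny negatives caused by rounding
    return float(np.median(np.sqrt(sq)))

def gamma_from_sigma(sigma):
    """Convert a bandwidth sigma to gamma, assuming k = exp(-gamma * ||x - x'||^2)."""
    return 1.0 / (2.0 * sigma ** 2)
```

The resulting gamma is probably best used as the center of a small grid (say, a few powers of two on either side) for cross-validation rather than as a final value.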
