I have seen literature reporting the use of statistical techniques like Mann-Whitney U test and Cohen's D effect size to identify most suitable subset of features for a classifcation problem. But can't this be achieved by employing feature selection and classification using wrapper method? Is there any advantage in using both i.e. aply statistical techniques first and further employ wrapper method on the derived subset of features?