I'm attempting to do feature selection on the variables on my dataset, but unfortunately the overwhelming majority of them are categorical. Currently they've all been encoded into binary variables, meaning that using something like p values will tell me only whether a specific level of a variable is important - not whether the variable itself is. Is there a recommended or best practice to approach this kind of problem?