As I understand it, feature scaling standardizes or normalizes the data so that all features are on a comparable scale, while feature selection picks out the most useful features.
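To make the distinction concrete, here is a minimal sketch of the two steps in plain Python. The toy data, the helper names (`standardize`, `min_max_normalize`, `select_k_best`), and the correlation-based ranking are all assumptions chosen for illustration, not a claim about how any particular library or MATLAB toolbox does it. In practice a library such as scikit-learn (`StandardScaler`, `MinMaxScaler`, `SelectKBest`) would normally be used instead.

```python
from statistics import mean, pstdev


def standardize(column):
    """Z-score scaling: shift to mean 0 and scale to unit (population) std."""
    mu, sigma = mean(column), pstdev(column)
    return [(x - mu) / sigma for x in column]


def min_max_normalize(column):
    """Min-max normalization: rescale values into the range [0, 1]."""
    lo, hi = min(column), max(column)
    return [(x - lo) / (hi - lo) for x in column]


def abs_correlation(column, target):
    """Absolute Pearson correlation between one feature and the target."""
    mx, my = mean(column), mean(target)
    cov = sum((x - mx) * (y - my) for x, y in zip(column, target))
    norm_x = sum((x - mx) ** 2 for x in column) ** 0.5
    norm_y = sum((y - my) ** 2 for y in target) ** 0.5
    return abs(cov / (norm_x * norm_y))


def select_k_best(features, target, k):
    """Feature selection: names of the k features most correlated with target."""
    ranked = sorted(
        features,
        key=lambda name: abs_correlation(features[name], target),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data: three features on very different scales.
features = {
    "income": [20000.0, 40000.0, 60000.0, 80000.0],
    "age": [25.0, 35.0, 45.0, 30.0],
    "noise": [3.0, 1.0, 3.0, 1.0],
}
target = [0.0, 0.0, 1.0, 1.0]

# Scaling changes the range of each feature but keeps every feature.
scaled = {name: min_max_normalize(col) for name, col in features.items()}
standardized = {name: standardize(col) for name, col in features.items()}

# Selection keeps only a subset of the features.
selected = select_k_best(scaled, target, k=2)
print(selected)
```

Scaling leaves all three columns in place and only changes their ranges, whereas selection drops the column that carries no information about the target. The correlation ranking here is just one simple selection criterion among many.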
In Python, I find that feature scaling alone is enough to get good accuracy.
In MATLAB, however, normalization (as the feature scaling step) has to be combined with optimization to get good accuracy.
Are there any other benefits to these techniques?