I am using MG-RAST to analyse metagenome datasets and have annotated the functional genes against the Subsystems, KO, NOG, and COG databases. However, I don't know enough about how these databases differ to choose the most reliable one for my dataset. My samples are environmental, and I am trying to determine the microbial metabolic potential for hydrocarbon degradation.