I have been working with metabolomics data and I am not sure about the principles behind the analysis. The first step is to conduct a PCA. The literature says this is done to test the overall stability of the system, but I do not understand what that means. Then PLS-DA can be used to select some metabolites. A permutation test for cross-validation is then used to check whether over-fitting exists. I really do not understand this last part. Why does an R2 of less than 0 mean there is no over-fitting? What if R2 is above 0 on the figure after permutation? I also do not understand how this permutation is carried out to cross-validate the model. Could anyone explain it to me? Many thanks.
