I am doing Principle Component Analysis of my proteomics data.

There are two groups, one is healthy control and the other one is patients.

Originally I have 90 biomarkers, and after feeding all the 90 biomarkers for PCA analysis, I have PC1 (23.3%) and PC2 (19.2%).

Then I reduced biomarkers to 23, which I did t-test and found the 23 biomarkers are significanly different between control and patients.

Then after feeding only the 23 biomarkers for PCA analysis, I have PC1 (39.3%) and PC2 (27.9%).

Now my question is, how should I interprate the results?

In the first time, PC1+PC2 is 42.5%; when I reduce the number of biomarkers, PC1+PC2 is 67.2%, so can I draw the conclusion that the two groups are clustered better in the second time than in the first time?

Or, are there any specific number that we could refer to evaluate the quality of a PCA?

More Heshuang Qu's questions See All
Similar questions and discussions