Can I calculate the scores of new test subjects from an existing PCA?

Yes, Engelbert Buxbaum, you can calculate the component scores for new test subjects using the existing loading matrix from your principal component analysis (PCA). Here’s how:

Standardize the New Data: Use the means and standard deviations from your original dataset to standardize the new data.

Project onto Principal Components: Multiply the standardized new data by the loading matrix from the original PCA to get the component scores.

To streamline this process, you might find OriginPro software particularly useful. It offers powerful tools for PCA and can easily handle the standardization and projection steps for new data.

Richroy Richard

Yes, you can use the existing loading matrix from your principal component analysis (PCA) to calculate the component scores for new test subjects. Here’s how you can do it:

Center the new data: Ensure the new test subjects' data is centered using the same means as the original dataset. If you originally standardized the data (i.e., scaled to have zero mean and unit variance), you should standardize the new data in the same way using the means and standard deviations of the original dataset.

Calculate component scores: Use the loading matrix from the original PCA to transform the new data. The component scores for the new subjects can be calculated by multiplying the centered new data matrix with the loading matrix (principal components).

Here is a step-by-step outline of the process:

Step-by-Step Process

Center (and potentially scale) the new data:Calculate the mean (and standard deviation, if scaling) of the original data. Subtract the mean (and divide by the standard deviation, if scaling) from the new data to center (and scale) it.

Calculate the component scores:Let XnewX_{\text{new}}Xnew be the matrix of centered new data. Let PPP be the loading matrix from the original PCA. The component scores TnewT_{\text{new}}Tnew for the new data can be obtained by: Tnew=Xnew⋅PT_{\text{new}} = X_{\text{new}} \cdot PTnew=Xnew⋅P

Example

Suppose you have the following:

μ\muμ (mean vector of original data)
σ\sigmaσ (standard deviation vector of original data, if you standardized)
PPP (loading matrix from PCA)

Let XnewX_{\text{new}}Xnew be the new data matrix.

Xnew, standardized=Xnew−μσX_{\text{new, standardized}} = \frac{X_{\text{new}} - \mu}{\sigma}Xnew, standardized=σXnew−μCenter the new data:Xnew, centered=Xnew−μX_{\text{new, centered}} = X_{\text{new}} - \muXnew, centered=Xnew−μIf standardized:

Tnew=Xnew, standardized⋅PT_{\text{new}} = X_{\text{new, standardized}} \cdot PTnew=Xnew, standardized⋅PCalculate the component scores:Tnew=Xnew, centered⋅PT_{\text{new}} = X_{\text{new, centered}} \cdot PTnew=Xnew, centered⋅PIf standardized:

This approach allows you to project the new subjects' data onto the principal components derived from the original dataset, thus enabling you to analyze the new subjects in the context of the existing PCA model.

Dear Fabrizio, may there be any interest, to measure position and angle deformation of bridges using a new 'Vignetting field stop procedure"?

How do I calculate the rotated score matrix after PCR?

How to learn more about SPSS and its Application?

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

Baseline drift in HPLC? What causes this?

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

How to define an anisotropic material with asymmetric elastic compliance/stiffness matrix in ANSYS APDL?

Is it true that $\det(V(A))$ may be only $\pm 1$, depending on $n$, for the last symmetric tridiagonal matrix $A$?

How are iso-frequency contours plotted?

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?