The aim of the data pretreatment (transformation and preprocessing) before of PCA or other multivariate analysis is to remove mathematically the sources of unwanted variations. These variations cannot be removed naturally during the data analysis.
The Principal Component Analysis (PCA) is a method based on spectral analysis of the matrix of coefficients of linear correlation. The principal components are linear combinations of the original variables of the data table analyzed. This descriptive method has been developed for the detection of linear relations between variables.
However, if the relationships between the variables analyzed are not linear, the values of correlation coefficients can be lower. Thus, it is sometimes useful to transform the original variables prior to the Principal Component Analysis to "linearize" these relationships.
One can of course use standard transformations (logarithm, power, Box-Cox) to linearize, thus approaching the multidimensional normal model. Note also that these data transformations change the metric used: Correspondence Analysis is a principal component analysis of the conditional distributions table where the Euclidean distance between profiles is the distance of the Chi-Square on counting data or frequency. Many other linear (eg, discriminant analysis, rank-PCA) or curvi-linear (logarithm, Box-Cox transform) methods can be characterized by an adequate processing of the table data corresponding to a particular metric (eg, metric Mahanalobis, Spearman correlation coefficient).
However, these transformations are not always optimal, so it is advisable to use piecewise polynomial (splines) to better approach the regular functions than the too rigid polynomial functions. This kind of PCA methods are called Spline based PCA (B-spline for positive coefficients) in case of independent transforms of each variable or Kernel PCA. in case of the transform of the whole data table.
For an introduction to this type of spline analyses, see:
Models for Multivariate Data Analysis 1 Introduction, by PC Besse
www.math.univ-toulouse.fr/~besse/pub/mda.ps
and also:
Insight Dreamed of a PCA, by P. C. Besse.
www.math.univ-toulouse.fr/~besse/pub/dreampca.ps
A more complete treatment of the Spline based PCA is provided by:
PCA-based Dimension Reduction for Splines, by A. van der Linde
If you have an experiment where the range of values are significantly different, say voltage, concentration, and pH, normalizing the data before PCA is essential. If you rescale a voltage from 1.0V 2.0V to # of electrons, PC 1 will be # of electrons. If your pH is rescaled to [H+], it will be PC 2. If conc is scaled to number of molecules, it will become the new PC 1. If you normalize your data, the scale does not matter. The PC's will be what they should be.
Some claim that normalizing some data is not needed because the variability is about the same for all of your data. If this is true. PCA will give the same results if you normalize or not. So, normalizing your data will have no effect on the result.
For PCA preparation, when the data is already on a Log scale, such as pH data, wouldn't Log-transforming it be pointless? Or would it be carried out to make it match the other variables?