Usually PCA is used as a dimensionality reduction step before CCA for fusion in most applications. However, care must be taken as PCA retains components that have most variance in one data set and therefore, these might not be the same components that have most correlation between two sets. A joint PCA-CCA approach, thus, is a better solution. However, for some applications, PCA as a dimensionality reduction step might not always work and therefore other techniques such as sparse CCA might provide more interpretable results.
So the reason of using PCA before CCA, is the low performance of the CCA technique with high dimensional data. Therefore, the PCA used to reduce the dimensionality before the fusion process