I am trying to implement statistical analysis on some data. I have the NCEP2 data set for 4 climate variables and the ERA40 data set for the same variables for a region. Given a grid of 9x10, I have 90 locations and so I have 360 variables for a particular day. I have 12000+ such days. I've implemented PCA on this. What I cannot understand is :

  • How can I get the predicted values of more than one location when I use all variables as the predictors for any location?
  • Am I doing this right or am I missing something?

I don't need the exact implementation. I just want to be clear about how the data flows from the grid format to the output. I'm trying to implement the following papers:

Similar questions and discussions