Pardon, I am novice when it comes to handling meteorological time series data. My questions are - what is the best statistical approach for examining the similarities between two time series data sets, as well as accuracy of the forecasted data against the actual?
For instance, I have existing forecasted (not training) mean temperature time series data drawn from an open source API (OpenWeatherMap) which provides future 3-hourly predictions all way up to 5 days for a city. For instance, if data is drawn from the API today , I will obtain current measures of temperature and future predicted estimates all at a 3-hourly interval going up to 5-days.
I also have actual observed 1-hourly mean temperature time series data drawn from an actual weather station from the same city.
I have matched the data according via date and time which has the actual observed data for temperature from the weather station, and what it was predicted to be in 3hr, 6hrs, 9hrs going up to 5-days from the API (see link to a slimmer version of the dataset).
https://drive.google.com/file/d/1zuQ9ympJV0H5Nf75tHLBjNogadhbo1Ip/view?usp=sharing
I have used the Mean Absolute Percentage Errors to perform such calculations for making the comparisons, and Taylor Diagram.. However, I think it is too simplistic of an approach and it would not allow me to make examination for any seasonal similarities.
Any help, advice on the proper approach would be well-received.
Anwar.