Hi all,
I have a question on how to use/interpret Gumbel's law. Suppose I have 6 years of data on total yearly rainfall on a particular region, from 3 sites (mm/year). Say
Site 1: 541, 781, 978, 1421, 1017, 1267
Site 2: 1169, 1283, 1209, 1211, 761, 1076
Site 3: 998, 1219, 630, 798, 662, 989
Recall these are yearly values, not daily or monthly maxima. If I group the data per site, I get a sample of maximum values: x1 = {1421, 1283, 1219}. But if I group the data per year I get a different sample: x2 = {1169, 1283, 1209, 1421, 1017, 1267}. The two samples yield different Gumbel parameters y = 1/0.78 sd(x) * [max(x) - mean(x) + 0.45*sd(x)], namely
y1 = 1.984392
y2 = 2.422774
These lead to different probability and recurrence periods:
y1: Prob[ yearly rainfall in region > 1421] = 0.128 (1 in ~8 years)
y2: Prob[ yearly rainfall in region > 1421] = 0.0849 (1 in ~12 years)
Which one should I use for prediction of future rainfall, or are they both correct and it's just a matter of interpretation of the result? You can assume all data points are drawn from the same underlying normal distribution (which they actually are, with mean 1000, sd 200).
Thanks!