Important remarks on the article by I. Blinova and F.-M.-Chmieleski; J. Eur. Orc. 45 (2-4): 255

23 January 2014

Here are a few remarks on the article „Does climate influence the variation in traits of terrestrial orchids (Orchidaceae) symmetrically in various functional groups across European latitudes?“ (by I. Blinova and F.-M.-Chmieleski; J. Eur. Orc. 45 (2-4): 255 – 284. 2013)

The article itself is interesting (its thesis: intraspecific variation follows a parabolic curve within a species' range, with its maximum at the ecological optimum), but it contains some severe errors, and both the statistical evaluation and the conclusions drawn from it are obscure.

The only equation on page 260 is wrong. The x-coordinate of the extremum of a parabola is -b/(2a), whether it is a maximum or a minimum: the first derivative of y = a*x**2 + b*x + c, namely 2*a*x + b, must equal zero (** denotes exponentiation, so x**2 is x squared). (The authors claim that it is -b/(2a) for the maximum and b/(2a) for the minimum.) Whether the extremum is a maximum or a minimum is determined only by the sign of „a“: the second derivative, 2*a, is negative for a maximum and positive for a minimum. See, e.g., http://de.wikipedia.org/wiki/Extrempunkt or recall curve sketching from school.
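The point can be checked numerically with a minimal sketch. The function name `parabola_extremum` and the example coefficients are my own choices for illustration; they do not come from the article.

```python
def parabola_extremum(a, b, c):
    """Return (x, y, kind) for the extremum of y = a*x**2 + b*x + c."""
    if a == 0:
        raise ValueError("a must be non-zero; otherwise the curve is a straight line")
    x = -b / (2 * a)                 # root of the first derivative 2*a*x + b
    y = a * x**2 + b * x + c
    kind = "minimum" if a > 0 else "maximum"   # sign of the second derivative 2*a
    return x, y, kind


# Illustrative coefficients (assumptions, not data from the article)
print(parabola_extremum(1, -4, 3))    # (2.0, -1.0, 'minimum')
print(parabola_extremum(-1, 4, -3))   # (2.0, 1.0, 'maximum')
```

Both parabolas have their extremum at x = -b/(2a) = 2; only the sign of a decides which kind of extremum it is.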

Almost all references to the Tables are wrong (e.g. on page 261, „in Table 3“ should be „in Table 4“; „Table 4, 5 and 6“ on page 262 should be „Table 5, 6 and 7“; „Tables 5 and 6“ on page 281 should be „Tables 5 and 7“; „Table 6“ in the legend of Figure 13 should be „Table 7“; and so on).

„R**2“ is called „Correlation“ (see the legend of Table 5), but it is nowhere explained whether R**2 is the coefficient of determination of a „linear regression“ [in which case the value R**2 = 0.99 (i.e. R = 0.995) in Table 5 seems implausibly high], of a „parabolic regression“ (in which case the term „correlation“ would be wrong), or of both (depending on the context).

The most serious problem is that multiplicity is not taken into account (especially in Table 5 and Table 7). Table 5 contains 280 simultaneous significance tests (8 meteorological parameters, 7 species, 5 traits). Even if there is no signal at all (i.e., H0 is true and R**2 = 0 in every case), the probability of getting one or more significant results is 1-(1-p)**280 = 99.99994%, assuming that the ‘tests’ are approximately statistically independent. Here p is the selected significance level, i.e. the threshold with which each p-value is compared; p = 0.05 in the article. The probability that more than k tests are significant (without any signal) can be derived easily from the binomial distribution. When performing such multiple tests, one should use adapted tests, or the significance level has to be reduced drastically (see „Holm’s Sequentially-Rejective Bonferroni Method“ or the „Simple Bonferroni Method“, e.g., in Shaffer, J. P. (1995): Multiple Hypothesis Testing. Annu. Rev. Psychol. Vol. 46, 561-584).
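As a rough illustration, here is a minimal sketch of the family-wise error rate and of the two corrections mentioned above. It assumes independent tests; the function names and the four example p-values are hypothetical, since the article does not report its p-values.

```python
def familywise_error(m, alpha):
    """Probability of at least one false positive among m independent tests."""
    return 1.0 - (1.0 - alpha) ** m


def bonferroni_reject(p_values, alpha=0.05):
    """Simple Bonferroni: compare every p-value with alpha / m."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def holm_reject(p_values, alpha=0.05):
    """Holm's sequentially rejective method (step-down Bonferroni)."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices, smallest p first
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (k = rank + 1) is compared with alpha / (m - k + 1).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # stop at the first non-rejection; all larger p-values are retained
    return reject


# 280 tests at level 0.05, as in Table 5 of the article
print(familywise_error(280, 0.05))

# Hypothetical p-values, for illustration only
example = [0.001, 0.04, 0.03, 0.20]
print(bonferroni_reject(example))  # [True, False, False, False]
print(holm_reject(example))        # [True, False, False, False]
```

In this made-up example three of the four p-values are below 0.05, but only one survives either correction.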

Because the specific p-values are not shown in Tables 5 and 7, it is impossible to recalculate „adjusted significance levels“, and all significant (bold) R**2 values could have arisen by pure chance.

Hence all conclusions from these uncertain results are very doubtful.


Alexander Lerchl

Very interesting comments! Did you consider contacting the Journal? Or did you think about writing a comment in PubMed (PubMed Commons)?

Klaus Blümel

I have no plans concerning this specific article. But since I know so many articles, especially in biometeorology and phenology, that neglect the multiplicity problem, I am considering writing an article about the "danger of not taking statistical multiplicity into account". Not only "simple multiple test scenarios" such as the one in the above article are affected by this problem; "stepwise multiple regression" is affected even more.

By the way: using only the first 6 groups from Table 5 of the above article, the probability that 12 or more “R**2” values are significant (12 is the number that are significant in this table within the first 6 groups) is about 32%, even if there is no “Correlation” at all! (There are n=210 tests, p=0.05, q=1-0.05=0.95, mu=n*p=10.5, sigma=sqrt(n*p*q)=3.16; the normal approximation N(mu, sigma) of the binomial distribution then gives about 32% for the probability that k>=12, where k is the number of falsely significant results.)
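For anyone who wants to redo this calculation, here is a minimal sketch using only the Python standard library. It assumes independent tests, the function names are my own, and the exact binomial tail is included only for comparison with the normal approximation used above.

```python
from math import comb, erf, sqrt

n, p = 210, 0.05            # number of tests and significance level
mu = n * p                  # expected number of false positives: 10.5
sigma = sqrt(n * p * (1 - p))


def normal_tail(x, mean, sd):
    """P(X >= x) for X ~ N(mean, sd), computed via the error function."""
    return 0.5 * (1.0 - erf((x - mean) / (sd * sqrt(2.0))))


def binomial_tail(k, trials, prob):
    """Exact P(X >= k) for X ~ Binomial(trials, prob)."""
    return sum(
        comb(trials, i) * prob**i * (1 - prob) ** (trials - i)
        for i in range(k, trials + 1)
    )


approx = normal_tail(12, mu, sigma)        # approximation as used in the text
approx_cc = normal_tail(11.5, mu, sigma)   # same, with continuity correction
exact = binomial_tail(12, n, p)            # exact binomial probability

print(f"sigma = {sigma:.2f}")
print(f"normal approximation:       {approx:.3f}")
print(f"with continuity correction: {approx_cc:.3f}")
print(f"exact binomial:             {exact:.3f}")
```

The normal approximation without continuity correction reproduces the figure of about 32%. The other two values may differ somewhat, but the conclusion is unaffected: a dozen "significant" results out of 210 tests is nothing unusual under H0.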

I only corrected "derivative" twice in the "question" (before, I had written "derivation", which is not the commonly used term in mathematics).
